Article

Bisociative knowledge discovery for microarray data analysis

Mozetic , I , Lavrac , N , Podpecan , V , Novak , P K , Motaln , H , Petek , M , Gruden , K , Toivonen , H & Kulovesi , K 2010 , ' Bisociative knowledge discovery for microarray data analysis ' , pp. 190-199
Source: OAI

ABSTRACT The paper presents an approach to computational knowledge discovery through the mechanism of bisociation. Bisociative reasoning is at the heart of creative, accidental discovery (e.g., serendipity), and is focused on finding unexpected links by crossing contexts. Contextu- alization and linking between highly diverse and distributed data and knowledge sources is therefore crucial for the implementation of bisocia- tive reasoning. In the paper we explore these ideas on the problem of analysis of microarray data. We show how enriched gene sets are found by using ontology information as background knowledge in semantic sub- group discovery. These genes are then contextualized by the computation of probabilistic links to diverse bioinformatics resources. Preliminary ex- periments with microarray data illustrate the approach.

0 Bookmarks
 · 
107 Views
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: With the expanding of the Semantic Web and the availability of numerous ontologies which provide domain background knowledge and semantic descriptors to the data, the amount of semantic data is rapidly growing. The data mining community is faced with a paradigm shift: instead of mining the abundance of empirical data supported by the background knowledge, the new challenge is to mine the abundance of knowledge encoded in domain ontologies, constrained by the heuristics computed from the empirical data collection. We address this challenge by an approach, named semantic data mining, where domain ontologies define the hypothesis search space, and the data is used as means of constraining and guiding the process of hypothesis search and evaluation. The use of prototype semantic data mining systems SEGS and g-SEGS is demonstrated in a simple semantic data mining scenario and in two real-life functional genomics scenarios of mining biological ontologies with the support of experimental microarray data. KeywordsSemantic data mining–ontologies–background knowledge–relational data mining
    09/2011: pages 165-178;
  • [Show abstract] [Hide abstract]
    ABSTRACT: Bisociation Network (BisoNet) is a novel approach for creative information discovery, and it can be projected to many real application domains. Bisociation of business processes onto a network is one of such applications. In this paper, we investigate business processes on the BisoNet, and develop a directed graph model to map the relations between business process flows. Based on the BisoNet model, we analyze the real-world data provided by a call service centre. The network-based statistics show that some special process steps could be key steps that greatly affect the performance of the service, and could result in a case unsolved. The network is simplified through constructing the network with shortest path of each process flow, and the simplified network may represent an optimal process pattern. This may provide a reference to the business organization for improving the quality of their service.
    International Journal of Machine Learning and Cybernetics. 07/2012;
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: We consider data mining methods on large graphs where a set of labels is associated to each vertex. A typical example concerns the social network of collaborating researchers where additional information concern the main publication targets (preferred conferences or journals) for each author. We investigate the extraction of sets of dense subgraphs such that the vertices in all subgraphs of a set share a large enough set of labels. As a first step, we consider here the special case of dense sub-graphs that are cliques. We proposed a method to compute all maximal homogeneous clique sets that satisfy user-defined constraints on the num-ber of separated cliques, on the size of the cliques, and on the number of labels shared by all the vertices. The empirical validation illustrates the scalability of our approach and it provides experimental feedback on two real datasets, more precisely an annotated social network derived from the DBLP database and an enriched biological network concerning protein-protein interactions. In both cases, we discuss the relevancy of extracted patterns thanks to available domain knowledge.
    Workshop on Analysis of Complex NEtworks (ACNE) co-located with ECML/PKDD; 09/2010

Full-text (2 Sources)

View
66 Downloads
Available from
Jun 3, 2014