Alexander Rüegg’s research while affiliated with Bielefeld University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (16)


Ontology based text indexing and querying for the semantic web
  • Article

December 2006

·

312 Reads

·

100 Citations

Knowledge-Based Systems

Jacob Köhler

·

·

Michael Specht

·

Alexander Rüegg

This publication shows how the gap between the HTML based internet and the RDF based vision of the semantic web might be bridged, by linking words in texts to concepts of ontologies. Most current search engines use indexes that are built at the syntactical level and return hits based on simple string comparisons. However, the indexes do not contain synonyms, cannot differentiate between homonyms (‘mouse’ as a pointing vs. ‘mouse’ as an animal) and users receive different search results when they use different conjugation forms of the same word. In this publication, we present a system that uses ontologies and Natural Language Processing techniques to index texts, and thus supports word sense disambiguation and the retrieval of texts that contain equivalent words, by indexing them to concepts of ontologies.For this purpose, we developed fully automated methods for mapping equivalent concepts of imported RDF ontologies (for this prototype WordNet, SUMO and OpenCyc). These methods will thus allow the seamless integration of domain specific ontologies for concept based information retrieval in different domains.To demonstrate the practical workability of this approach, a set of web pages that contain synonyms and homonyms were indexed and can be queried via a search engine like query frontend. However, the ontology based indexing approach can also be used for other data mining applications such text clustering, relation mining and for searching free text fields in biological databases. The ontology alignment methods and some of the text mining principles described in this publication are now incorporated into the ONDEX system http://ondex.sourceforge.net/.


Figure 1: Overview of the ONDEX architecture and its components  
Figure 2: System architecture of the graph analysis and visualisation components of ONDEX.  
Figure 3: The ONDEX front-end can visualise, filter and analyse microarray results in the context of hundreds of thousands of concepts integrated from several heterogeneous databases (AraCyc, KEGG, Transfac, Transpath and DRASTIC). Concepts from several concept classes (transcription factors proteins and genes) are highlighted according to their expression level in the microarray experiment. The layout was generated using the FastCircularLayout  
Figure 4: (a) The lignin pathway displayed in the graph analysis component showing only elements where the genes are up-or down-regulated. This figure shows that the four genes At1g67990, At1g09500, At1g72680 and At2g33590 are differentially expressed. They encode several proteins and enzymes which participate in the lignin biosynthesis pathway and fall into three enzyme classes (2.1.1.104, 1.1.1.195 and 1.2.1.44). Data from the DRASTIC database shows that these genes respond to different types of stress (ABA, sodium chloride, drought and wound). (b) Even though it was not differentially expressed on the microarray chip, according to the TRANSFAC database, rd29A (At5g52310) is known to be regulated by a transcription factor which was also differentially expressed in the microarray experiment. This gene is further annotated with 12 different stress types in the DRASTIC database (bottom left, blue rectangular boxes). These 12 stress types are also known to affect the expression of 120 other genes that were also differentially expressed in the analysed microarray experiment.  
Graph-based analysis and visualization of experimental results with ONDEX
  • Article
  • Full-text available

July 2006

·

800 Reads

·

200 Citations

Bioinformatics

Jacob Köhler

·

Jan Baumbach

·

·

[...]

·

Motivation: Assembling the relevant information needed to interpret the output from high-throughput, genome scale, experiments such as gene expression microarrays is challenging. Analysis reveals genes that show statistically significant changes in expression levels, but more information is needed to determine their biological relevance. The challenge is to bring these genes together with biological information distributed across hundreds of databases or buried in the scientific literature (millions of articles). Software tools are needed to automate this task which at present is labor-intensive and requires considerable informatics and biological expertise. Results: This article describes ONDEX and how it can be applied to the task of interpreting gene expression results. ONDEX is a database system that combines the features of semantic database integration and text mining with methods for graph-based analysis. An overview of the ONDEX system is presented, concentrating on recently developed features for graph-based analysis and visualization. A case study is used to show how ONDEX can help to identify causal relationships between stress response genes and metabolic pathways from gene expression data. ONDEX also discovered functional annotations for most of the genes that emerged as significant in the microarray experiment, but were previously of unknown function.

Download

Graph-based analysis and visualization of experimental results with ONDEX

June 2006

·

49 Reads

·

109 Citations

Bioinformatics

Motivation: Assembling the relevant information needed to interpret the output from high-throughput, genome scale, experiments such as gene expression microarrays is challenging. Analysis reveals genes that show statistically significant changes in expression levels, but more information is needed to determine their biological relevance. The challenge is to bring these genes together with biological information distributed across hundreds of databases or buried in the scientific literature (millions of articles). Software tools are needed to automate this task which at present is labor-intensive and requires considerable informatics and biological expertise. Results: This article describes ONDEX and how it can be applied to the task of interpreting gene expression results. ONDEX is a database system that combines the features of semantic database integration and text mining with methods for graph-based analysis. An overview of the ONDEX system is presented, concentrating on recently developed features for graph-based analysis and visualization. A case study is used to show how ONDEX can help to identify causal relationships between stress response genes and metabolic pathways from gene expression data. ONDEX also discovered functional annotations for most of the genes that emerged as significant in the microarray experiment, but were previously of unknown function.









Citations (7)


... However, the question is how to formulate these definitions in line with information systems aligned with modern digital environments (KÖHLER et al., 2006). The difficulty in creating definitions in ontologies is related to the fact that, besides the complexity and cost of the task, it is susceptible to errors and needs to be performed by trained people (TSATSARONIS et al., 2013). ...

Reference:

COMMUNICATION AMONG MEDICAL INFORMATION SYSTEMS principles for elaborating definitions
Quality control for terms and definitions in ontologies and taxonomies

BMC Bioinformatics

... However, its effectiveness in library systems remains inconclusive, with studies suggesting that while tags aid browsing, they lack the specificity of controlled vocabularies (Rolla, 2009;Pirmann, 2012). Ontologybased indexing enhances retrieval accuracy by linking text to structured semantic concepts, addressing limitations of traditional keyword-based indexing (Köhler et al., 2006). Hybrid models integrating these approaches are increasingly advocated. ...

Ontology based text indexing and querying for the semantic web
  • Citing Article
  • December 2006

Knowledge-Based Systems

... A tool, Visual Object Net++ (VON++) is employed for modelling and simulation of the HPNs. Other papers [35,36] propose a framework for the integration of information extracted from different biology databases, aimed at easing the development and the execution of modelling and simulation tools. More specifically, the authors propose an environment to extract data from the main biology databases and automatically translate them into Petri net models. ...

The Biology Petri Net Markup Language
  • Citing Conference Paper
  • January 2002

... With these issues related to concepts, meaning, and context in using ontologies, it is problematic for ontologies to measure or characterize changes knowledge without referring to context. Some have claimed computational approaches such as text mining, can help provide clarity on the knowledge about a concept by limiting the potential meanings of a concept within a text by using text and textual content to link concepts, contexts, and knowledge to ontologies, (Koehler et al., 2005;Mortensen et al., 2015). ...

Linking experimental results, biological networks and sequence analysis methods using Ontologies and Generalised Data Structures
  • Citing Article
  • January 2004

In Silico Biology

... The data it analyses can also be modeled, stored and retrieved. In many contexts, identifying, visualizing and analyzing links between items are referred to as graph analysis, a process Kohler et al. 84 identified as supported by Ondex, which maps and automatically links data from various heterogeneous sources, unlike other graph-based systems. ...

Graph-based analysis and visualization of experimental results with ONDEX
  • Citing Article
  • June 2006

Bioinformatics

... In fact, these methods are not adjusted to the conditions of use. The next stage in the development of methods for relationship extraction is classical machine-learning methods: SVM and its modifications [28,[38][39][40], Naive Bayes Classifier [41], hidden Markov model [42,43] with subsequent transition to neural networks: fully connected [44], recurrent [45,46], convolutional [47,48]. The paper [49] provides a comprehensive overview of methods applicable to the task of named entity and relationship extraction, and drug-drug interaction in particular. ...

Extraction of biological networks from scientific literature
  • Citing Article
  • October 2005

Briefings in Bioinformatics

... The PAST4 software (version 4.15) [32] was used to calculate the Shannon-H and Equitability microbial diversity indices. Graphia 2.0 software was used for correlation network analysis of OTUs, with Pearson's correlation coefficients above 0.95 [33], clustered by Markov Cluster Algorithm (MCL, granularity 1.1). The taxa of known functional microbial groups, such as mycorrhiza, diazotrophs, nitrifiers and denitrifiers, were grouped to represent the effect of the treatments on soil functional microbial groups. ...

Graph-based analysis and visualization of experimental results with ONDEX

Bioinformatics