Conference Paper

Information extraction and integration from heterogeneous, distributed, autonomous information sources - a federated ontology-driven query-centric approach

Dept. of Comput. Sci., Iowa State Univ., Ames, IA, USA
DOI: 10.1109/IRI.2003.1251412 Conference: Information Reuse and Integration, 2003. IRI 2003. IEEE International Conference on
Source: IEEE Xplore

ABSTRACT This paper motivates and describes the data integration component of INDUS (intelligent data understanding system) environment for data-driven information extraction and integration from heterogeneous, distributed, autonomous information sources. The design of INDUS is motivated by the requirements of applications such as scientific discovery, in which it is desirable for users to be able to access, flexibly interpret, and analyze data from diverse sources from different perspectives in different contexts. INDUS implements a federated, query-centric approach to data integration using user-specified ontologies.

  • Source
  • [Show abstract] [Hide abstract]
    ABSTRACT: Glycan biosynthesis is the process by which glycans are built in the cells. Information about the process is available from several Web accessible databases, published papers, experimental data produced at the Complex Carbohydrates Research Center (CCRC) as well as knowledge accumulated by the biologists working at CCRC. The principal purpose of GlycoVault is to support the research of glycobiologists in analyzing data about glycans, such as the changes in their abundance level over a cell's life cycle and their correlation with gene expression levels for proteins that serve as enzymes in the biosynthetic process. Traditional, bioinformatics resources have been provided by relational databases. Some new research efforts have proposed replacing relational databases with ontologies. In GlycoVault, we are utilizing both technologies to supply an integrated bioinformatics infrastructure that can serve as a resource to end users as well as other programs and applications.
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: A biological data acquisition based on web has emerged as a powerful tool for allowing scientists to interactively view entries form different databases, and to navigate from one database to another molecular-biology database links. In this paper, the biological conceptual model is constructed hybrid biological data model to represent interesting entities in the data sources to applying navigation rule property for each biological data source based on four biological data integrating layers to control biological data. When some user's requests for application service are occurred, we can get the data from database and data source via web service. In this paper, we propose a query processing model and execution structure based on integrating data layers that can search information on biological data sources.
    Journal of the Korea Society of Computer and Information. 01/2009; 14(6).