Article

Choosing the right path: enhancement of biologically relevant sets of genes or proteins using pathway structure.

Environmental Systems Biology Group, Laboratory of Molecular Toxicology, National Institute of Environmental Health Sciences, RTP, NC 27709, USA.
Genome biology (impact factor: 6.63). 05/2009; 10(4):R44. DOI:10.1186/gb-2009-10-4-r44
Source: PubMed

ABSTRACT A method is proposed that finds enriched pathways relevant to a studied condition using the measured molecular data and also the structural information of the pathway viewed as a network of nodes and edges. Tests are performed using simulated data and genomic data sets and the method is compared to two existing approaches. The analysis provided demonstrates the method proposed is very competitive with the current approaches and also provides biologically relevant results.

0 0
 · 
0 Bookmarks
 · 
45 Views
  • Source
    Article: Cluster analysis and display of genome-wide expression patterns.
    [show abstract] [hide abstract]
    ABSTRACT: A system of cluster analysis for genome-wide expression data from DNA microarray hybridization is described that uses standard statistical algorithms to arrange genes according to similarity in pattern of gene expression. The output is displayed graphically, conveying the clustering and the underlying expression data simultaneously in a form intuitive for biologists. We have found in the budding yeast Saccharomyces cerevisiae that clustering gene expression data groups together efficiently genes of known similar function, and we find a similar tendency in human data. Thus patterns seen in genome-wide expression experiments can be interpreted as indications of the status of cellular processes. Also, coexpression of genes of known function with poorly characterized or novel genes may provide a simple means of gaining leads to the functions of many genes for which information is not available currently.
    Proceedings of the National Academy of Sciences 01/1999; 95(25):14863-8. · 9.68 Impact Factor
  • Article: Analysis of gene expression data using self-organizing maps.
    [show abstract] [hide abstract]
    ABSTRACT: DNA microarray technologies together with rapidly increasing genomic sequence information is leading to an explosion in available gene expression data. Currently there is a great need for efficient methods to analyze and visualize these massive data sets. A self-organizing map (SOM) is an unsupervised neural network learning algorithm which has been successfully used for the analysis and organization of large data files. We have here applied the SOM algorithm to analyze published data of yeast gene expression and show that SOM is an excellent tool for the analysis and visualization of gene expression profiles.
    FEBS Letters 06/1999; 451(2):142-6. · 3.54 Impact Factor
  • Article: Analyzing gene expression data in terms of gene sets: methodological issues.
    [show abstract] [hide abstract]
    ABSTRACT: MOTIVATION: Many statistical tests have been proposed in recent years for analyzing gene expression data in terms of gene sets, usually from Gene Ontology. These methods are based on widely different methodological assumptions. Some approaches test differential expression of each gene set against differential expression of the rest of the genes, whereas others test each gene set on its own. Also, some methods are based on a model in which the genes are the sampling units, whereas others treat the subjects as the sampling units. This article aims to clarify the assumptions behind different approaches and to indicate a preferential methodology of gene set testing. RESULTS: We identify some crucial assumptions which are needed by the majority of methods. P-values derived from methods that use a model which takes the genes as the sampling unit are easily misinterpreted, as they are based on a statistical model that does not resemble the biological experiment actually performed. Furthermore, because these models are based on a crucial and unrealistic independence assumption between genes, the P-values derived from such methods can be wildly anti-conservative, as a simulation experiment shows. We also argue that methods that competitively test each gene set against the rest of the genes create an unnecessary rift between single gene testing and gene set testing.
    Bioinformatics 05/2007; 23(8):980-7. · 5.47 Impact Factor

Full-text (3 Sources)

View
4 Downloads
Available from
6 Nov 2012

Keywords

biologically relevant results
 
current approaches
 
finds enriched pathways relevant
 
measured molecular data
 
structural information
 
Tests