Article

Automated multidimensional phenotypic profiling using large public microarray repositories.

Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA.
Proceedings of the National Academy of Sciences (impact factor: 9.68). 07/2009; 106(30):12323-8. DOI:10.1073/pnas.0900883106 pp.12323-8
Source: PubMed

ABSTRACT Phenotypes are complex, and difficult to quantify in a high-throughput fashion. The lack of comprehensive phenotype data can prevent or distort genotype-phenotype mapping. Here, we describe "PhenoProfiler," a computational method that enables in silico phenotype profiling. Drawing on the principle that similar gene expression patterns are likely to be associated with similar phenotype patterns, PhenoProfiler supplements the missing quantitative phenotype information for a given microarray dataset based on other well-characterized microarray datasets. We applied our method to 587 human microarray datasets covering >14,000 samples, and confirmed that the predicted phenotype profiles are highly consistent with true phenotype descriptions. PhenoProfiler offers several unique capabilities: (i) automated, multidimensional phenotype profiling, facilitating the analysis and treatment design of complex diseases; (ii) the extrapolation of phenotype profiles beyond provided classes; and (iii) the detection of confounding phenotype factors that could otherwise bias biological inferences. Finally, because no direct comparisons are made between gene expression values from different datasets, the method can use the entire body of cross-platform microarray data. This work has produced a compendium of phenotype profiles for the National Center for Biotechnology Information GEO datasets, which can facilitate an unbiased understanding of the transcriptome-phenome mapping. The continued accumulation of microarray data will further increase the power of PhenoProfiler, by increasing the variety and the quality of phenotypes to be profiled.

0 0
 · 
0 Bookmarks
 · 
29 Views
  • Article: Computational approaches to phenotyping: high-throughput phenomics.
    [show abstract] [hide abstract]
    ABSTRACT: The recent completion of the Human Genome Project has made possible a high-throughput "systems approach" for accelerating the elucidation of molecular underpinnings of human diseases, and subsequent derivation of molecular-based strategies to more effectively prevent, diagnose, and treat these diseases. Although altered phenotypes are among the most reliable manifestations of altered gene functions, research using systematic analysis of phenotype relationships to study human biology is still in its infancy. This article focuses on the emerging field of high-throughput phenotyping (HTP) phenomics research, which aims to capitalize on novel high-throughput computation and informatics technology developments to derive genomewide molecular networks of genotype-phenotype associations, or "phenomic associations." The HTP phenomics research field faces the challenge of technological research and development to generate novel tools in computation and informatics that will allow researchers to amass, access, integrate, organize, and manage phenotypic databases across species and enable genomewide analysis to associate phenotypic information with genomic data at different scales of biology. Key state-of-the-art technological advancements critical for HTP phenomics research are covered in this review. In particular, we highlight the power of computational approaches to conduct large-scale phenomics studies.
    Proceedings of the American Thoracic Society 02/2007; 4(1):18-25.
  • Article: From syndrome families to functional genomics.
    Nature Reviews Genetics 08/2004; 5(7):545-51. · 38.08 Impact Factor
  • Article: Clinical importance of transforming growth factor-beta but not of tumor necrosis factor-alpha gene polymorphisms in patients with the myelodysplastic syndrome belonging to the refractory anemia subtype.
    [show abstract] [hide abstract]
    ABSTRACT: Tumor necrosis factor-alpha (TNF-alpha) and transforming growth factor-beta (TGF-beta) are cytokines that play key roles in the myelodysplastic syndrome (MDS). There have been several reports on the presence of genetic polymorphisms in the DNA sequence encoding the leader sequence of the TGF-beta1 protein, located in codon 10 in exon 1 and in the -308 promoter region of TNF-alpha. The objective of this study was to investigate the association between TNF-alpha and TGF-beta1 gene polymorphisms and the susceptibility to MDS and the progression of the disease among patients with MDS belonging to the refractory anemia (RA) subtype. The diagnosis of MDS (n = 50) was based on the FAB criteria. The TNF-alpha genotypes were analyzed by PCR-RFLP and the TGF-beta genotypes were analyzed using an amplification refractory mutation system. Compared with healthy control subjects, patients with RA showed no significant deviations in genotype or allele frequencies of TNF-alpha. The TT homozygosity at codon 10 of TGF-beta1 was significantly higher among patients with bi- or pancytopenia (severe group) than in the patients with anemia only (mild group; odds ratio = 6.99, p = 0.003). These findings suggest that the TGF-beta1 gene polymorphism in codon 10 and the -308 TNF-alpha gene polymorphism do not predispose to the development of RA, but the TGF-beta1 gene polymorphism may affect disease progression.
    Pathobiology 02/2005; 72(3):165-70. · 1.18 Impact Factor

Full-text (2 Sources)

View
1 Download
Available from
8 Mar 2013

Keywords

587 human microarray datasets
 
Biotechnology Information GEO datasets
 
cross-platform microarray data
 
different datasets
 
difficult
 
gene expression values
 
given microarray dataset
 
high-throughput fashion
 
microarray data
 
missing quantitative phenotype information
 
multidimensional phenotype profiling
 
National Center
 
profiled
 
silico phenotype profiling
 
similar gene expression patterns
 
similar phenotype patterns
 
treatment design
 
true phenotype descriptions
 
unique capabilities
 
well-characterized microarray datasets