Article

Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system function

University of Alabama at Birmingham, Birmingham, Alabama, United States
Nature Genetics (Impact Factor: 29.65). 04/2005; 37(3):233-42. DOI: 10.1038/ng1518
Source: PubMed

ABSTRACT Patterns of gene expression in the central nervous system are highly variable and heritable. This genetic variation among normal individuals leads to considerable structural, functional and behavioral differences. We devised a general approach to dissect genetic networks systematically across biological scale, from base pairs to behavior, using a reference population of recombinant inbred strains. We profiled gene expression using Affymetrix oligonucleotide arrays in the BXD recombinant inbred strains, for which we have extensive SNP and haplotype data. We integrated a complementary database comprising 25 years of legacy phenotypic data on these strains. Covariance among gene expression and pharmacological and behavioral traits is often highly significant, corroborates known functional relations and is often generated by common quantitative trait loci. We found that a small number of major-effect quantitative trait loci jointly modulated large sets of transcripts and classical neural phenotypes in patterns specific to each tissue. We developed new analytic and graph theoretical approaches to study shared genetic modulation of networks of traits using gene sets involved in neural synapse function as an example. We built these tools into an open web resource called WebQTL that can be used to test a broad array of hypotheses.

Download full-text

Full-text

Available from: Nicole Elizabeth Baldwin, Jun 17, 2015
0 Followers
 · 
120 Views
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Biologists hope to address grand scientific challenges by exploring the abundance of data made available through modern microarray technology and other high-throughput techniques. The impact of this data, however, is limited unless researchers can effectively assimilate such complex information and integrate it into their daily research; interactive visualization tools are called for to support the effort. Specifically, typical studies of gene co-expression require novel visualization tools that enable the dynamic formulation and fine-tuning of hypotheses to aid the process of evaluating sensitivity of key parameters. These tools should allow biologists to develop an intuitive understanding of the structure of biological networks and discover genes residing in critical positions in networks and pathways. By using a graph as a universal representation of correlation in gene expression, our system employs several techniques that when used in an integrated manner provide innovative analytical capabilities. Our tool for interacting with gene co-expression data integrates techniques such as: graph layout, qualitative subgraph extraction through a novel 2D user interface, quantitative subgraph extraction using graph-theoretic algorithms or by compound queries, dynamic level-of-detail abstraction, and template-based fuzzy classification. We demonstrate our system using a real-world workflow from a large-scale, systems genetics study of mammalian gene co-expression.
    IEEE Transactions on Visualization and Computer Graphics 09/2008; 14(5):1081-94. DOI:10.1109/TVCG.2008.61 · 1.92 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Microarray experiments typically analyze thousands to tens of thousands of genes from small numbers of biological replicates. The fact that genes are normally expressed in functionally relevant patterns suggests that gene-expression data can be stratified and clustered into relatively homogenous groups. Cluster-wise dimensionality reduction should make it feasible to improve screening power while minimizing information loss. We propose a powerful and computationally simple method for finding differentially expressed genes in small microarray experiments. The method incorporates a novel stratification-based tight clustering algorithm, principal component analysis and information pooling. Comprehensive simulations show that our method is substantially more powerful than the popular SAM and eBayes approaches. We applied the method to three real microarray datasets: one from a Populus nitrogen stress experiment with 3 biological replicates; and two from public microarray datasets of human cancers with 10 to 40 biological replicates. In all three analyses, our method proved more robust than the popular alternatives for identification of differentially expressed genes. The C++ code to implement the proposed method is available upon request for academic use.
    Bioinformatics 08/2008; 24(14):1583-9. DOI:10.1093/bioinformatics/btn215 · 4.62 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Searches for the identity of genes that influence the levels of alcohol consumption by humans and other animals have often been driven by presupposition of the importance of particular gene products in determining positively or negatively reinforcing effects of ethanol. We have taken an unbiased approach and performed a meta-analysis across three types of mouse populations to correlate brain gene expression with levels of alcohol intake. Our studies, using filtering procedures based on QTL analysis, produced a list of eight candidate genes with highly heritable expression, which could explain a significant amount of the variance in alcohol preference in mice. Using the Allen Brain Atlas for gene expression, we noted that the candidate genes' expression was localized to the olfactory and limbic areas as well as to the orbitofrontal cortex. Informatics techniques and pathway analysis illustrated the role of the candidate genes in neuronal migration, differentiation, and synaptic remodeling. The importance of olfactory cues, learning and memory formation (Pavlovian conditioning), and cortical executive function, for regulating alcohol intake by animals (including humans), is discussed.
    Mammalian Genome 07/2008; 19(5):352-65. DOI:10.1007/s00335-008-9115-z · 2.88 Impact Factor