Conservation of core gene expression in vertebrate tissues

Department of Molecular Genetics, University of Toronto, 160 College Street, Toronto, Ontario, Canada.
Journal of Biology 05/2009; 8(3):33. DOI: 10.1186/jbiol130
Source: PubMed

ABSTRACT Vertebrates share the same general body plan and organs, possess related sets of genes, and rely on similar physiological mechanisms, yet show great diversity in morphology, habitat and behavior. Alteration of gene regulation is thought to be a major mechanism in phenotypic variation and evolution, but relatively little is known about the broad patterns of conservation in gene expression in non-mammalian vertebrates.
We measured expression of all known and predicted genes across twenty tissues in chicken, frog and pufferfish. By combining the results with human and mouse data and considering only ten common tissues, we have found evidence of conserved expression for more than a third of unique orthologous genes. We find that, on average, transcription factor gene expression is neither more nor less conserved than that of other genes. Strikingly, conservation of expression correlates poorly with the amount of conserved nonexonic sequence, even using a sequence alignment technique that accounts for non-collinearity in conserved elements. Many genes show conserved human/fish expression despite having almost no nonexonic conserved primary sequence.
There are clearly strong evolutionary constraints on tissue-specific gene expression. A major challenge will be to understand the precise mechanisms by which many gene expression patterns remain similar despite extensive cis-regulatory restructuring.

Download full-text


Available from: Andrew Wilde, Jul 01, 2015
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: In vitro studies on hexavalent chromium [Cr(VI)] indicate that reduced forms of this metal can interact with DNA and cause mutations. Recently, Cr(VI) was shown to induce intestinal tumors in mice; however, Cr(VI) elicited redox changes, cytotoxicity and hyperplasia - suggesting involvement of tissue injury rather than direct mutagenesis. Moreover, toxicogenomic analyses indicated limited evidence for DNA damage responses. Herein, we extend these toxicogenomic analyses by comparing the gene expression patterns elicited by Cr(VI) with those of four mutagenic and four nonmutagenic carcinogens. To date, toxicogenomic profiles for mutagenic and nonmutagenic duodenal carcinogens do not exist, thus duodenal gene changes in mice were compared to those elicited by hepatocarcinogens. Specifically, duodenal gene changes in mice following exposure to Cr(VI) in drinking water were compared to hepatic gene changes previously identified as potentially discriminating mutagenic and nonmutagenic hepatocarcinogens. Using multivariate statistical analyses (including logistic regression classification), the Cr(VI) gene responses clustered apart from mutagenic carcinogens and closely with nonmutagenic carcinogens. These findings are consistent with other intestinal data supporting a nonmutagenic mode of action (MOA). These findings may be useful as part of a full weight of evidence MOA evaluation for Cr(VI)-induced intestinal carcinogenesis. Limitations to this analysis will also be discussed.
    Regulatory Toxicology and Pharmacology 06/2012; 64(1):68-76. DOI:10.1016/j.yrtph.2012.05.019 · 2.14 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Motivation: Comparative analyses of gene expression data from different species have become an important component of the study of molecular evolution. Thus methods are needed to estimate evolutionary distances between expression profiles, as well as a neutral reference to estimate selective pressure. Divergence between expression profiles of homologous genes is often calculated with Pearson's or Euclidean distance. Neutral divergence is usually inferred from randomized data. Despite being widely used, neither of these two steps has been well studied. Here, we analyze these methods formally and on real data, highlight their limitations and propose improvements. Results: It has been demonstrated that Pearson's distance, in contrast to Euclidean distance, leads to underestimation of the expression similarity between homologous genes with a conserved uniform pattern of expression. Here, we first extend this study to genes with conserved, but specific pattern of expression. Surprisingly, we find that both Pearson's and Euclidean distances used as a measure of expression similarity between genes depend on the expression specificity of those genes. We also show that the Euclidean distance depends strongly on data normalization. Next, we show that the randomization procedure that is widely used to estimate the rate of neutral evolution is biased when broadly expressed genes are abundant in the data. To overcome this problem, we propose a novel randomization procedure that is unbiased with respect to expression profiles present in the datasets. Applying our method to the mouse and human gene expression data suggests significant gene expression conservation between these species. Contact:; Supplementary information: Supplementary data are available at Bioinformatics online.
    Bioinformatics 05/2012; 28(14):1865-72. DOI:10.1093/bioinformatics/bts266 · 4.62 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: We developed PolyA-seq, a strand-specific and quantitative method for high-throughput sequencing of 3' ends of polyadenylated transcripts, and used it to globally map polyadenylation (polyA) sites in 24 matched tissues in human, rhesus, dog, mouse, and rat. We show that PolyA-seq is as accurate as existing RNA sequencing (RNA-seq) approaches for digital gene expression (DGE), enabling simultaneous mapping of polyA sites and quantitative measurement of their usage. In human, we confirmed 158,533 known sites and discovered 280,857 novel sites (FDR < 2.5%). On average 10% of novel human sites were also detected in matched tissues in other species. Most novel sites represent uncharacterized alternative polyA events and extensions of known transcripts in human and mouse, but primarily delineate novel transcripts in the other three species. A total of 69.1% of known human genes that we detected have multiple polyA sites in their 3'UTRs, with 49.3% having three or more. We also detected polyadenylation of noncoding and antisense transcripts, including constitutive and tissue-specific primary microRNAs. The canonical polyA signal was strongly enriched and positionally conserved in all species. In general, usage of polyA sites is more similar within the same tissues across different species than within a species. These quantitative maps of polyA usage in evolutionarily and functionally related samples constitute a resource for understanding the regulatory mechanisms underlying alternative polyadenylation.
    Genome Research 03/2012; 22(6):1173-83. DOI:10.1101/gr.132563.111 · 13.85 Impact Factor