Transcriptome and genome sequencing uncovers functional variation in humans

[1] Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland [2] Institute for Genetics and Genomics in Geneva (iG3), University of Geneva, 1211 Geneva, Switzerland [3] Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland.
Nature (Impact Factor: 41.46). 09/2013; 501(7468). DOI: 10.1038/nature12531
Source: PubMed


Genome sequencing projects are discovering millions of genetic variants in humans, and interpretation of their functional effects is essential for understanding the genetic basis of variation in human traits. Here we report sequencing and deep analysis of messenger RNA and microRNA from lymphoblastoid cell lines of 462 individuals from the 1000 Genomes Project, the first uniformly processed high-throughput RNA-sequencing data from multiple human populations with high-quality genome sequences. We discover extremely widespread genetic variation affecting the regulation of most genes, with transcript structure and expression level variation being equally common but genetically largely independent. Our characterization of causal regulatory variation sheds light on the cellular mechanisms of regulatory and loss-of-function variation, and allows us to infer putative causal variants for dozens of disease-associated loci. Altogether, this study provides a deep understanding of the cellular mechanisms of transcriptome variation and of the landscape of functional variants in the human genome.


Available from: Xavier Estivill
  • Source
    • "Understanding how this control system produces embryonic structures is a key to the mechanistic understanding of developmental processes (De Robertis, 2008; Shubin et al., 2009). The availability of developmental quantitative transcriptomes had greatly improved the ability to study the relation between developmental gene expression programs and the morphology of embryonic structures they generate (e.g., Brown et al., 2014; Gong et al., 2013; Lappalainen et al., 2013; Nodine and Bartel, 2012; Tu et al., 2014, 2012). Studying the changing landscape of gene expression through embryogenesis can shed light on the relationship between the increasing complexity of embryonic morphology and the complexity of expressed transcripts. "
    ABSTRACT: Embryonic development progresses through the timely activation of thousands of differentially activated genes. Quantitative developmental transcriptomes provide the means to relate global patterns of differentially expressed genes to the emerging body plans they generate. The sea urchin is one of the classic model systems for embryogenesis, and the models of its developmental gene regulatory networks are among the most comprehensive of their kind. Thus, the sea urchin embryo is an excellent system for studies of its global developmental transcriptional profiles. Here we produced quantitative developmental transcriptomes of the sea urchin Paracentrotus lividus (P. lividus) at seven developmental stages from the fertilized egg to prism stage. We generated a de novo reference transcriptome and identified 29,817 genes that are expressed during this period. We annotated and quantified gene expression at the different developmental stages and confirmed the reliability of the expression profiles by QPCR measurement of a subset of genes. The progression of embryo development is reflected in the observed global expression patterns and in our principal component analysis. Our study illuminates the rich patterns of gene expression that participate in sea urchin embryogenesis and provides an essential resource for further studies of the dynamic expression of P. lividus genes.
    Full-text · Article · Dec 2015 · Marine Genomics
  • Source
    • "To explore whether genotypes of SNPs influence expression, in silico analysis was performed using expression profile of data set series GSE6536 (Gene Expression Omnibus; Stranger et al., 2007), 1000 Genomes database (Abecasis et al., 2012) and Geuvadis RNA sequencing database (Lappalainen et al., 2013). P-values were calculated comparing mean of expression values of genes having different genotypes by "
    ABSTRACT: Oral cancer is usually preceded by pre-cancerous lesions and is related to tobacco abuse. Tobacco carcinogens damage DNA, and cells harboring such damaged DNA normally undergo apoptotic death, but cancer cells are exceptionally resistant to apoptosis. Here we studied the association between sequence and expression variations in apoptotic pathway genes and the risk of oral cancer and precancer. Ninety-nine tagSNPs in 23 genes, involved in mitochondrial and non-mitochondrial apoptotic pathways, were genotyped in 525 cancer and 253 leukoplakia patients and 538 healthy controls using the Illumina Golden Gate assay. Six SNPs (rs1473418 at BCL2; rs1950252 at BCL2L2; rs8190315 at BID; rs511044 at CASP1; rs2227310 at CASP7 and rs13010627 at CASP10) significantly modified the risk of oral cancer, but only SNPs at BCL2, CASP1 and CASP10 modulated the risk of leukoplakia. Combinations of SNPs showed a steep increase in the risk of cancer with an increase in the "effective" number of risk alleles. In silico analysis of published data sets and our unpublished RNAseq data suggests that changes in the expression of BID and CASP7 may have affected the risk of cancer. In conclusion, three SNPs, rs1473418 in BCL2, rs1950252 in BCL2L2 and rs511044 in CASP1, are implicated for the first time in oral cancer. Since SNPs at BCL2, CASP1 and CASP10 modulated the risk of both leukoplakia and cancer, they should be studied in more detail as possible biomarkers of the transition from leukoplakia to cancer. This study also implies the importance of mitochondrial apoptotic pathway genes (such as BCL2) in the progression of leukoplakia to oral cancer.
    Full-text · Article · Sep 2015 · Mitochondrion
  • Source
    • "These profiles were normalized, averaged, smoothed, and centered on the exon midpoint. To investigate the impact of intronic structural variants on nucleosome localization (Figure 4C), we used the MNase-seq data above and corresponding sQTL (Lappalainen et al., 2013) and genotype data (Abecasis et al., 2012). We then compiled the MNase profiles of individuals with genotypes representing shorter and longer upstream introns. "
    ABSTRACT: Mammalian genes are composed of exons, but the evolutionary origins and functions of new internal exons are poorly understood. Here, we analyzed patterns of exon gain using deep cDNA sequencing data from five mammals and one bird, identifying thousands of species- and lineage-specific exons. Most new exons derived from unique rather than repetitive intronic sequence. Unlike exons conserved across mammals, species-specific internal exons were mostly located in 5' UTRs and alternatively spliced. They were associated with upstream intronic deletions, increased nucleosome occupancy, and RNA polymerase II pausing. Genes containing new internal exons had increased gene expression, but only in tissues in which the exon was included. Increased expression correlated with the level of exon inclusion, promoter proximity, and signatures of cotranscriptional splicing. Altogether, these findings suggest that increased splicing at the 5' ends of genes enhances expression and that changes in 5' end splicing alter gene expression between tissues and between species.
    Full-text · Article · Mar 2015 · Cell Reports