Article

AlleleSeq: analysis of allele-specific expression and binding in a network framework

Molecular Systems Biology 08/2011; 7(1). DOI:10.1038/msb.2011.54

ABSTRACT To study allele-specific expression (ASE) and binding (ASB), that is, differences between the maternally and paternally derived alleles, we have developed a computational pipeline (AlleleSeq). Our pipeline initially constructs a diploid personal genome sequence (and corresponding personalized gene annotation) using genomic sequence variants (SNPs, indels, and structural variants), and then identifies allele-specific events with significant differences in the number of mapped reads between maternal and paternal alleles. There are many technical challenges in the construction and alignment of reads to a personal diploid genome sequence that we address, for example, bias of reads mapping to the reference allele. We have applied AlleleSeq to variation data for NA12878 from the 1000 Genomes Project as well as matched, deeply sequenced RNA-Seq and ChIP-Seq data sets generated for this purpose. In addition to observing fairly widespread allele-specific behavior within individual functional genomic data sets (including results consistent with X-chromosome inactivation), we can study the interaction between ASE and ASB. Furthermore, we investigate the coordination between ASE and ASB from multiple transcription factors events using a regulatory network framework. Correlation analyses and network motifs show mostly coordinated ASB and ASE.

0 0
 · 
0 Bookmarks
 · 
37 Views
  • Source
    Article: Identification of allele-specific alternative mRNA processing via transcriptome sequencing.
    [show abstract] [hide abstract]
    ABSTRACT: Establishing the functional roles of genetic variants remains a significant challenge in the post-genomic era. Here, we present a method, allele-specific alternative mRNA processing (ASARP), to identify genetically influenced mRNA processing events using transcriptome sequencing (RNA-Seq) data. The method examines RNA-Seq data at both single-nucleotide and whole-gene/isoform levels to identify allele-specific expression (ASE) and existence of allele-specific regulation of mRNA processing. We applied the methods to data obtained from the human glioblastoma cell line U87MG and primary breast cancer tissues and found that 26-45% of all genes with sufficient read coverage demonstrated ASE, with significant overlap between the two cell types. Our methods predicted potential mechanisms underlying ASE due to regulations affecting either whole-gene-level expression or alternative mRNA processing, including alternative splicing, alternative polyadenylation and alternative transcriptional initiation. Allele-specific alternative splicing and alternative polyadenylation may explain ASE in hundreds of genes in each cell type. Reporter studies following these predictions identified the causal single nucleotide variants (SNVs) for several allele-specific alternative splicing events. Finally, many genes identified in our study were also reported as disease/phenotype-associated genes in genome-wide association studies. Future applications of our approach may provide ample insights for a better understanding of the genetic basis of gene regulation underlying phenotypic diversity and disease mechanisms.
    Nucleic Acids Research 03/2012; 40(13):e104. · 8.03 Impact Factor

Full-text (2 Sources)

View
26 Downloads
Available from
7 Jan 2013

Keywords

1000 Genomes Project
 
allele-specific events
 
AlleleSeq
 
ASE
 
computational pipeline
 
Correlation analyses
 
differences
 
diploid personal genome sequence
 
gene annotation
 
indels
 
individual functional genomic data sets
 
mapped
 
multiple transcription factors events
 
network motifs
 
personal diploid genome sequence
 
reference allele
 
regulatory network framework
 
sequenced RNA-Seq
 
technical challenges
 
widespread allele-specific behavior