ABSTRACT: Next-generation sequencing plays a central role in the characterization and quantification of transcriptomes. Although numerous metrics are purported to quantify the quality of RNA, there have been no large-scale empirical evaluations of the major determinants of sequencing success. We used a combination of existing and newly developed methods to isolate total RNA from 1115 samples from 695 plant species in 324 families, which represents >900 million years of phylogenetic diversity from green algae through flowering plants, including many plants of economic importance. We then sequenced 629 of these samples on Illumina GAIIx and HiSeq platforms and performed a large comparative analysis to identify predictors of RNA quality and the diversity of putative genes (scaffolds) expressed within samples. Tissue types (e.g., leaf vs. flower) varied in RNA quality, sequencing depth and the number of scaffolds. Tissue age also influenced RNA quality but not the number of scaffolds ≥1000 bp. Overall, 36% of the variation in the number of scaffolds was explained by metrics of RNA integrity (RIN score), RNA purity (OD 260/230), sequencing platform (GAIIx vs HiSeq) and the amount of total RNA used for sequencing. However, our results show that the most commonly used measures of RNA quality (e.g., RIN) are weak predictors of the number of scaffolds because Illumina sequencing is robust to variation in RNA quality. These results provide novel insight into the methods that are most important in isolating high quality RNA for sequencing and assembling plant transcriptomes. The methods and recommendations provided here could increase the efficiency and decrease the cost of RNA sequencing for individual labs and genome centers.
PLoS ONE 01/2012; 7(11):e50226. · 3.73 Impact Factor
ABSTRACT: New hybrid species might be expected to show patterns of gene expression intermediate to those shown by parental species. "Transcriptomic shock" may also occur, in which gene expression is disrupted; this may be further modified by whole genome duplication (causing allopolyploidy). "Shock" can include instantaneous partitioning of gene expression between parental copies of genes among tissues. These effects have not previously been studied at a population level in a natural allopolyploid plant species. Here, we survey tissue-specific expression of 144 duplicated gene pairs derived from different parental species (homeologs) in two natural populations of 40-generation-old allotetraploid Tragopogon miscellus (Asteraceae) plants. We compare these results with patterns of allelic expression in both in vitro "hybrids" and hand-crossed F1 hybrids between the parental diploids T. dubius and T. pratensis, and with patterns of homeolog expression in synthetic (S1) allotetraploids. Partitioning of expression was frequent in natural allopolyploids, but F1 hybrids and S1 allopolyploids showed less partitioning of expression than the natural allopolyploids and the in vitro "hybrids" of diploid parents. Our results suggest that regulation of gene expression is relaxed in a concerted manner upon hybridization, and new patterns of partitioned expression subsequently emerge over the generations following allopolyploidization.
Current biology: CB 03/2011; 21(7):551-6. · 10.99 Impact Factor