Sequencing technologies - the next generation.

Human Genome Sequencing Center and Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA.
Nature Reviews Genetics (Impact Factor: 39.79). 12/2009; 11(1):31-46. DOI: 10.1038/nrg2626
Source: PubMed

ABSTRACT Demand has never been greater for revolutionary technologies that deliver fast, inexpensive and accurate genome information. This challenge has catalysed the development of next-generation sequencing (NGS) technologies. The inexpensive production of large volumes of sequence data is the primary advantage over conventional methods. Here, I present a technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments. I also outline the broad range of applications for NGS technologies, in addition to providing guidelines for platform selection to address biological questions of interest.

  • ABSTRACT: Tapping panel dryness (TPD) involves the partial or complete cessation of latex flow and thus seriously affects latex production in the rubber tree (Hevea brasiliensis). Numerous studies have been conducted to define its origin and nature, but the molecular nature and mechanism of TPD occurrence remain unknown. This study undertakes de novo sequencing and comparative analysis of the transcriptomes of healthy (H) and TPD-affected (T) rubber trees to identify the genes and pathways related to TPD. Total raw reads of 34,632,012 and 35,913,020 bp were obtained from the H and T libraries, respectively, using Illumina HiSeq 2000 sequencing technology. De novo assemblies yielded 141,456 and 169,285 contigs, and 96,070 and 112,243 unigenes, from the H and T libraries, respectively. Among 73,597 genes, 22,577 were identified as differentially expressed between the H and T libraries via comparative transcript profiling. A majority of the genes involved in natural rubber biosynthesis and jasmonate synthesis, which have the most potential relevance to TPD occurrence, were found to be differentially expressed. In TPD-affected trees, the expression of most genes related to latex biosynthesis and jasmonate synthesis was severely inhibited, which is probably the direct cause of TPD. These new de novo transcriptome data sets provide a significant resource for the discovery of genes related to TPD and improve our understanding of the occurrence and maintenance of TPD.
    BMC Genomics 05/2015; 16(1). DOI:10.1186/s12864-015-1562-9 · 4.04 Impact Factor
  • ABSTRACT: The identification and characterisation of differentially methylated regions (DMRs) between phenotypes in the human genome is of prime interest in epigenetics. We present a novel method, DMRcate, that fits replicated methylation measurements from the Illumina HM450K BeadChip (or 450K array) spatially across the genome using a Gaussian kernel. DMRcate identifies and ranks the most differentially methylated regions across the genome based on tunable kernel smoothing of the differential methylation (DM) signal. The method is agnostic to both genomic annotation and local change in the direction of the DM signal, removes the bias incurred from irregularly spaced methylation sites, and assigns significance to each DMR called via comparison to a null model. We show that, for both simulated and real data, the predictive performance of DMRcate is superior to that of Bumphunter and Probe Lasso, and commensurate with that of comb-p. For the real data, we validate all array-derived DMRs from the candidate methods on a suite of DMRs derived from whole-genome bisulfite sequencing called from the same DNA samples, using two separate phenotype comparisons. The agglomeration of genomically localised individual methylation sites into discrete DMRs is currently best served by a combination of DM-signal smoothing and subsequent threshold specification. The findings also suggest that the design of the 450K array shows a preference for CpG sites that are more likely to be differentially methylated, but that its overall coverage does not adequately reflect the depth and complexity of methylation signatures afforded by sequencing. For the convenience of the research community we have created a user-friendly R software package called DMRcate, downloadable from Bioconductor and compatible with existing preprocessing packages, which allows others to apply the same DMR-finding method to 450K array data.
    Epigenetics & Chromatin 01/2015; 8. DOI:10.1186/1756-8935-8-6 · 4.46 Impact Factor
  • ABSTRACT: Background: Whole genome sequences (WGS) have proliferated as sequencing technology continues to improve and costs decline. While many WGS of model or domestic organisms have been produced, a growing number of non-model species are also being sequenced. In the absence of a reference, construction of a genome sequence necessitates de novo assembly, which may be beyond the ability of many labs due to the large volumes of raw sequence data and the extensive bioinformatics required. In contrast, the presence of a reference WGS allows for alignment, which is more tractable than assembly. Recent work has highlighted that the reference need not come from the same species, potentially enabling a wide array of species WGS to be constructed using cross-species alignment. Here we report on the creation of a draft WGS from a single bighorn sheep (Ovis canadensis) using alignment to the closely related domestic sheep (Ovis aries). Results: Two sequencing libraries on SOLiD platforms yielded over 865 million reads, and combined alignment to the domestic sheep reference resulted in a nearly complete sequence (95% coverage of the reference) at an average of 12x read depth (104 SD). From this we discovered over 15 million variants and annotated them relative to the domestic sheep reference. We then conducted an enrichment analysis of those SNPs showing fixed differences between the reference and the sequenced individual and found significant differences in a number of gene ontology (GO) terms, including those associated with reproduction, muscle properties, and bone deposition. Conclusion: Our results demonstrate that cross-species alignment enables the creation of novel WGS for non-model organisms. The bighorn sheep WGS will provide a resource for future resequencing studies or comparative genomics.
    BMC Genomics 05/2015; 16(1). DOI:10.1186/s12864-015-1618-x · 4.04 Impact Factor