Large-scale transcriptome analysis in chickpea (Cicer arietinum L.), an orphan legume crop of the semi-arid tropics of Asia and Africa

International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, India.
Plant Biotechnology Journal (Impact Factor: 5.68). 05/2011; 9(8):922-31. DOI: 10.1111/j.1467-7652.2011.00625.x
Source: PubMed

ABSTRACT Chickpea (Cicer arietinum L.) is an important legume crop in the semi-arid regions of Asia and Africa. Gains in crop productivity have been low, however, particularly because of biotic and abiotic stresses. To help enhance crop productivity using molecular breeding techniques, next-generation sequencing technologies such as Roche/454 and Illumina/Solexa were used to determine the sequence of most gene transcripts and to identify drought-responsive genes and gene-based molecular markers. A total of 103,215 tentative unique sequences (TUSs) were produced from 435,018 Roche/454 reads and 21,491 Sanger expressed sequence tags (ESTs). Putative functions were determined for 49,437 (47.8%) of the TUSs, and gene ontology assignments were determined for 20,634 (41.7%) of the TUSs. Comparison of the chickpea TUSs with the Medicago truncatula genome assembly (Mt 3.5.1 build) resulted in 42,141 aligned TUSs with putative gene structures (including 39,281 predicted intron/splice junctions). Alignment of ∼37 million Illumina/Solexa tags generated from drought-challenged root tissues of two chickpea genotypes against the TUSs identified 44,639 differentially expressed TUSs. The TUSs were also used to identify a diverse set of markers, including 728 simple sequence repeats (SSRs), 495 single nucleotide polymorphisms (SNPs), 387 conserved orthologous sequence (COS) markers, and 2088 intron-spanning region (ISR) markers. This resource will be useful for basic and applied research for genome analysis and crop improvement in chickpea.
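
The percentages quoted in the abstract can be checked against its own counts. The short sketch below uses only numbers stated in the abstract; the variable and function names are illustrative. It suggests that the 41.7% figure for gene ontology assignments is relative to the 49,437 TUSs with putative functions rather than to all 103,215 TUSs, and that the 47.8% figure appears to be truncated rather than rounded.

```python
# Counts quoted in the abstract.
TOTAL_TUS = 103_215        # all tentative unique sequences
ANNOTATED_TUS = 49_437     # TUSs with a putative function
GO_ASSIGNED_TUS = 20_634   # TUSs with a gene ontology assignment


def percent(part: int, whole: int) -> float:
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole


annotated_of_total = percent(ANNOTATED_TUS, TOTAL_TUS)
go_of_annotated = percent(GO_ASSIGNED_TUS, ANNOTATED_TUS)
go_of_total = percent(GO_ASSIGNED_TUS, TOTAL_TUS)

print(f"annotated / all TUSs:       {annotated_of_total:.2f}%")
print(f"GO-assigned / annotated:    {go_of_annotated:.2f}%")
print(f"GO-assigned / all TUSs:     {go_of_total:.2f}%")
```

Only the second ratio reproduces the quoted 41.7%, which is why the denominator is read here as the annotated subset.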

Available from: Pooran M Gaur, Jul 27, 2015
  • ABSTRACT: Novel high-throughput next-generation sequencing (NGS) technologies are providing opportunities to explore genomes and transcriptomes in a cost-effective manner. To construct a gene expression atlas of developing oat (Avena sativa) seeds, two software packages specifically designed for RNA-seq (Trinity and Velvet/Oases) were employed for de novo assembly of nearly 134 million quality-filtered 100-bp paired-end reads sequenced from RNA extracted at four stages of seed development. Based on the quality parameters assessed, Velvet/Oases assemblies were more accurate, produced longer transcripts, and contained more putative unique genes. The final assembly, termed dnOSt (de novo Oat Seed transcriptome), is over 55 million nucleotides in length, contains 53,339 transcript isoforms, and was constructed with approximately 43 million reads, which represents an estimated 74.8× sequencing depth with an average transcript length of 1,043 nucleotides. To assess the accuracy and completeness of dnOSt, we investigated the presence of transcripts associated with the biosynthesis of three compounds with health-promoting properties: avenanthramides, tocols (vitamin E), and β-glucans. Homologs to all investigated genes were present in dnOSt, demonstrating that it is a robust and useful new tool for oat research. Currently, we are independently assembling transcripts from the four sampled time-points for use in characterizing temporal gene expression patterns during the course of seed development. Results of these analyses will also be presented.
    International Plant and Animal Genome Conference XXI 2013;
  • ABSTRACT: Next-generation sequencing (NGS), being inexpensive and fast, is facilitating the identification of SNPs in crop species. While several tools and pipelines are available for SNP discovery in crop species that have a reference genome, SNP discovery with higher precision and better efficiency is still challenging in crop species that lack one. In this context, we have developed a coverage-based consensus calling (CbCC) approach. As an example, this approach has been used to identify SNPs between two genotypes (ICC 4958 and ICC 1882) of chickpea (Cicer arietinum), a crop species without a reference genome. Four open source short read alignment tools (Bowtie, Maq, NovoAlign, and SOAP2) were used to align 15.7 and 22.1 million Illumina reads (36 bp long) from ICC 4958 and ICC 1882, respectively, onto a chickpea transcriptome assembly of 46,740 transcriptome assembly contigs (TACs). Applying the CbCC approach to the output of these tools yielded a non-redundant set of 4543 SNPs, of which 224 were randomly selected for experimental validation. Comparison of the in silico and wet lab results showed that Maq outperformed the other individual tools: 50.0% of the SNPs predicted by Maq were true SNPs. For combinations of two tools, the greatest accuracy (55.7%) was reported for Maq and Bowtie, while a combination of Bowtie, Maq, and NovoAlign identified 61.5% true SNPs. In summary, this study provides a benchmark comparison of four commonly used tools, as well as of read depths, for SNP discovery in a species without a reference genome.
    International Plant and Animal Genome Conference XXI 2013;
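
The idea of combining SNP calls from several alignment tools and scoring them against validated sites can be illustrated with a small sketch. This is not the CbCC method itself, which the abstract does not describe in detail; it assumes a plain intersection of call sets, and every site, contig name, and validated result below is hypothetical toy data chosen only to make the example runnable.

```python
from itertools import combinations

# Hypothetical toy calls: each tool maps to a set of (contig, position) SNP sites.
calls = {
    "Bowtie": {("TAC1", 10), ("TAC1", 55), ("TAC2", 7)},
    "Maq": {("TAC1", 10), ("TAC2", 7), ("TAC3", 90)},
    "NovoAlign": {("TAC1", 10), ("TAC1", 55), ("TAC3", 90)},
    "SOAP2": {("TAC1", 10), ("TAC2", 8)},
}

# Hypothetical sites confirmed by wet lab validation.
validated = {("TAC1", 10), ("TAC2", 7)}


def consensus(tools):
    """Return the SNP sites reported by every tool named in `tools`."""
    return set.intersection(*(calls[name] for name in tools))


def true_fraction(predicted, truth):
    """Fraction of predicted sites found in `truth`, or None if none were predicted."""
    if not predicted:
        return None
    return len(predicted & truth) / len(predicted)


# Score single tools, pairs, and triples, as in the comparison described above.
for size in (1, 2, 3):
    for tools in combinations(sorted(calls), size):
        sites = consensus(tools)
        frac = true_fraction(sites, validated)
        shown = "n/a" if frac is None else f"{frac:.2f}"
        print("+".join(tools), len(sites), shown)
```

Requiring agreement between tools shrinks the call set but can raise the share of true SNPs, which is the trade-off the reported single-tool and multi-tool accuracies reflect.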