The genome of Theobroma cacao.

Nature Genetics (Impact Factor: 29.65). 01/2011; 34(2):101-109. DOI: 10.1038/npre.2010.4908.1
Source: OAI

ABSTRACT We sequenced and assembled the genome of Theobroma cacao, an economically important tropical fruit tree crop that is the source of chocolate. The assembly corresponds to 76% of the estimated genome size and contains almost all previously described genes, with 82% of them anchored on the 10 T. cacao chromosomes. Analysis of this sequence information highlighted specific expansion of some gene families during evolution, for example flavonoid-related genes. It also provides a major source of candidate genes for T. cacao disease resistance and quality improvement. Based on the inferred paleohistory of the T. cacao genome, we propose an evolutionary scenario whereby the ten T. cacao chromosomes were shaped from an ancestor through eleven chromosome fusions. The T. cacao genome can be considered as a simple living relic of higher plant evolution.


Available from: Spencer Craig Brown, Dec 13, 2013
  • ABSTRACT: The scope and breadth of genome-scale metabolic reconstructions have continued to expand over the last decade. Herein, we introduce a genome-scale model for a plant with direct applications to food and bioenergy production (i.e., maize). Maize annotation is still underway, which introduces significant challenges in the association of metabolic functions to genes. The developed model is designed to meet rigorous standards on gene-protein-reaction (GPR) associations, elementally and charge-balanced reactions and a biomass reaction abstracting the relative contribution of all biomass constituents. The metabolic network contains 1,563 genes and 1,825 metabolites involved in 1,985 reactions from primary and secondary maize metabolism. For approximately 42% of the reactions, direct literature evidence for the participation of the reaction in maize was found. As many as 445 reactions and 369 metabolites are unique to the maize model compared to the AraGEM model for A. thaliana. In addition, 674 metabolites and 893 reactions are present in Zea mays iRS1563 that are not accounted for in maize C4GEM. All reactions are elementally and charge-balanced and localized into six different compartments (i.e., cytoplasm, mitochondrion, plastid, peroxisome, vacuole and extracellular). GPR associations are also established based on the functional annotation information and homology prediction accounting for monofunctional, multifunctional and multimeric proteins, isozymes and protein complexes. We describe results from performing flux balance analysis under different physiological conditions of a C4 plant (i.e., photosynthesis, photorespiration and respiration) and also explore model predictions against experimental observations for two naturally occurring mutants (i.e., bm1 and bm3). The developed model corresponds to the largest and most complete effort to date at cataloguing metabolism for a plant species.
    PLoS ONE 07/2011; 6(7):e21784. DOI:10.1371/journal.pone.0021784 · 3.53 Impact Factor
  • ABSTRACT: The development of modern approaches to the genetic improvement of the tree crops Ilex paraguariensis (‘yerba mate’) and Ilex dumosa (‘yerba señorita’) is halted by the scarcity of basic genetic information. In this study, we characterized the implementation of low-cost methodologies such as representational difference analysis (RDA), single-strand conformation polymorphisms (SSCP), and reverse and direct dot-blot filter hybridization assays coupled with thorough bioinformatic characterization of sequence data for both species. Also, we estimated the genome size of each species using flow cytometry. This study contributes to the better understanding of the genetic differences between two cultivated species, by generating new quantitative and qualitative genome-level data. Using the RDA technique, we isolated a group of non-coding repetitive sequences, tentatively considered as Ilex-specific, which were 1.21- to 39.62-fold more abundant in the genome of I. paraguariensis. Another group of repetitive DNA sequences involved retrotransposons, which appeared 1.41- to 35.77-fold more abundantly in the genome of I. dumosa. The genomic DNA of each species showed different performances in filter hybridizations: while I. paraguariensis showed a high intraspecific affinity, I. dumosa exhibited a higher affinity for the genome of the former species (i.e. interspecific). These differences could be attributed to the occurrence of homologous but slightly divergent repetitive DNA sequences, highly amplified in the genome of I. paraguariensis but not in the genome of I. dumosa. Additionally, our hybridization outcomes suggest that the genomes of both species have less than 80% similarity. Moreover, for the first time, we report herein a genome size estimate of 1670 Mbp for I. paraguariensis and that of 1848 Mbp for I. dumosa.
    Plant Genetic Resources 05/2014; 13(02):1. DOI:10.1017/S1479262114000756 · 1.06 Impact Factor