Ultrafast pyrosequencing of Corynebacterium kroppenstedtii DSM44385 revealed insights into the physiology of a lipophilic corynebacterium that lacks mycolic acids. J Biotechnol 136: 22-30

Institut für Genomforschung und Systembiologie, Centrum für Biotechnologie, Universität Bielefeld, Universitätsstrasse 25, D-33615 Bielefeld, Germany.
Journal of Biotechnology (Impact Factor: 2.88). 04/2008; 136(1-2):22-30. DOI: 10.1016/j.jbiotec.2008.03.004
Source: PubMed

ABSTRACT Corynebacterium kroppenstedtii is a lipophilic corynebacterial species that lacks in the cell envelope the characteristic alpha-alkyl-beta-hydroxy long-chain fatty acids, designated mycolic acids. We report here the bioinformatic analysis of genome data obtained by pyrosequencing of the type strain C. kroppenstedtii DSM44385 that was initially isolated from human sputum. A single run with the Genome Sequencer FLX system revealed 560,248 shotgun reads with 110,018,974 detected bases that were assembled into a contiguous genomic sequence with a total size of 2,446,804bp. Automatic annotation of the complete genome sequence resulted in the prediction of 2122 coding sequences, of which 29% were considered as specific for C. kroppenstedtii when compared with predicted proteins from hitherto sequenced pathogenic corynebacteria. This comparative content analysis of the genome data revealed a large repertoire of genes involved in sugar uptake and central carbohydrate metabolism and the presence of the mevalonate route for isoprenoid biosynthesis. The lack of mycolic acids and the lipophilic lifestyle of C. kroppenstedtii are apparently caused by gene loss, including a condensase gene cluster, a mycolate reductase gene, and a microbial type I fatty acid synthase gene. A complete beta-oxidation pathway involved in the degradation of fatty acids is present in the genome. Evaluation of the genomic data indicated that lipophilism is the dominant feature involved in pathogenicity of C. kroppenstedtii.

  • Source
    • "An example of this scenario is given by the synthesis of cardiolipin in Actinobacteria (Fig. 1), where two classes of cardiolipin synthases (CL-S) have been reported: the prokaryotic (Nampoothiri et al. 2002) and eukaryotic (Sandoval-Calderon et al. 2009) versions. Comparative genome analyses with emphasis on these two versions of the cls genes (Supplementary Information) revealed that the prokaryotic version of CL-S is present only in Corynebacterium (the exception being C. kroppenstedtii, a lipophilic bacteria that lacks mycolic acids [Tauch et al. 2008]) and in certain deep-rooted 36 Antonie van Leeuwenhoek (2012) 101:35–43 actinobacteria). Enzymes of this class use as a substrate two molecules of phosphatidylglycerol (PG) to synthesise cardiolipin in a reversible fashion. "
    [Show abstract] [Hide abstract]
    ABSTRACT: It has recently been proposed that in addition to Nomenclature, Classification and Identification, Comprehending Microbial Diversity may be considered as the fourth tenet of microbial systematics [Staley JT (2010) The Bulletin of BISMiS, 1(1): 1-5]. As this fourth goal implies a fundamental understanding of microbial speciation, this perspective article argues that translation of bacterial genome sequences into metabolic features may contribute to the development of modern polyphasic taxonomic approaches. Genome-scale metabolic network reconstructions (GSMRs), which are the result of computationally predicted and experimentally confirmed stoichiometric matrices incorporating all enzyme and metabolite components encoded by a genome sequence, provide a platform that can illustrate bacterial speciation. As the topology and the composition of GSMRs are expected to be the result of adaptive evolution, the features of these networks may provide the prokaryotic taxonomist with novel tools for reaching the fourth tenet of microbial systematics. Through selected examples from the Actinobacteria, which have been inferred from GSMRs and experimentally confirmed after phenotypic characterisation, it will be shown that this level of information can be incorporated into modern polyphasic taxonomic approaches. In conclusion, three specific examples are illustrated to show how GSMRs will revolutionize prokaryotic systematics, as has previously occurred in many other fields of microbiology.
    Antonie van Leeuwenhoek 01/2012; 101(1):35-43. DOI:10.1007/s10482-011-9655-1 · 2.14 Impact Factor
  • Source
    • "Applying next generation sequencing (NGS) technologies has facilitated the rapid determination of whole genome sequences. To determine the genome sequences of C. urealyticum (Tauch et al., 2008a) and C. kroppenstedtii (Tauch et al., 2008b) it was sufficient to carry out NGS-based whole genome shotgun sequencing. In the meantime it turned out that genomes with even small numbers of repetitive elements can often only be completely sequenced by use of a large insert library for sorting the assembled contigs and for closing existing gaps. "
    [Show abstract] [Hide abstract]
    ABSTRACT: The complete genome sequence of Bacillus amyloliquefaciens type strain DSM7(T) is presented. A comparative analysis between the genome sequences of the plant associated strain FZB42 (Chen et al., 2007) with the genome of B. amyloliquefaciens DSM7(T) revealed obvious differences in the variable part of the genomes, whilst the core genomes were found to be very similar. The strains FZB42 and DSM7(T) have in common 3345 genes (CDS) in their core genomes; whilst 547 and 344 CDS were found to be unique in DSM7(T) and FZB42, respectively. The core genome shared by both strains exhibited 97.89% identity on amino acid level. The number of genes representing the core genome of the strains FZB42, DSM7(T), and Bacillus subtilis DSM10(T) was calculated as being 3098 and their identity was 92.25%. The 3,980,199 bp genome of DSM7(T) contains numerous genomic islands (GI) detected by different methods. Many of them were located in vicinity of tRNA, glnA, and glmS gene copies. In contrast to FZB42, but similar to B. subtilis DSM10(T), the GI were enriched in prophage sequences and often harbored transposases, integrases and recombinases. Compared to FZB42, B. amyloliquefaciens DSM7(T) possessed a reduced potential to non-ribosomally synthesize secondary metabolites with antibacterial and/or antifungal action. B. amyloliquefaciens DSM7(T) did not produce the polyketides difficidin and macrolactin and was impaired in its ability to produce lipopeptides other than surfactin. Differences established within the variable part of the genomes, justify our proposal to discriminate the plant-associated ecotype represented by FZB42 from the group of type strain related B. amyloliquefaciens soil bacteria.
    Journal of Biotechnology 08/2011; 155(1):78-85. DOI:10.1016/j.jbiotec.2011.01.006 · 2.88 Impact Factor
  • Source
    • "A bacterial genome can now be completely sequenced, assembled and annotated in less than 24 h (Flicek & Birney, 2009; Reeves et al., 2009). The necessary final steps required for full closure of all the gaps and quality assessment of the annotation are, however, still time consuming, although it is possible, even with the current technology, to completely assemble a bacterial genome based on a single run from a 'next generation' machine (Tauch et al., 2008). The imminent 'third generation' sequencing machines are promising vast improvements and one can envision a time in the not-too-distant future when it will be routine to sequence and annotate several bacterial genomes before the morning tea break. "
    [Show abstract] [Hide abstract]
    ABSTRACT: There are now more than 1000 sequenced prokaryotic genomes deposited in public databases and available for analysis. Currently, although the sequence databases GenBank, DNA Database of Japan and EMBL are synchronized continually, there are slight differences in content at the genomes level for a variety of logistical reasons, including differences in format and loading errors, such as those caused by file transfer protocol interruptions. This means that the 1000th genome will be different in the various databases. Some of the data on the highly accessed web pages are inaccurate, leading to false conclusions for example about the largest bacterial genome sequenced. Biological diversity is far greater than many have thought. For example, analysis of multiple Escherichia coli genomes has led to an estimate of around 45 000 gene families - more genes than are recognized in the human genome. Moreover, of the 1000 genomes available, not a single protein is conserved across all genomes. Excluding the members of the Archaea, only a total of four genes are conserved in all bacteria: two protein genes and two RNA genes.
    Microbiology 03/2010; 156(Pt 3):603-8. DOI:10.1099/mic.0.038257-0 · 2.84 Impact Factor
Show more