A genome-wide survey of short coding sequences in streptococci.

Unité de Biochimie Bactérienne, UR477, INRA, 78350 Jouy-en-Josas, France.
Microbiology (Impact Factor: 2.85). 12/2007; 153(Pt 11):3631-44. DOI: 10.1099/mic.0.2007/006205-0
Source: PubMed

ABSTRACT Identification of short genes that encode peptides of fewer than 60 aa is challenging, both experimentally and in silico. As a consequence, the universe of these short coding sequences (CDSs) remains largely unknown, although some are acknowledged to play important roles in cell-cell communication, particularly in Gram-positive bacteria. This paper reports a thorough search for short CDSs across streptococcal genomes. Our bioinformatic approach relied on a combination of advanced intrinsic and extrinsic methods. In the first step, intrinsic sequence information (nucleotide composition and presence of RBSs) served to identify new short putative CDSs (spCDSs) and to eliminate the differences between annotation policies. In the second step, pseudogene fragments and false predictions were filtered out. The last step consisted of screening the remaining spCDSs for lines of extrinsic evidence involving sequence and gene-context comparisons. A total of 789 spCDSs across 20 complete genomes (19 Streptococcus and one Enterococcus) received the support of at least one line of extrinsic evidence, which corresponds to an average of 20 short CDSs per million base pairs. Most of these had no known function, and a significant fraction (31%) are not even annotated as hypothetical genes in GenBank records. As an illustration of the value of this list, we describe a new family of CDSs, encoding very short hydrophobic peptides (20-23 aa) situated just upstream of some of the positive transcriptional regulators of the Rgg family. The expression of seven other short CDSs from Streptococcus thermophilus CNRZ1066 that encode peptides ranging in length from 41 to 56 aa was confirmed by real-time quantitative RT-PCR and revealed a variety of expression patterns. Finally, one peptide from this list, encoded by a gene that is not annotated in GenBank, was identified in a cell-envelope-enriched fraction of S. thermophilus CNRZ1066.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: With the recent progress in complete genome sequencing, mining the increasing amount of genomic information available should in theory provide the means to discover new classes of peptides. However, annotation pipelines often do not consider small reading frames likely to be expressed. BactPepDB, available online at, is a database that aims at providing an exhaustive re-annotation of all complete prokaryotic genomes-chromosomal and plasmid DNA-available in RefSeq for coding sequences ranging between 10 and 80 amino acids. The identified peptides are classified as (i) previously identified in RefSeq, (ii) entity-overlapping (intragenic) or intergenic, and (iii) potential pseudogenes-intergenic sequences corresponding to a portion of a previously annotated larger gene. Additional information is related to homologs within order, predicted signal sequence, transmembrane segments, disulfide bonds, secondary structure, and the existence of a related 3D structure in the Protein Databank. As a result, BactPepDB provides insights about candidate peptides, and provides information about their conservation, together with some of their expected biological/structural features. The BactPepDB interface allows to search for candidate peptides in the database, or to search for peptides similar to a query, according to the multiple properties predicted or related to genomic localization. Database URL:
    Database The Journal of Biological Databases and Curation 01/2014; 2014. · 4.20 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Quorum sensing (QS) is a widespread phenomenon in the microbial world that has important implications in the coordination of population-wide responses in several bacterial pathogens. In Group A Streptococcus (GAS), many questions surrounding QS systems remain to be solved pertaining to their function and their contribution to the GAS lifestyle in the host. The QS systems of GAS described to date can be categorized into four groups: regulator gene of glucosyltransferase (Rgg), Sil, lantibiotic systems, and LuxS/AI-2. The Rgg family of proteins, a conserved group of transcription factors that modify their activity in response to signaling peptides, has been shown to regulate genes involved in virulence, biofilm formation and competence. The sil locus, whose expression is regulated by the activity of signaling peptides and a putative two-component system (TCS), has been implicated on regulating genes involved with invasive disease in GAS isolates. Lantibiotic regulatory systems are involved in the production of bacteriocins and their autoregulation, and some of these genes have been shown to target both bacterial organisms as well as processes of survival inside the infected host. Finally AI-2 (dihydroxy pentanedione, DPD), synthesized by the LuxS enzyme in several bacteria including GAS, has been proposed to be a universal bacterial communication molecule. In this review we discuss the mechanisms of these four systems, the putative functions of their targets, and pose critical questions for future studies. BACTERIAL COMMUNICATION IN GRAM-POSITIVE BACTERIA For a long time, bacteria were thought of as organisms carrying out self-sufficient and independent, unicellular lifestyles. During the last 40 years, several studies have demonstrated how, in fact, bacteria interact and establish complex social behaviors with their siblings and with other bacteria in their community to develop beneficial actions for the population, by means of conserved chemical languages. Quorum Sensing (QS) is the com-munication process in which bacteria produce, secrete and detect chemical signals with the purpose of triggering specific pheno-typical responses. QS regulates genes involved in population-wide decisions and behaviors that are beneficial when performed as a synchronous group rather than at the individual level and which include bioluminiscence, sporulation, competence, antibi-otic production, biofilm formation, and secretion of virulence factors (Reviewed by Atkinson and Williams, 2009; Ng and Bassler, 2009; Rutherford and Bassler, 2012). QS signaling in Gram-positive bacteria (Figure 1) operates through the activity of post-translationally modified oligopep-tides, named autoinducing peptides or pheromones, which can range from 5 to 34 amino acids in length and can adopt either linear or cyclical conformations (Håvarstein et al., 1995; Ji et al., 1995; Kuipers et al., 1995; Solomon et al., 1996; Otto et al., 1998; Mayville et al., 1999; Sturme et al., 2005). These pheromones are initially synthesized as inactive pro-peptides in the ribosome, and then exported from the cell by either the general secretion system (Sec) or by dedicated ABC transporters (Hui and Morrison, 1991; Zhang et al., 2002; Stephenson et al., 2003). During the export event, pro-peptides undergo proteolytic processing (and in some cases additional covalent modification) to generate the active pheromone, and a variety of enzymes have been involved in these maturation processes (Magnuson et al., 1994; Otto et al., 1998; An et al., 1999; Mayville et al., 1999; Zhang et al., 2002; Lanigan-Gerdes et al., 2007; Thoendel and Horswill, 2009). When the pheromones surpass threshold concentrations in the extracellular medium they are efficiently detected by transmembrane receptors of the two-component system (TCS) signal transduction family, leading to differential phosphorylation of a response regulator and consequent change in target gene expression. Alternatively, pheromones can be imported into the cytoplasm via peptide transporter complexes, most commonly the Opp/Ami oligopep-tide permease, a promiscuous transporter of peptides involved in the import of nutritional peptides, peptidoglycan recycling com-ponents as well as pheromone peptides for other QS systems (Leonard et al., 1996; Lazazzera et al., 1997; Slamti and Lereclus, 2002; Fontaine et al., 2010; Mashburn-Warren et al., 2010; Chang et al., 2011). Once inside the cell, peptide pheromones bind and directly modulate the activity of transcriptional regulators inside the cell. As a result of signaling, target genes change their expres-sion pattern and genes encoding for the pheromone pre-peptides are upregulated, increasing the production of mature pheromone
    Frontiers in Cellular and Infection Microbiology 09/2014; 4. · 2.62 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Lambdoid bacteriophages serve as useful models in microbiological and molecular studies on basic biological process. Moreover, this family of viruses plays an important role in pathogenesis of enterohemorrhagic Escherichia coli (EHEC) strains, as they are carriers of genes coding for Shiga toxins. Efficient expression of these genes requires lambdoid prophage induction and multiplication of the phage genome. Therefore, understanding the mechanisms regulating these processes appears essential for both basic knowledge and potential anti-EHEC applications. The exo-xis region, present in genomes of lambdoid bacteriophages, contains highly conserved genes of largely unknown functions. Recent report indicated that the Ea8.5 protein, encoded in this region, contains a newly discovered fused homeodomain/zinc-finger fold, suggesting its plausible regulatory role. Moreover, subsequent studies demonstrated that overexpression of the exo-xis region from a multicopy plasmid resulted in impaired lysogenization of E. coli and more effective induction of λ and Ф24B prophages. In this report, we demonstrate that after prophage induction, the increase in phage DNA content in the host cells is more efficient in E. coli bearing additional copies of the exo-xis region, while survival rate of such bacteria is lower, which corroborated previous observations. Importantly, by using quantitative real-time reverse transcription PCR, we have determined patterns of expressions of particular genes from this region. Unexpectedly, in both phages λ and Ф24B, these patterns were significantly different not only between conditions of the host cells infection by bacteriophages and prophage induction, but also between induction of prophages with various agents (mitomycin C and hydrogen peroxide). This may shed a new light on our understanding of regulation of lambdoid phage development, depending on the mode of lytic cycle initiation.
    PLoS ONE 01/2014; 9(10):e108233. · 3.53 Impact Factor


Available from