[Show abstract][Hide abstract] ABSTRACT: Cyanobacteria of the genera Synechococcus and Prochlorococcus play a key role in marine photosynthesis, which contributes to the global carbon cycle and to the world oxygen supply. Recently, genes encoding the photosystem II reaction center (psbA and psbD) were found in cyanophage genomes. This phenomenon suggested that the horizontal transfer of these genes may be involved in increasing phage fitness. To date, a very small percentage of marine bacteria and phages has been cultured. Thus, mapping genomic data extracted directly from the environment to its taxonomic origin is necessary for a better understanding of phage-host relationships and dynamics.
To achieve an accurate and rapid taxonomic classification, we employed a computational approach combining a multi-class Support Vector Machine (SVM) with a codon usage position specific scoring matrix (cuPSSM). Our method has been applied successfully to classify core-photosystem-II gene fragments, including partial sequences coming directly from the ocean, to seven different taxonomic classes. Applying the method on a large set of DNA and RNA psbA clones from the Mediterranean Sea, we studied the distribution of cyanobacterial psbA genes and transcripts in their natural environment. Using our approach, we were able to simultaneously examine taxonomic and ecological distributions in the marine environment.
The ability to accurately classify the origin of individual genes and transcripts coming directly from the environment is of great importance in studying marine ecology. The classification method presented in this paper could be applied further to classify other genes amplified from the environment, for which training data is available.
[Show abstract][Hide abstract] ABSTRACT: Cyanobacteria of the genera Synechococcus and Prochlorococcus are important contributors to photosynthetic productivity in the open ocean. The discovery of genes (psbA, psbD) that encode key photosystem II proteins (D1, D2) in the genomes of phages that infect these cyanobacteria suggests new paradigms for the regulation, function and evolution of photosynthesis in the vast pelagic ecosystem. Reports on the prevalence and expression of phage photosynthesis genes, and evolutionary data showing a potential recombination of phage and host genes, suggest a model in which phage photosynthesis genes help support photosynthetic activity in their hosts during the infection process. Here, using metagenomic data in natural ocean samples, we show that about 60% of the psbA genes in surface water along the global ocean sampling transect are of phage origin, and that the phage genes are undergoing an independent selection for distinct D1 proteins. Furthermore, we show that different viral psbA genes are expressed in the environment.