ArticlePDF Available

Members of the Candidate Phyla Radiation are functionally differentiated by carbon- and nitrogen-cycling capabilities

Springer Nature
Microbiome
Authors:

Abstract and Figures

Background: The Candidate Phyla Radiation (CPR) is a recently described expansion of the tree of life that represents more than 15% of all bacterial diversity and potentially contains over 70 different phyla. Despite this broad phylogenetic variation, these microorganisms appear to feature little functional diversity, with members generally characterized as obligate fermenters. Additionally, much of the data describing CPR phyla has been generated from a limited number of environments, constraining our knowledge of their functional roles and biogeographical distribution. To expand our understanding of subsurface CPR microorganisms, we sampled four separate groundwater wells over 2 years across three Ohio counties. Results: Samples were analyzed using 16S rRNA gene amplicon and shotgun metagenomic sequencing. Amplicon results indicated that CPR members comprised between 2 and 20% of the microbial communities with relative abundances stable through time in Athens and Greene samples but dynamic in Licking groundwater. Shotgun metagenomic analyses generated 71 putative CPR genomes, representing roughly 32 known phyla and 2 putative new phyla, Candidatus Brownbacteria and Candidatus Hugbacteria. While these genomes largely mirrored metabolic characteristics of known CPR members, some features were previously uncharacterized. For instance, nitrite reductase, encoded by nirK, was found in four of our Parcubacteria genomes and multiple CPR genomes from other studies, indicating a potentially undescribed role for these microorganisms in denitrification. Additionally, glycoside hydrolase (GH) family profiles for our 71 genomes and over 2000 other CPR genomes were analyzed to characterize their carbon-processing potential. Although common trends were present throughout the radiation, differences highlighted potential mechanisms that could allow microorganisms across the CPR to occupy various subsurface niches. For example, members of the Microgenomates superphylum appear to potentially degrade a wider range of carbon substrates than other CPR phyla. Conclusions: CPR members are present across a range of environments and often constitute a significant fraction of the microbial population in groundwater systems, particularly. Further sampling of such environments will resolve this portion of the tree of life at finer taxonomic levels, which is essential to solidify functional differences between members that populate this phylogenetically broad region of the tree of life.
This content is subject to copyright. Terms and conditions apply.
R E S E A R C H Open Access
Members of the Candidate Phyla Radiation
are functionally differentiated by carbon-
and nitrogen-cycling capabilities
R. E. Danczak
1
, M. D. Johnston
2
, C. Kenah
3
, M. Slattery
3
, K. C. Wrighton
1
and M. J. Wilkins
1,2*
Abstract
Background: The Candidate Phyla Radiation (CPR) is a recently described expansion of the tree of life that represents
more than 15% of all bacterial diversity and potentially contains over 70 different phyla. Despite this broad phylogenetic
variation, these microorganisms appear to feature little functional diversity, with members generally characterized as
obligate fermenters. Additionally, much of the data describing CPR phyla has been generated from a limited number of
environments, constraining our knowledge of their functional roles and biogeographical distribution. To expand our
understanding of subsurface CPR microorganisms, we sampled four separate groundwater wells over 2 years across three
Ohio counties.
Results: Samples were analyzed using 16S rRNA gene amplicon and shotgun metagenomic sequencing. Amplicon results
indicated that CPR members comprised between 2 and 20% of the microbial communities with relative abundances stable
through time in Athens and Greene samples but dynamic in Licking groundwater. Shotgun metagenomic analyses
generated 71 putative CPR genomes, representing roughly 32 known phyla and 2 putative new phyla, Candidatus
Brownbacteria and Candidatus Hugbacteria. While these genomes largely mirrored metabolic characteristics of known
CPR members, some features were previously uncharacterized. For instance, nitrite reductase, encoded by nirK,was
found in four of our Parcubacteria genomes and multiple CPR genomes from other studies, indicating a potentially
undescribed role for these microorganisms in denitrification. Additionally, glycoside hydrolase (GH) family profiles for
our 71 genomes and over 2000 other CPR genomes were analyzed to characterize their carbon-processing potential.
Although common trends were present throughout the radiation, differences highlighted potential mechanisms that
could allow microorganisms across the CPR to occupy various subsurface niches. For example, members of the
Microgenomates superphylum appear to potentially degrade a wider range of carbon substrates than other CPR phyla.
Conclusions: CPR members are present across a range of environments and often constitute a significant fraction of
the microbial population in groundwater systems, particularly. Further sampling of such environments will resolve this
portion of the tree of life at finer taxonomic levels, which is essential to solidify functional differences between
members that populate this phylogenetically broad region of the tree of life.
Keywords: Candidate phyla, Groundwater, Subsurface, Carbon, Nitrogen
* Correspondence: wilkins.231@osu.edu
1
Department of Microbiology, The Ohio State University, Columbus, OH, USA
2
School of Earth Sciences, The Ohio State University, Columbus, OH, USA
Full list of author information is available at the end of the article
© The Author(s). 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0
International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and
reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to
the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver
(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
Danczak et al. Microbiome (2017) 5:112
DOI 10.1186/s40168-017-0331-1
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Background
The Candidate Phyla Radiation (CPR) is a recently
described expansion of the tree of life that lacked any
sequenced genomes until 2012 [1], with all prior know-
ledge derived from various marker gene studies [2, 3].
This absence of genomic information limited inferences
on the phylogenetic diversity and biogeochemical roles
of these organisms. Early work predicted that this radi-
ation represents roughly 15% of the bacterial domain [4].
More recently, extensive genomic sampling of this
radiation yielded over 2000 genomes, many of which
were closed, significantly expanding the phylogenetic
diversity within the radiation [46]. With this increased
metagenomic sampling, CPR membership has grown to
include over 70 phyla including two superphyla (Parcu-
bacteria and Microgenomates) [6, 7]. Despite this
radiation constituting a large portion of bacterial diver-
sity, the absence of cultured representatives means that
our current understanding is derived exclusively from
various omics datasets, with a large fraction derived
from a single field site near Rifle, CO. Since then,
additional genomes have been reconstructed from other
subsurface locations [810] and habitats [11, 12] around
the globe.
Based upon current data, there appear to be a number
of conserved traits throughout the entire CPR. Firstly,
despite representing a broad phylogenetic diversity, CPR
members are typically characterized by very limited
biosynthetic capabilities (i.e., the inability to produce
amino acids, lipids, some conserved genes, etc.) [4, 5].
Originally, these organisms were inferred to be obligate
fermenters based upon the complete absence of respira-
tory genes. Recently, however, CPR metabolic versatility
was expanded for members of the Parcubacteria with
new genomes that encode putative components of the
dissimilatory nitrate reduction to ammonia pathway
[12, 13] as well as potential hydroxylamine oxidation
[13]. Metaproteomic analyses also suggested that fermen-
tative CPR likely play a significant role in hydrogen and
carbon cycling in subsurface ecosystems [4, 14, 15].
Beyond metabolism, small genome size (~ 1 Mb) is
also generally conserved throughout the radiation [4].
Consistent with small genome sizes, high-resolution
cryo-TEM of the CPR revealed extremely small cell sizes
of 0.009 ± 0.002 mm
3
[16]. Taken together, phylogenetic,
metabolic, and cell biology inferences suggest that
members of the CPR may lead a symbiotic or syntrophic
lifestyle, dependent upon some partner (or partners) for
necessary metabolites while potentially providing labile
fermentation waste products such as acetate in return
[4, 5, 9, 11]. Such a co-dependent lifestyle could potentially
account for our current lack of cultured CPR members.
While these conclusions are well supported by existing
genomic datasets, samples from only a limited range of
subsurface locations account for nearly all the phylogen-
etic diversity within the CPR. To better resolve both the
biogeography of CPR members and uncover new phylo-
genetic diversity within this radiation, we performed
time-resolved metagenomic sampling on planktonic
biomass from four aquifers across southern Ohio (Fig. 1a).
From the resulting data, we were able to obtain 71 near-
complete or incomplete CPR genomes. Furthermore, we
profiled glycoside hydrolases encoded across over 2000
separate CPR genomes and identified a potentially new
role for many of these microorganisms in the nitrogen
cycle via a critical step in denitrification (conversion
ofnitritetonitricoxide).Thesedataexpandour
current understanding of the phylogeny of the CPR
and suggest that these microorganisms are cosmopol-
itan in subsurface environments. Moreover, our re-
sults advance knowledge of functional differentiation
across the radiation, with new roles in nitrogen and
carbon cycling highlighted here.
Methods
Sample collection
Groundwater samples were collected from the Ohio
Department of Natural Resources (ODNR) Observation
Well Network used to monitor ground water level fluc-
tuations in three different counties (Fig. 1a). These three
wells are located within separate buried valley aquifers,
valleys that have been back filled with glacial sands and
gravels and some till. The observation wells were
sampled on a quarterly basis over a 2-year period from
July 2014 to July 2016. Additionally, one private drinking
water well, located in a sand and gravel aquifer within a
thick till sequence in western Licking County (Fig. 1a),
was sampled once in June 2016.
Wells were purged of more than 250 L of water to
ensure that aquifer-derived water was being sampled
(dedicated pumps were placed at the screened interval
for the ODNR wells). Approximately 38 L of post-purge
groundwater was pumped sequentially through a 0.2-μm
then 0.1-μm Supor PES Membrane Filter (Pall Corpor-
ation, NY, USA). The filters were then immediately flash
frozen and kept on dry ice until they were returned to the
Ohio State University where they were stored at 80 °C
until DNA extraction. During sampling, oxidation-
reduction potential (ORP), temperature, and pH were
measured using a handheld Myron Ultrameter II (Myron
L Company, CA, USA). General observational data for
each of these groundwater wells is provided in Additional
file 1: Table S1.
DNA extraction, sequencing, and processing
DNA was extracted from roughly a quarter of each Supor
PES membrane filter by using the PowerSoil DNA Isola-
tion Kit (Mo Bio Laboratories, Inc., Carlsbad, CA, USA).
Danczak et al. Microbiome (2017) 5:112 Page 2 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Final DNA concentrations were determined by using a
Qubit fluorometer (Invitrogen, Carlsbad, CA, USA).
To generate 16S rRNA gene data, the V4 region of 16S
rRNA genes was amplified and sequenced by using the
universal bacterial/archaeal primer set 515F/806R on an
Illumina MiSeq instrument at Argonne National Labora-
tory. The resulting reads were processed through the
QIIME pipeline (V1.7.0) and clustered into operational
taxonomic unit (OTU) classifications at 97% similarities.
Taxonomies were assigned using the SILVA database as
well as a CPR 16S rRNA gene database curated by
Brown et al. [4].
Metagenomic data for nine samples (0.2-μm filters from
July 2014, October 2014, and April 2016 for Greene and
Athens, October 2014 for Licking, July 2016 for the Lick-
ing Private Well and then a 0.1-μm filter from October
2014 for Greene) was collected by shotgun sequencing on
an Illumina HiSeq 2500 at the Genomics Shared Resource
at the Ohio State University. Raw reads were trimmed and
filtered based upon read quality using sickle pe with de-
fault parameters [17]. Resulting reads were subsequently
assembled into larger contigs and then scaffolds using
idba_ud with default parameters [18].
The data generated during 16S rRNA gene sequencings
can be obtained from NCBI using accession number
SRX2896383. The 71 genomes and annotated protein se-
quences are publicly hosted at http://chimera.asc.ohio-sta-
te.edu/Danczak_Genomes_and_Protein_sequences.html.
16S rRNA gene sequencing analysis
In order to obtain results specific to this radiation, non-
CPR taxonomies were first removed from the OTU
table. A stacked bar chart was generated in R v3.3.2 [19]
using ggplot2 to visualize differential relative abundances
(ggplot, ggplot2 package v2.2.1) [20]. Beta diversity was
calculated using Bray-Curtis dissimilarity [21] and
subsequently plotted using non-metric multidimensional
scaling (NMDS) to observe CPR community differences
(metaMDS, vegan package v2.4-2) [22]. Species scores
were then plotted as loading arrows in order to under-
stand which members were responsible for the observed
separation.
Metagenomic binning and annotation
In order to determine average coverage across a scaffold,
trimmed reads were mapped to assembled scaffolds
using bowtie2 (bowtie2 fast) [23]. Using the read-
mapping information, assembled scaffolds 2500 bp
were binned using MetaBAT (metabat superspecific)
[24]. Genome completeness in each bin was determined
by analyzing the presence of 31 conserved bacterial
genes using AMPHORA2 [25]. Given that these
conserved genes are considered single copy, genomes
with multiple copies were considered to be contami-
nated (or misbinned). Bins with a genome completion
> 50% and a misbin < 10% were used in downstream
analyses. The open reading frames (ORFs) for the nine
ba
Fig. 1 Map of Ohio and CPR relative abundance through time. aA map of Ohio with studied colors indicated by various colors; colored circles
represent the approximate location of the Ohio Department of Natural Resources (ODNR) sampling wells and one private well. Columbus, OH, is
indicated as the yellow dot for reference. bStacked bar chart of CPR phyla relative abundances from the three ODNR sampling locations (Athens,
Greene, and Licking). Oxidation-reduction potential (ORP) is plotted atop the abundance graphs as line graphs
Danczak et al. Microbiome (2017) 5:112 Page 3 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
entire metagenomes and resulting bins were predicted
using MetaProdigal [26] and subsequently annotated
by comparing predicted ORFs to the KEGG,
UniRef90, and InterProScan databases [2731] using
USEARCH to scan for single and reverse best hit
(RBH) results [32]. The KEGG Automatic Annotation
Server (KAAS) [33] was also used to visualize path-
way completeness.
The presence of different glycoside hydrolase (GH)
families was determined in both the above bins and
2155 previously studied genomes obtained from
ggkbase [4, 6] by using the dbCAN hidden Markov
model (HMM) (hmmsearch cpu 4 tblout
result.hres noali -E 0.00001 -o result_hmm.txt
dbCAN-fam-HMMs.hmm input.faa)[3436]. Using
the hmmsearch output, GH family gene counts were
obtained for each genome by selecting the HMM results
with the lowest e-values. Differences in GH profiles for
both bins and genomes were visualized in R using both a
presence/absence heatmap (ggplot, ggplot2 package
v2.2.1) [20] and a redundancy analysis (rda, vegan package
v2.4-2) [22] with bins from this study collapsed into their
putative phylogenies. Given that putative nirK function in
Parcubacteria is newly proposed, a MUSCLE alignment
[37] of nirK sequences from this study, from previously
obtained Parcubacteria, and from other bacteria was
generated using default parameters to observe the pres-
ence/absence of conserved residues [38]. The nirK
sequences not from this study were obtained from NCBI
and represent diverse taxonomies capable of nitrite reduc-
tion. These alignments were then trimmed in Geneious
9.1.5 [39] to remove unaligned ends and portions with >
95% gaps. A RAxML tree with 100 bootstraps was then
generated to determine evolutionary relatedness and func-
tional potential (raxmlHPC-PTHREADS famPROT-
CAT x 12345 p 12345 N100T20sinputn
output) [40] and visualized in R using ggtree (ggtree,
ggtree package v1.6.9) [41].
Metagenomic abundance calculations
The rough microbial diversity within the nine metage-
nomic samples was determined using an approach
modified from Anantharaman et al. [6]. First, the
ribosomal protein S3 (rps3 or rpsC)waspulledfrom
each sample using an HMM derived from AM-
PHORA2 [25]. Ribosomal protein S3 sequences for
each metagenome were subsequently clustered to-
gether at 99% using USEARCH [32]. Reads were then
mapped to the resulting rps3 clusters using Bowtie2,
and coverage was subsequently determined as a proxy
for abundance. Taxonomic assignments for each rps3
cluster were obtained by comparing them against a
database of rps3 proteins derived from Hug et al. [7]
using blastp with an e-value cutoff of 10
15
.
Phylogenetic analysis for bins
Given the inconsistent presence of various phylogenetic
marker genes in different bins, numerous genes were
analyzed to approximate phylogeny. The 16S rRNA gene
was extracted from bins using SSU-Align with default
parameters [42] and then aligned to two separate 16S
rRNA gene databases [4, 6] using SINA on the SILVA
website [43, 44]. The sequence identity to all genes
within the SILVA NR99 database was also determined
for potential phyla proposition [6]. Unaligned ends and
regions with 95% or greater gaps were then trimmed using
Geneious 9.1.5 [39] with a RAxML tree (100
bootstraps) subsequently generated (raxmlHPC-
PTHREADS fam GTRGAMMA x 12345 p 12345
N100T20sinputnoutput)[40]andvisualizedusing
the ggtree package in R (ggtree, ggtree package v1.6.9) [41].
Phylogeny was further identified by building rps3 and
gyrA trees following the protocols outlined above for
nirK (i.e., pulled sequences using HMMs, aligned in
MUSCLE and generated a tree using RAxML). For
high-resolution phylogeny of 45 bins, a concatenated
ribosomal tree was generated from 16 ribosomal
proteins (rpL2,3,4,5,6,14,15,16,18,22,24,rps3,8,
10,17,19). First, each ribosomal protein sequence was
pulled from the bins using the appropriate HMM either
derived from AMPHORA2 or TIGRFAM release 15.0
in hmmsearch [25, 34, 45]. Alignments were then
performed independently for each sequence using
MUSCLE with default parameters [37]. Completed
alignments were concatenated using Geneious 9.1.5
and trimmed to remove unaligned ends and portions
with 95% gaps or greater [39]. RAxML was then used
inordertogenerateaphylogenetictreewith100
bootstraps with the same command given above,
again visualized in R with ggtree (ggtree, ggtree pack-
age v1.6.9) [40, 41].
New phyla were proposed based upon three separate
parameters. Firstly, if the 16S rRNA gene sequence was
less than 75% similar to previously described sequences
and at least two new genomes formed a monophyletic
clade within the concatenated ribosomal tree, a new
phylum was proposed [6, 46]. However, given high
phylogenetic variability within the CPR, if a very obvi-
ous monophyletic clade was observed on the
concatenated tree, up to roughly 80% similarity was
tolerated. Following naming conventions for the CPR,
proposed names for the phyla were based upon re-
searchers who have made significant contributions to
our understanding of the diversity and physiology of
CPR microorganisms [6].
All alignments generated during the phylogenetic ana-
lysis (i.e., rps3,gyrA, 16S rRNA, and concatenated ribo-
somal proteins) can be found in the supplemental
information (Additional files 2, 3, 4, and 5).
Danczak et al. Microbiome (2017) 5:112 Page 4 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Results
CPR community member abundances are stable through
time
Quarterly groundwater samples were collected from four
wells across central and southern Ohio in Licking,
Athens, and Greene counties over the course of 2 years
(Fig. 1a). Field measurements revealed varying redox
conditions across these locations (Fig. 1b). CPR mem-
bers constituted modest portions of the overall microbial
community of each well, as inferred from 16S rRNA
gene analyses (Fig. 1b). Relative abundances ranged be-
tween 4 and 5% in Athens groundwater, between 1 and
4% in Greene groundwater, and between 5 and 20% in
Licking groundwater. Moreover, the relative abundance
of CPR was temporally stable in both the Athens and
Greene samples but more dynamic in Licking where
members of the CPR represented an increasing portion
of the overall community over the sampling period.
Multivariate ordination analyses of CPR community
members revealed that each location contained signifi-
cantly different CPR phyla (Additional file 6: Figure S1).
For example, Peregrinibacteria were almost entirely
unique to the Greene location (Fig. 1b), while a broad
diversity of phyla differentiated this well from the
Athens and Licking locations. Notably, groundwater
samples from the Greene location, which had the lowest
relative abundance but greatest diversity of CPR, were
the most oxidized (Fig. 1b). In contrast, the more re-
duced groundwater contained greater CPR-relative
abundances and shared more similar CPR membership
(Additional file 6: Figure S1).
To better profile the phylogeny and physiology of
these CPR lineages, we used assembly-based shotgun
metagenomic analyses of microbial communities in nine
of these samples. Briefly, between 300 and 900 Mbp of
sequence information was generated after assembly
(Additional file 1: Table S2). The relative abundances for
the CPR in each metagenomic sample were calculated
using sequencing read-depth for all ribosomal protein S3
(rps3) sequences (Fig. 2a). Although some differences
were observed between metagenomic and 16S rRNA
gene amplicon-inferred relative abundances, broad
patterns between samples were generally maintained.
For example, the Peregrinibacteria were the most
abundant CPR taxa in samples from the Greene well,
while the Microgenomates had pronounced representa-
tion in the Athens well samples.
Samples contain significant CPR phylogenetic diversity
Sizes of the genomes obtained during the binning
process ranged from 0.5 to 1.3 Mbp, in line with previ-
ous characterizations of the radiation [4]. Completeness
measurements, phylogeny, and putative function were
determined to identify and separate putative CPR bins.
Any bins less than 50% complete based upon single copy
gene presence, or containing greater than 10% misbin
based upon multiple copies of single copy genes, were
excluded from future analysis. In total, 71 putative CPR
bins were obtained across all samples. Of these 71 bins,
45 bins were considered near completebased on com-
pletion greater than 90%; the 26 remaining bins were
deemed incomplete(Fig. 2b). Taxonomy was initially
assigned to each bin using the rps3 marker gene and at
least one other single copy phylogenetic marker (16S
rRNA gene or gyrA) (Additional file 6: Figures S2S4;
Additional files 7, 8 and 9). For increased phylogenetic
resolution, a 16-protein concatenated ribosomal tree was
also constructed that included a mix of 45 near-
complete and incomplete genomes (63%) from this study
(Fig. 3). A brief summary of the 71 bins is provided in
Additional file 1: Table S3.
Genomes obtained from this study occupied a broad
phylogenetic diversity with many different CPR phyla
sampled. Though most bins were from the Parcubacteria
superphylum, Berkelbacteria were the best-represented
individual phylum with six bins total. Many of the
Parcubacteria bins were from recently characterized
phyla [6], with Kaiserbacteria (five bins), Harrisonbac-
teria (four bins), Lloydbacteria (four bins), and Taylor-
bacteria (four bins) being the most represented. At these
high taxonomic levels (i.e., phyla), many of the spatial
differences between wells inferred from 16S rRNA
amplicon data were not recapitulated in the metage-
nomic data (Fig. 2b).
From the placement of these new genomes in the
context of the CPR, we identified two new phylum-level
groups within the Parcubacteria superphylum. Following
naming conventions previously established for this radi-
ation, we propose that six of our bins belong to two new
phylum-level lineages, Candidatus Brownbacteria and
Candidatus Hugbacteria. These names were chosen due
to the significant contributions that Christopher Brown
and Laura Hug have made to our understanding of
CPR physiology and phylogeny [4, 7]. Candidatus
Brownbacteria is defined by four separate bins from
this study and five previously described Uhrbacteria
or unaffiliated members that now form a well-
supported, early branching, monophyletic clade in the
concatenated tree (Fig. 3). Candidatus Hugbacteria,in
contrast, represents a new clade most closely related
to Adlerbacteria, represented by genomes sampled
only within this study (Fig. 3; Additional file 6: Figure
S5; Additional file 10). Aside from these two candi-
date phyla, a number of other deep-branching and
unique lineages at lower phylogenetic levels were also
recovered here (Additional file 6: Figures S2S5),
suggesting that additional CPR diversity remains to be
discovered.
Danczak et al. Microbiome (2017) 5:112 Page 5 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
The CPR potentially catalyze diverse reactions in carbon
and nitrogen cycling
The functional annotation of each CPR bin in our
dataset indicated that our CPR genomes shared meta-
bolic features with prior descriptions [1, 46]. Mirroring
these trends, our CPR genomes appeared to be incapable
of synthesizing most of their own amino acids, lacked
lipopolysaccharide synthesis pathways, relied upon EMP
glycolysis, and lacked a complete tricarboxylic acid cycle.
Interestingly, genes encoding the pentose phosphate
pathway were present and near-complete in many
members of the CPR described here, despite lacking
downstream biosynthetic pathways or energy-capturing
mechanisms. Given that 34 out of 45 of the near-
complete bins were members of the Parcubacteria, the
metabolisms of these 34 genomes were summarized to
illustrate the potential functional capabilities of this
superphylum in the samples studied here (Fig. 4).
Additionally, while many prior studies concluded that
NiFe-hydrogenases were important to the physiology of
some CPR members [14], there appears to be little
evidence for these genes within the genomes recovered
from Ohio aquifers.
While many of the CPR lineages were inferred to lack
electron transport mechanisms, the putative capability
for nitrite reduction (nirK) was found in four separate
Parcubacteria bins from two different phyla, Kaiserbac-
teria and Harrisonbacteria (Fig. 5). This copper-containing
nitrite reductase is responsible for reducing nitrite to
nitric oxide, an important step in denitrification. Although
nirK genes from CPR genomes are present in public data-
bases, these sequences have not been previously analyzed
in a phylogenetic or metabolic context [6, 9]. Sequence
alignments using both CPR nirK and nirK sequences from
a
b
Fig. 2 CPR relative abundance derived from metagenomic rps3 sequences and bin counts. aRelative percentage of mapped rps3 sequence reads
from a given metagenome presented as a stacked bar chart; Inset: a pie chart illustrating the relative percentage of mapped reads for the 0.1-μm
filter from Greene 10/2014. bA count of the number of near-complete quality (NC; > 90% complete; solid colors) and incomplete (IC; 5089%
complete; dashed colors) bins from each metagenomic sample (71 total)
Danczak et al. Microbiome (2017) 5:112 Page 6 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
well-characterized nitrite-reducing microorganisms
revealed that the type 1 and type 2 Cu ligand binding sites
were conserved (Additional file 6: Figure S6; Additional
file 11) [38, 47, 48]. CPR nirK sequences formed an
individual clade, suggesting that potential nitrite reduction
is not a recently acquired function of these CPR genomes
(Fig. 5; Additional file 6: Figure S7; Additional file 12).
Adjacent to this nirK clade, the closest non-CPR sequences
in the tree were from known nitrogen-cycling organisms
such as Candidatus Brocadia and Rhodanobacter.
For nirK to catalyze nitrite reduction, the enzyme
must receive electrons from cellular metabolism. Each of
the nirK-containing CPR genome bins contained a puta-
tive cupredoxin protein, a class of proteins previously
shown to pass electrons onto nirK specifically [47].
These bins also contained a potential means of removing
electrons from reducing equivalents within the cell: the
two Harrisonbacteria encoded an NADH dehydrogenase
II-like (NDH II-like) gene whereas the two Kaiserbac-
teria contained genes for a putative membrane-bound,
six-subunit Nqo-like/hydrogenase similar to sequences
previously reported [1]. This latter gene was originally
annotated as a group 4 NiFe-hydrogenase, but upon
closer inspection, the near-complete two Kaiserbacteria
genome bins did not contain maturation enzymes.
Moreover, while the main subunit did not contain
Ni-binding sites, the presence of NADH-binding sites
suggests that NADH may act as an electron donor to
this complex, unsurprising given the evolutionary rela-
tionship of these subunits to NADH dehydrogenase [49].
Fig. 3 Sixteen-protein concatenated ribosomal tree. Concatenated ribosomal protein tree generated from 16 ribosomal proteins (rpL2,3,4,5,6,
14,15,16,18,22,24,rpS3,8,10,17,19). Each of the major CPR classifications and phyla are provided as text. Each point represents a binned
genome from a given location in Ohio, indicated by point shape and color. Newly proposed phyla are indicated by blue text
Danczak et al. Microbiome (2017) 5:112 Page 7 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
This six-subunit enzyme contains four putative proton-
translocating, transmembrane domains that may offer a
mechanism via which electrons are transported across
the inner membrane to a periplasmic cupredoxin which
could ultimately shuttle electrons to nirK (Fig. 4).
Within the Harrisonbacteria, the NDH-like gene may
function similarly to our proposed Nqo-like model,
although this gene does not contain transmembrane
domains. The linked electron transport components and
the potential for nitrite reduction represent previously
undescribed functions within this radiation.
The CPR have often been broadly characterized as a
group of organisms responsible for generating fermenta-
tion products through the degradation of more complex
carbon substrates [4, 15]. To better understand this
potential role, over 2000 previously described genomes
and the 71 genomes from this study were analyzed for
the presence and count of 135 different glycoside hydro-
lase (GH) families (Fig. 6; Additional file 1: Table S4).
Across all CPR phyla, while the capacity to degrade
broad substrate classes (e.g., amylose, cellulose) was rela-
tively similar, the GH genes encoding these capacities
Fig. 4 Summary of Parcubacteria genomes obtained from Ohio groundwater. A metabolic map summarizing the genomic potential of 34
separate Parcubacteria bins. Arrow thickness represents the frequency a given gene occurs throughout the 34 genomes. Numbers adjacent to
the arrows correspond to the gene represented by the arrow; the legend can be found in the supplemental information. Functions within the
box were not broadly found throughout Ohio and indicate newly described functions. Inset within the metabolic map are the frequencies (as pie
charts) of 12 of the most common glycoside hydrolase families. Colors in the pie chart represent the primary activity of the given GH family:
green typically operates on cellulolytic material, brown is active on alpha-glycosides/starches, purple operates on peptidoglycan/chitin, and red
represents GH families that operate on other carbon substrates
Danczak et al. Microbiome (2017) 5:112 Page 8 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
differed. While nearly all CPR genomes encoded en-
zymes for hemicellulose side-chain degradation, the
exact GH genes responsible differed between phyla. For
example, the Uhrbacteria contain a broad cassette of
different GH families (GH2, GH3, GH10, etc.) while
many other Parcubacteria primarily encode only GH39
and GH74. Additionally, many phyla within the Micro-
genomates appear to contain a wider array of GH family
sequences than other CPR members, indicating a
broader role for these organisms in carbon degradation
(Fig. 6). Certain phyla within the CPR were exceptions
however; KAZAN and Niyogibacteria lack the capacity
for amylose degradation, and CPR2 lack the capacity for
mannose degradation. Through the sampling of CPR
genomes from a new geographic location (Ohio ground-
water), additional functional capacity was uncovered. For
instance, our data revealed the presence of genes
encoding GH1 enzymes within the Harrisonbacteria,
representing a potential metabolic expansion of this
phylum into cellulose degradation (Fig. 6).
RDA analyses of GH profiles for every genome (based
on gene abundance) indicated that the capacity for
carbon degradation within the CPR is complex
(Additional file 6: Figures S8S11). Within the Parcubac-
teria, for example, Lloydbacteria and Yonathbacteria ap-
pear to be differentiated primarily due to their putative
amylose degradation capacity (i.e., GH15 and 119) but
they localize near to Niyogibacteria and Azambacteria
(devoid of amylose-active GH families). This is likely a
result of the shared presence of the chitin-active GH18
(Additional file 6: Figure S8). Outside of the Parcubac-
teria, similar trends can be observed. While genes en-
coding chitin-active GH23 enzymes are present in both
the Doudnabacteria and Berkelbacteria, the presence of
additional cellulolytic GH families in the former repre-
sent a key difference in carbon-processing potential in
Fig. 5 nirK tree. A maximum-likelihood tree of nirK (nitrite reductase) amino acid sequences with bootstrap support indicated by the shaded circles on
the nodes
Danczak et al. Microbiome (2017) 5:112 Page 9 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
this phylum (Additional file 6: Figure S9). Additionally,
GH profiles for many of these CPR phyla occupy a large
ordination space (Additional file 6: Figures S10S11).
Taken together with the general trends outlined above,
this suggests metabolic heterogeneity at finer taxonomic
resolution.
Discussion
The CPR is a recently described expansion of the tree of
life that continues to grow as metagenomic analyses are
performed from different ecosystems [1, 4, 7, 1013, 50].
Despite their diminutive size, members of this radiation
are often abundant members of the microbial commu-
nity in subsurface environments [51, 52]. This trend is
apparent in the samples studied here, with members of
the CPR constituting up to 20% of the overall commu-
nity (Fig. 1b), although these abundances may be under-
estimates due to limitations of broad specificity 16S
rRNA gene primers [4]. An additional factor contribut-
ing to underestimates of CPR abundances is the
common utilization of 0.2-μm filters to retain microbial
biomass when filtering aqueous media. Due to small cell
sizes, CPR microorganisms can easily pass through such
membranes and are only retained when the filter is more
clogged or when a 0.1-μm pore size is used instead [16].
Many of the OTUs obtained during 16S rRNA gene
amplicon sequencing that were present exclusively on
the 0.1 μm were members of the CPR, primarily in the
Fig. 6 Glycoside hydrolase (GH) heat map. A heat map illustrating the presence/absence of various different GH families with putative substrates
illustrated by color.
Danczak et al. Microbiome (2017) 5:112 Page 10 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Athens and Greene locations, mirroring a trend ob-
served in previous studies [16]. Additionally, many of
our CPR genomes (~ 34) were assembled from a 0.1-μm
filter. Although our sampling locations were all ground-
water aquifers, geochemical differences (displayed as
ORP values) were apparent between sites (Fig. 1b), and
these differences were reflected in 16S rRNA gene
datasets (Additional file 6: Figure S1). Samples from the
Greene location were consistently the most oxidized and
contained the most different population of CPR bacteria
relative to samples from the Athens and Licking wells,
suggesting that as-yet unknown functional differences
between CPR members may account for heterogeneity
in the spatial distribution of different taxa.
The CPR represents a large portion of total bacterial
phylogenetic diversity [6, 7]; in addition to capturing
much of this diversity in this survey, we were also able
to expand the CPR through detection of two new phyla,
Candidatus Brownbacteria and Candidatus Hugbacteria
(Fig. 3). Given that these putative new phyla were found
from 71 genomes derived from one ecosystem type (sat-
urated subsurface), our results suggest that continued
studies across a range of environments will reveal
additional diversity within this radiation. Such discovery
is necessary; the spatiotemporal trends we observed
within our 16S rRNA gene amplicon data were not al-
ways recapitulated in binned genomes, likely due to the
wide phylogenetic breadth (but not depth) that our
library of CPR genomes currently accounts for. This
results in a situation whereby we can classify genomes at
the phyla level, but not at lower taxonomic levels (i.e.,
class, order). As more CPR genomes are available for
analysis, these lower taxonomic levels will resolve,
resulting in an improved ability to link genome bins with
environmental niches and observe temporal and spatial
trends in microorganism abundance.
Despite the CPR containing over 70 different phyla, pre-
vious work indicates that there is limited functional diver-
sity within the radiation [4, 14], although some recent
studies have described additional processes catalyzed by
CPR members [12, 13]. The 71 CPR genomes described in
this study span roughly 32 phyla and largely support these
previous conclusions at such a broad phylogenetic scale.
At the genome level, however, functional differences
between organisms are more apparent. Conserved traits
across the Parcubacteria superphylum include a near-
complete pentose phosphate pathway and consistent
patterns of genes involved in glycolysis/gluconeogenesis
(Fig. 4). Differentiating characteristics include the inferred
production of various fermentation end products, ranging
from D-lactate in some organisms to ethanol or acetate in
others. Furthermore, while some genomes encode partial
non-reductive and reductive TCA cycles, no individual
genome has every necessary component.
While the potential for these microorganisms to de-
grade more recalcitrant carbon compounds into labile
substrates is relatively well appreciated, little in-depth
analysis has been performed on their specific capabilities
[15]. To better characterize the carbon-processing
potential encoded with the CPR, we analyzed the glyco-
side hydrolase profiles of our 71 genomes and ~ 2000
previously obtained genomes. By looking at the differences
across these profiles, our analyses revealed functional dif-
ferentiation with regard to carbon processing. Despite
substrate redundancy among many of these families of
enzymes, variations exist in the bonds these proteins may
act upon (i.e., endo- versus exo-acting enzymes). We
suggest that these differences in carbon processing enable
these microorganisms to access a wide range of carbon
substrates and respond to fluctuating geochemical condi-
tions that might influence carbon type and availability.
Supporting this inference, prior studies have revealed
dynamic behavior of different CPR phyla in response to
varying hydrologic and geochemical conditions in a ripar-
ian aquifer [51].
A copper-containing nitrite reductase encoded by nirK
was found in 4 of the 34 high-resolution genomes,
representing an undescribed function within the Parcu-
bacteria that highlights a potential role for these
microorganisms in denitrification. Similarly, recent
studies implicated other members of the Parcubacteria
in the nitrogen cycle [12, 13], although these functions
were within novel genomes rather than widespread
throughout the superphyla. Despite being previously
undescribed in this lineage, nirK not only appears to be
widely dispersed throughout the Parcubacteria and po-
tentially other CPR groups [6, 9] but also appears to be
distinct from previously described sequences, suggesting
that these genes were not obtained by recent horizontal
gene transfer (Fig. 5). Even though the CPR nirK
sequences form a distinct clade, all of the residues ne-
cessary for function were conserved. Furthermore, there
was a potential path for electrons to be transferred from
a putative NDH or NDH-like protein to a cupredoxin by
an unknown means and lastly to the reductase. Overall,
these results suggest that some members of the CPR
may play a previously unknown role in nitrogen cycling
in the subsurface, reducing nitrite to NO, which is
subsequently available for utilization by microorganisms
with additional reductive machinery.
Given that these organisms lack typical energy conser-
vation mechanisms, the potential role of nirK within
these cells is questionable. This gene may function as
means of protection against nitrite in the environment,
given its toxicity [53]. At the single organism level, this
mechanism is unlikely, given that nitric oxide, the result
of nirK activity, is potentially more inhibitory than ni-
trite [54]. Such a role for nirK could be more viable in a
Danczak et al. Microbiome (2017) 5:112 Page 11 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
symbiotic relationship between the Parcubacteria and a
microorganism capable of reducing nitric oxide, thus re-
moving it from the local environment. Such relation-
ships have previously been hypothesized as a mechanism
for members of the CPR to obtain critical amino acids
and other nutrients that are not encoded within their ge-
nomes [5, 9]. An alternative role for nirK is for capturing
energy through the formation of a proton motive force
(PMF) [55]. This function may be possible for the two
Kaiserbacteria in this study given that they encode a pu-
tative proton-translocating hydrogenase and an ATP
synthetase. However, the two Harrisonbacteria described
here lack a proton-translocation mechanism and there-
fore could not generate a PMF from nitrite reduction. In
summary, while some members might be able to capture
energy by nitrite reduction, this is clearly not a universal
function in the superphylum and cannot explain the broad
distribution of the nirK gene. However, these results do
highlight a potentially novel function within a metabolic-
ally limited lineage.
Conclusions
The Candidate Phyla Radiation (CPR) represents a re-
cently discovered portion of the tree of life that spans a
large phylogenetic space that until a few years ago was
known only from marker gene studies [2, 3]. As the radi-
ation grows and we are able to populate phyla with rep-
resentative genomes, we increase our understanding of
different functional roles for CPR microorganisms and
the environmental niches that they might occupy. Such
efforts are critical for accurately placing CPR member
functions in subsurface biogeochemical networks. While
Wrighton et al. [14] linked the fermentative activity of
CPR bacteria with respiratory microorganisms, the ana-
lyses presented here (coupled with increased future sam-
pling of CPR genomes) enable us to better determine
how the CPR members interface with their local envir-
onment through the processing of varied complex car-
bon substrates. Additionally, recent studies (including
this one) have suggested new roles for these microorgan-
isms in nitrogen cycling in the environment. Through
further sampling of subsurface systems, the opportunity
exists to both improve the phylogeny of the CPR and
better understand their biogeochemical role in complex
environmental systems. The seeming ubiquity of these
microorganisms suggests that they are key members of
communities that drive critical elemental cycles across
the globe in a variety of subsurface environments.
Additional files
Additional file 1: Table S1. Table of field measurements taken during well
sampling. Table S2. Summary of metagenomic data. Table S3.Summaryof
bins derived from metagenomic data. Table S4. Glycoside hydrolase (GH)
profiles for over 2000 CPR genomes. Values indicate the number of positive
hits per GH family found within a given genome during H MM alignment.
Table S5. Legend used to identify the genes in Figure 4. (XLSX 934 kb)
Additional file 2: Trimmed MUSCLE alignment of rps3 sequences.
(AFA 911 kb)
Additional file 3: Trimmed SINA alignment of 16S rRNA sequences.
(AFA 841 kb)
Additional file 4: Trimmed MUSCLE alignment of gyrA sequences.
(AFA 1832 kb)
Additional file 5: Trimmed MUSCLE alignment of 16 concatenated
ribosomal proteins. (AFA 1964 kb)
Additional file 6: Figure S1-S11. (PDF 20.0 mb)
Additional file 7: Maximum-likelihood rps3 tree file. (TREE 419 kb)
Additional file 8: Maximum-likelihood 16S rRNA tree file. (TREE 102 kb)
Additional file 9: Maximum-likelihood gyrA tree file. (TREE 294 kb)
Additional file 10: Maximum-likelihood concatenated ribosomal tree
file. (TREE 72 kb)
Additional file 11: Trimmed MUSCLE alignment of nirK sequences
(AFA 32 kb)
Additional file 12: Maximum-likelihood nirK tree file. (TREE 8 kb)
Acknowledgements
Not applicable.
Funding
This work was supported through a grant to MJW from the Ohio Water
Development Authority (#7171).
Availability of data and materials
The data generated during the 16S rRNA gene sequencings can be accessed
at NCBI using accession number SRX2896383. The 71 genomes and
annotated protein sequences are publicly hosted at http://chimera.asc.ohio-
state.edu/Danczak_Genomes_and_Protein_sequences.html.
Authorscontributions
RD helped design the project, carried out 16S rRNA amplicon and bioinformatic
analysis, assisted in collecting samples, and drafted the manuscript. MJ, CK, and
MS assisted in sample collection and coordination. KW assisted in data analysis
and helped draft the manuscript. MW conceived and helped design the study,
aided in data analysis, and helped draft the manuscript. All authors have read
and approved the final manuscript.
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
PublishersNote
Springer Nature remains neutral with regard to jurisdictional claims in
published maps and institutional affiliations.
Author details
1
Department of Microbiology, The Ohio State University, Columbus, OH, USA.
2
School of Earth Sciences, The Ohio State University, Columbus, OH, USA.
3
Ohio Environmental Protection Agency, Columbus, OH, USA.
Danczak et al. Microbiome (2017) 5:112 Page 12 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Received: 11 July 2017 Accepted: 23 August 2017
References
1. Wrighton KC, Thomas BC, Sharon I, Miller CS, Castelle CJ, VerBerkmoes NC,
et al. Fermentation, hydrogen, and sulfur metabolism in multiple
uncultivated bacterial phyla. Science (80-. ). 2012;337:16611665.
2. Hugenholtz P, Pitulle C, Hershberger KL, Pace NR. Novel division level
bacterial diversity in a Yellowstone hot spring novel division level bacterial
diversity in a Yellowstone hot spring. J Bacteriol. 1998;180:36676.
3. Harris JK, Kelley ST, Pace NR. New perspective on uncultured bacterial
phylogenetic division OP11. Appl Environ Microbiol. 2004;70:8459.
Available from: http://www.ncbi.nlm.nih.gov/pubmed/14766563
4. Brown CT, Hug LA, Thomas BC, Sharon I, Castelle CJ, Singh A, et al. Unusual
biology across a group comprising more than 15% of domain bacteria.
Nature. 2015;523:20811. Available from: http://www.nature.com/doifinder/
10.1038/nature14486
5. Kantor RS, Wrighton KC, Handley KM, Sharon I, Hug LA, Castelle CJ, et al.
Small genomes and sparse metabolisms of sediment-associated bacteria
from four candidate phyla. MBio. 2013;4:111.
6. Anantharaman K, Brown CT, Hug LA, Sharon I, Castelle CJ, Probst AJ, et al.
Thousands of microbial genomes shed light on interconnected
biogeochemical processes in an aquifer system. Nat Commun. 2016;7:13219.
Available from: http://www.nature.com/doifinder/10.1038/ncomms13219
7. Hug LA, Baker BJ, Anantharaman K, Brown CT, Probst AJ, Castelle CJ, et al. A
new view of the tree of life. Nat Microbiol. 2016;1:16048. Available from:
http://www.nature.com/articles/nmicrobiol201648
8. Hu P, Tom L, Singh A, Thomas BC, Baker BJ, Piceno YM, et al. Genome-
resolved metagenomic analysis reveals roles for candidate phyla and other
microbial community members in biogeochemical transformations in oil
reservoirs. MBio. 2016;7:e0166915. Available from: http://mbio.asm.org/
lookup/doi/10.1128/mBio.01669-15
9. Nelson WC, Stegen JC. The reduced genomes of Parcubacteria (OD1)
contain signatures of a symbiotic lifestyle. Front Microbiol. 2015;6:114.
Available from: http://journal.frontiersin.org/Article/10.3389/fmicb.2015.
00713/abstract
10. Probst AJ, Castelle CJ, Singh A, Brown CT, Anantharaman K, Sharon I, et al.
Genomic resolution of a cold subsurface aquifer community provides
metabolic insights for novel microbes adapted to high CO
2
concentrations.
Environ Microbiol. 2017;19:45974.
11. Gong J, Qing Y, Guo X, Warren A. Candidatus Sonnebornia yantaiensis,a
member of candidate division OD1, as intracellular bacteria of the ciliated
protist Paramecium bursaria (Ciliophora, Oligohymenophorea). Syst Appl
Microbiol. 2014;37:3541. Available from: https://doi.org/10.1016/j.syapm.
2013.08.007
12. León-Zayas R, Peoples L, Biddle JF, Podell S, Novotny M, Cameron J, et al.
The metabolic potential of the single cell genomes obtained from the
Challenger Deep, Mariana Trench within the candidate superphylum
Parcubacteria (OD1). Environ. Microbiol. 2017; Available from: http://doi.
wiley.com/10.1111/1462-2920.13789
13. Castelle CJ, Brown CT, Thomas BC, Williams KH, Banfield JF. Unusual
respiratory capacity and nitrogen metabolism in a Parcubacterium (OD1) of
the Candidate Phyla Radiation. Sci Rep. 2017;7:40101. Available from: http://
www.nature.com/articles/srep40101
14. Wrighton KC, Castelle CJ, Wilkins MJ, Hug LA, Sharon I, Thomas BC, et al.
Metabolic interdependencies between phylogenetically novel fermenters
and respiratory organisms in an unconfined aquifer. ISME J. 2014;8:145263.
Available from: http://www.nature.com/doifinder/10.1038/ismej.2013.249
15. Solden L, Lloyd K, Wrighton K. The bright side of microbial dark matter: lessons
learned from the uncultivated majority. Curr Opin Microbiol. 2016;31:21726.
Available from: http://linkinghub.elsevier.com/retrieve/pii/S1369527416300558
16. Luef B, Frischkorn KR, Wrighton KC, Holman H-YN, Birarda G, Thomas BC, et al.
Diverse uncultivated ultra-small bacterial cells in groundwater. Nat Commun.
2015;6:6372. Available from: http://www.ncbi.nlm.nih.gov/pubmed/25721682
17. Joshi N, Fass J. Sickle: a sliding-window, adaptive, quality-based trimming
tool for FastQ files. 2011. Available from: https://github.com/najoshi/sickle
18. Peng YY, Leung HCM, Yiu SM, Chin FYL. IDBA-UD: a de novo assembler for
single-cell and metagenomic sequencing data with highly uneven depth.
Bioinformatics. 2012;28:14208. Available from: https://academic.oup.com/
bioinformatics/article-lookup/doi/10.1093/bioinformatics/bts174
19. R Development Core Team. R: a language and environment for statistical
computing team RDC, editor. R Found. Stat. Comput. R Foundation for
Statistical Computing; 2011. p. 409. Available from: http://www.r-project.org
20. Wickham H. ggplot2: elegant graphics for data analysis Springer-Verlag New
York; 2009. Available from: http://ggplot2.org
21. Bray RJ, Curtis JT. An ordination of the upland forest communities of
southern Winsconin. Ecol Monogr. 1957;27:32549.
22. Oksanen J, Blanchet FG, Friendly M, Kindt R, Legendre P, McGlinn D, et al.
vegan: Community Ecology Package. R package version 2.42. 2017.
Available from: https://cran.r-project.org/web/packages/vegan/index.html
23. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat
Methods. 2012;9:3579. Available from: http://www.nature.com/doifinder/10.
1038/nmeth.1923
24. Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately
reconstructing single genomes from complex microbial communities. PeerJ.
2015;3:e1165. Available from: https://peerj.com/articles/1165
25. Wu M, Scott AJ. Phylogenomic analysis of bacterial and archaeal sequences
with AMPHORA2. Bioinformatics. 2012;28:10334. Available from: https://
academic.oup.com/bioinformatics/article-lookup/doi/10.1093/
bioinformatics/bts079
26. Hyatt D, Chen G-L, LoCascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal:
prokaryotic gene recognition and translation initiation site identification.
BMC Bioinformatics. 2010;11:119. Available from: http://www.biomedcentral.
com/1471-2105/11/119
27. Kanehisa M, Goto S. KEGG: Kyoto Encyclopedia of Genes and Genomes.
Nucleic Acids Res. 2000;28:2730. Available from: https://academic.oup.com/
nar/article-lookup/doi/10.1093/nar/28.1.27
28. Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. KEGG as a
reference resource for gene and protein annotation. Nucleic Acids Res.
2016;44:D45762. Available from: https://academic.oup.com/nar/article-
lookup/doi/10.1093/nar/gkv1070
29. Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. KEGG: new
perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res.
2017;45:D35361. Available from: https://academic.oup.com/nar/article-
lookup/doi/10.1093/nar/gkw1092
30. Suzek BE, Wang Y, Huang H, McGarvey PB, Wu CH. UniRef clusters: a
comprehensive and scalable alternative for improving sequence
similarity searches. Bioinformatics. 2015;31:92632. Available from:
https://academic.oup.com/bioinformatics/article-lookup/doi/10.1093/
bioinformatics/btu739
31. Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, et al.
InterProScan: protein domains identifier. Nucleic Acids Res. 2005;33:W116
20. Available from: https://academic.oup.com/nar/article-lookup/doi/10.
1093/nar/gki442
32. Edgar RC. Search and clustering orders of magnitude faster than BLAST.
Bioinformatics. 2010;26:24601. Available from: https://academic.oup.com/
bioinformatics/article-lookup/doi/10.1093/bioinformatics/btq461
33. Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M. KAAS: an automatic
genome annotation and pathway reconstruction server. Nucleic Acids Res.
2007;35:W1825. Available from: https://academic.oup.com/nar/article-
lookup/doi/10.1093/nar/gkm321
34. Eddy S, Wheeler T. HMMER 3.1 2013. Available from: http://hmmer.org/
35. Yin Y, Mao X, Yang J, Chen X, Mao F, Xu Y. dbCAN: a web resource for
automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 2012;
40:W44551. Available from: https://academic.oup.com/nar/article-lookup/
doi/10.1093/nar/gks479
36. Lombard V, Golaconda Ramulu H, Drula E, Coutinho PM, Henrissat B. The
carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res.
2014;42:D4905. Available from: https://academic.oup.com/nar/article-
lookup/doi/10.1093/nar/gkt1178
37. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and
high throughput. Nucleic Acids Res. 2004;32:17927. Available from: https://
academic.oup.com/nar/article-lookup/doi/10.1093/nar/gkh340
38. Li Y, Hodak M, Bernholc J. Enzymatic mechanism of copper-containing
nitrite reductase. Biochemistry. 2015;54:123342. Available from: http://pubs.
acs.org/doi/abs/10.1021/bi5007767
39. Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, et al.
Geneious Basic: an integrated and extendable desktop software platform for
the organization and analysis of sequence data. Bioinformatics. 2012;28:
16479. Available from: https://academic.oup.com/bioinformatics/article-
lookup/doi/10.1093/bioinformatics/bts199
Danczak et al. Microbiome (2017) 5:112 Page 13 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
40. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-
analysis of large phylogenies. Bioinformatics. 2014;30:13123. Available from:
https://academic.oup.com/bioinformatics/article-lookup/doi/10.1093/
bioinformatics/btu033
41. Yu G, Smith D, Zhu H, Guan Y, Lam TT-Y. ggtree: an R package for
visualization and annotation of phylogenetic trees with their covariates and
other associated data. Methods Ecol Evol. 2017;8:2836. Available from:
http://onlinelibrary.wiley.com/doi/10.1111/2041-210X.12628/abstract
42. Nawrocki EP. Structural RNA homology search and alignment using covariance
models. Ph.D. thesis, Washington University in Saint Louis, School of Medicine.
2009. Available from: http://eddylab.org/software/ssu-align/
43. Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, et al. The SILVA
ribosomal RNA gene database project: improved data processing and web-
based tools. Nucleic Acids Res. 2013;41:D5906. Available from: https://
academic.oup.com/nar/article-lookup/doi/10.1093/nar/gks1219
44. Pruesse E, Peplies J, Glöckner FO. SINA: accurate high-throughput multiple
sequence alignment of ribosomal RNA genes. Bioinformatics. 2012;28:1823
9. Available from: https://academic.oup.com/bioinformatics/article-lookup/
doi/10.1093/bioinformatics/bts252
45. Haft DH, Selengut JD, White O. The TIGRFAMs database of protein families.
Nucleic Acids Res. 2003;31:3713. Available from: http://www.ncbi.nlm.nih.
gov/pmc/articles/PMC165575/
46. Yarza P, Yilmaz P, Pruesse E, Glöckner FO, Ludwig W, Schleifer K-H, et al.
Uniting the classification of cultured and uncultured bacteria and archaea
using 16S rRNA gene sequences. Nat Rev Microbiol. 2014;12:63545.
Available from: http://www.nature.com/doifinder/10.1038/nrmicro3330
47. Hira D, Toh H, Migita CT, Okubo H, Nishiyama T, Hattori M, et al. Anammox
organism KSU-1 expresses a NirK-type copper-containing nitrite reductase
instead of a NirS-type with cytochrome cd 1. FEBS Lett. 2012;586:165863.
Available from: https://doi.org/10.1016/j.febslet.2012.04.041
48. Adhikari UK, Rahman MM. Comparative analysis of amino acid composition
in the active site of nirk gene encoding copper-containing nitrite reductase
(CuNiR) in bacterial spp. Comput Biol Chem. 2017;67:10213. Available from:
https://doi.org/10.1016/j.compbiolchem.2016.12.011
49. Peters JW, Schut GJ, Boyd ES, Mulder DW, Shepard EM, Broderick JB, et al.
[FeFe]- and [NiFe]-hydrogenase diversity, mechanism, and maturation.
Biochim. Biophys. Acta - Mol. Cell Res. 2015;1853:135069. Available from:
https://doi.org/10.1016/j.bbamcr.2014.11.021
50. Wrighton KC, Castelle CJ, Varaljay VA, Satagopan S, Brown CT, Wilkins MJ, et
al. RubisCO of a nucleoside pathway known from Archaea is found in
diverse uncultivated phyla in bacteria. ISME J. 2016;10:270214. Available
from: http://www.nature.com/doifinder/10.1038/ismej.2016.53
51. Danczak RE, Yabusaki SB, Williams KH, Fang Y, Hobson C, Wilkins MJ.
Snowmelt induced hydrologic perturbations drive dynamic microbiological
and geochemical behaviors across a shallow riparian aquifer. Front Earth Sci.
2016;4 Available from: http://journal.frontiersin.org/Article/10.3389/feart.2016.
00057/abstract
52. Danczak RE, Sawyer AH, Williams KH, Stegen JC, Hobson C, Wilkins MJ.
Seasonal hyporheic dynamics control coupled microbiology and
geochemistry in Colorado River sediments. J Geophys Res Biogeosci. 2016;
121:297687. Available from: http://doi.wiley.com/10.1002/2016JG003527
53. Bollag J-M, Henninger NM. Effects of nitrite toxicity on soil bacteria under
aerobic and anaerobic conditions. Soil Biol Biochem. 1978;10:37781.
Available from: http://linkinghub.elsevier.com/retrieve/pii/0038071778900615
54. Basaglia M, Toffanin A, Baldan E, Bottegal M, Shapleigh JP, Casella S.
Selenite-reducing capacity of the copper-containing nitrite reductase of
Rhizobium sullae. FEMS Microbiol Lett. 2007;269:12430.
55. Richardson DJ, Cole JA. Respiration of nitrate and nitrite. EcoSal Plus. 2008;3
Available from: http://www.asmscience.org/content/journal/ecosalplus/10.
1128/ecosal.3.2.5
We accept pre-submission inquiries
Our selector tool helps you to find the most relevant journal
We provide round the clock customer support
Convenient online submission
Thorough peer review
Inclusion in PubMed and all major indexing services
Maximum visibility for your research
Submit your manuscript at
www.biomedcentral.com/submit
Submit your next manuscript to BioMed Central
and we will help you at every step:
Danczak et al. Microbiome (2017) 5:112 Page 14 of 14
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
1.
2.
3.
4.
5.
6.
Terms and Conditions
Springer Nature journal content, brought to you courtesy of Springer Nature Customer Service Center GmbH (“Springer Nature”).
Springer Nature supports a reasonable amount of sharing of research papers by authors, subscribers and authorised users (“Users”), for small-
scale personal, non-commercial use provided that all copyright, trade and service marks and other proprietary notices are maintained. By
accessing, sharing, receiving or otherwise using the Springer Nature journal content you agree to these terms of use (“Terms”). For these
purposes, Springer Nature considers academic use (by researchers and students) to be non-commercial.
These Terms are supplementary and will apply in addition to any applicable website terms and conditions, a relevant site licence or a personal
subscription. These Terms will prevail over any conflict or ambiguity with regards to the relevant terms, a site licence or a personal subscription
(to the extent of the conflict or ambiguity only). For Creative Commons-licensed articles, the terms of the Creative Commons license used will
apply.
We collect and use personal data to provide access to the Springer Nature journal content. We may also use these personal data internally within
ResearchGate and Springer Nature and as agreed share it, in an anonymised way, for purposes of tracking, analysis and reporting. We will not
otherwise disclose your personal data outside the ResearchGate or the Springer Nature group of companies unless we have your permission as
detailed in the Privacy Policy.
While Users may use the Springer Nature journal content for small scale, personal non-commercial use, it is important to note that Users may
not:
use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access
control;
use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is
otherwise unlawful;
falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in
writing;
use bots or other automated methods to access the content or redirect messages
override any security feature or exclusionary protocol; or
share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal
content.
In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,
royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal
content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any
other, institutional repository.
These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or
content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature
may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.
To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied
with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,
including merchantability or fitness for any particular purpose.
Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed
from third parties.
If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not
expressly permitted by these Terms, please contact Springer Nature at
onlineservice@springernature.com

Supplementary resources (12)

... We speculate that its simple metabolic pathways and low-energy attachment mechanism facilitate adaptation to the optimal co-culture system; this also explains the increase in the relative abundance of Patescibacteria over the 3 d co-culture period. Similarly, many studies have shown that the relative abundance of ultra-small Patescibacteria lineages frequently exceed those of other bacteria in diversity surveys [63][64][65][66][67]. Proteobacteria was the second largest phylum of hydrogenogenic CO oxidizers [68], and the decrease in Proteobacteria after 3 days allowed the scallops and seaweed to maintain a good physiological state in the optimal co-culture system. ...
Article
Full-text available
Seaweeds are critically important for the maintenance of biodiversity in marine aquaculture ecosystems, as they can inhibit the growth of Vibrio. Here, we determined the optimal environmental parameters for co-culturing green macroalgae (Ulva pertusa) and red macroalgae (Gracilariopsis lemaneiformis) with Chinese scallop (Chlamys farreri) by measuring dissolved oxygen (DO), pH, and the strength of Vibrio inhibition under laboratory conditions and validating the effectiveness of this optimal co-culture system from the perspectives of nutrient levels, enzyme activities, and microbial diversity. The results show that co-culturing 30 g of seaweed and three scallops in 6 L of seawater with aeration in the dark (1.25 L min⁻¹, 12:12 h L:D) significantly decreased the number and abundance of Vibrio after 3 days. The activities of superoxide dismutase, catalase, pyruvate kinase, and lactate dehydrogenase in C. farreri were significantly higher, indicating that its immune defense and metabolism enhanced in this optimal co-culture system. High DO and pH levels significantly decreased the alpha diversity of microorganisms, and the abundance of pathogenic microorganisms decreased. The optimal co-culture system was effective for the control of vibriosis. Generally, our findings suggest that seaweeds could be used to enhance the aquaculture environment by conferring healthy and sustainable functions in the future.
Article
The functional roles of bacterial symbionts associated with microalgae remain understudied despite the importance of microalgae in biotechnology and environmental microbiology. 16S rRNA gene sequencing was conducted to analyze bacterial communities associated with two microalgae optimized for growth with flue gas containing 5%–10% CO 2 . Two dominant bacteria with no taxonomic classification beyond the class level (Paceibacteria) were discovered repeatedly in the most productive algal cultures. Long-read metagenomic sequencing was conducted to yield high-quality metagenomes, from which two novel species were discovered under the Seqcode (seqco.de/r:ywe1blo2), Phycocordibacter aenigmaticus gen. nov. sp. nov. and Minusculum obligatum gen. nov. sp. nov. The genus Phycocordibacter gen. nov. was proposed as the nomenclatural type of the family Phycocordibacteraceae fam. nov. and the order Phycocordibacterales ord. nov. Both bacteria possessed features typical of Patescibacteria such as reduced genomes (<800 kbp), lack of complete glycolysis and tricarboxylic acid (TCA) cycle pathways, and inability to synthesize amino acids. Instead, they rely on the reductive pentose phosphate pathway (Calvin cycle) for essential biosynthesis and redox balance. P. aenigmaticus may also rely on elemental sulfur oxidation ( sdo ), partial nitrite reduction ( nirK ), and sulfur-related amino acid metabolism (SAMe → SAH). Both bacteria were found in high relative abundance in cultures of Tetradesmus obliquus HTB1 (freshwater) and Nannochloropsis oceanica IMET1 (marine), suggesting a tight association with microalgae in various environments. The absence of full metabolic pathways for energy production suggests extreme metabolic limitations and obligate symbiosis, most likely with other bacteria associated with the microalgae. IMPORTANCE To our knowledge, this is the first report of Patescibacteria as dominant bacteria associated with microalgae or within a biologically mediated carbon capture system. Two novel Patescibacteria were found in two ecologically distinct microalgal cultures (one freshwater strain and one marine) regardless of whether the cultures were bubbled with air, 5% CO 2 , or 10% CO 2 . This unexpected and unprecedented dominance led to long-read sequencing and the assembly of high-quality metagenomes for both Patescibacteria, as well as five other bacteria in the system. The discovery of two novel species belonging to two novel genera, one novel family, and one novel order has enabled us to fill in gaps of a major, uncharacterized branch within the bacterial tree of life. Additionally, the extreme gene loss found in both Patescibacteria, Phycocordibacter aenigmaticus and Minusculum obligatum , contributes knowledge to a rapidly advancing body of research on the scavenging metabolic nature of this enigmatic and largely unclassified phylum.
Preprint
Full-text available
Saccharibacteria are episymbionts that require host-bacteria to grow. They are positively associated with inflammatory diseases within the human microbiome, yet their mechanisms for interacting with the human host and contributing to diseases remain unknown. This study investigated interactions between a Saccharibacterium ( Nanosynbacter lyticus ), its host-bacteria ( Schaalia odontolytica ), and oral epithelial cells. The host-bacteria induced proinflammatory cytokines in epithelial cells, while Saccharibacteria were immune silent. Remarkably, Saccharibacteria dampened cytokine responses to host-bacteria during coinfection. This effect was driven by Saccharibacteria-induced clustering of TLR2 receptors, a process likely facilitated by type IV, ultimately leading to reduced TLR2-mediated cytokine signalling. High resolution imaging showed that Saccharibacteria were endocytosed by oral epithelial cells, and colocalized with endosome markers, eventually trafficking to lysosomes. Moreover, a subset of the Saccharibacteria survive endocytosis long-term, and retains their capability to reinfect host-bacteria, highlighting a mechanism for persistence in the oral microbiome and a vital role in mammalian immune system modulation.
Preprint
Full-text available
Groundwater ecosystems harbor diverse microbial communities adapted to energy-limited, light-deprived conditions, yet the role of viruses in these environments remains poorly understood. Here, we analyzed 1.26 terabases of metagenomic and metatranscriptomic data from seven wells in the Hainich Critical Zone Exploratory (CZE) to characterize groundwater viromes. We identified 257,252 viral operational taxonomic units (vOTUs) (>=5 kb), with 99% classified as novel, highlighting extensive uncharacterized viral diversity. Viruses exhibited a distinct host range, primarily targeting Proteobacteria, Candidate Phyla Radiation (CPR) bacteria, and DPANN archaea. Notably, CPR lineages displayed low virus-host ratios and viral CRISPR targeting multiple hosts, suggesting a virus decoy mechanism where they may absorb viral pressure, protecting bacteria hosts. Additionally, 3,378 vOTUs encoded auxiliary metabolic genes (AMGs) linked to carbon, nitrogen, and sulfur cycling, with viruses targeting 31.5% of host metabolic modules. These findings demonstrate viruses influence on microbial metabolic reprogramming and nutrient cycling in groundwater, shaping subsurface biogeochemistry.
Preprint
Full-text available
Background : The Candidate Phyla Radiation (CPR) comprises a widespread but poorly understood group of bacteria with limited cultured representatives, largely due to their metabolic dependencies on microbial hosts. In laboratory incubations, CPR often decline sharply in relative abundance, even when samples originate from natural environments where they dominate, such as groundwater, where they can represent over 50% of the microbiome. Suitable enrichment conditions and host interactions remain poorly defined. Results : Here, we analyzed 16S rRNA gene amplicon data from 397 groundwater incubation samples across 31 treatments, including 22 under oxic conditions, to identify factors that promote CPR survival and growth. Despite an initial decline, CPR abundances recovered over longer incubation times, reaching up to 11-30% of the microbial community. In total, we detected 1,410 CPR amplicon sequence variants (ASVs), spanning six major CPR classes commonly found in groundwater. Enrichment success was treatment-specific: Cand. Saccharimonadia dominated in incubations with polysaccharides (up to 31.4%), while Cand. Parcubacteria were enriched (>23%) in treatments stimulating methylotrophs and autotrophs. ASV-specific growth rates based on quantitative PCR showed that some CPR doubled within 1-2 days, comparable to faster-growing non-CPR groundwater bacteria, while most CPR had doubling times around 15 days. Strikingly, CPR reached higher absolute abundances under oxic conditions than under anoxic conditions. Metabolic network analysis based on metagenome-assembled genomes revealed that up to 62% of annotated genes were associated with functions linked to oxic conditions. In fact, 25 CPR genomes encoded enzymes that directly utilize oxygen, challenging the long-standing view of CPR as strictly anaerobic, fermentative organisms. Conclusions : Our findings demonstrate that diverse CPR lineages not only survive but actively grow in groundwater incubations, even under oxic conditions. The discovery of genes for oxygen-dependent reactions and substantial CPR enrichment in oxic treatments reveals unexpected metabolic flexibility, helping to explain their persistence and ecological success across a wide range of environments.
Article
Full-text available
Recent genomic surveys have uncovered candidate phyla radiation (CPR) bacteria and DPANN archaea as major microbial dark matter lineages in various anoxic habitats. Despite their extraordinary diversity, the biogeographic patterns and ecological implications of these ultra-small and putatively symbiotic microorganisms have remained elusive. Here, we performed metagenomic sequencing on 90 geochemically diverse acid mine drainage sediments sampled across southeast China and recovered 282 CPR and 189 DPANN nonredundant metagenome-assembled genomes, which collectively account for up to 28.6% and 31.2% of the indigenous prokaryotic communities, respectively. We found that, remarkably, geographic distance represents the primary factor driving the large-scale ecological distribution of both CPR and DPANN organisms, followed by pH and Fe. Although both groups might be capable of iron reduction through a flavin-based extracellular electron transfer mechanism, significant differences are found in their metabolic capabilities (with complex carbon degradation and chitin degradation being more prevalent in CPR whereas fermentation and acetate production being enriched in DPANN), indicating potential niche differentiation. Predicted hosts are mainly Acidobacteriota, Bacteroidota, and Proteobacteria for CPR and Thermoplasmatota for DPANN, and extensive, unbalanced metabolic exchanges between these symbionts and putative hosts are displayed. Together, our results provide initial insights into the complex interplays between the two lineages and their physicochemical environments and host populations at a large geographic scale. IMPORTANCE Candidate phyla radiation (CPR) bacteria and DPANN archaea constitute a significant fraction of Earth’s prokaryotic diversity. Despite their ubiquity and abundance, especially in anoxic habitats, we know little about the community patterns and ecological drivers of these ultra-small, putatively episymbiotic microorganisms across geographic ranges. This study is facilitated by a large collection of CPR and DPANN metagenome-assembled genomes recovered from the metagenomes of 90 sediments sampled from geochemically diverse acid mine drainage (AMD) environments across southeast China. Our comprehensive analyses have allowed first insights into the biogeographic patterns and functional differentiation of these major enigmatic prokaryotic groups in the AMD model system.
Article
“Candidate Phyla Radiation” (CPR) bacteria, representing ~15 % of bacterial diversity and over 70 phyla, are extremely small bacteria that primarily survive in parasitic or symbiotic forms. CPR bacteria, including Candidatus Brownbacteria, Candidatus Hugbacteria, and Candidatus Saccharibacteria (formerly TM7), were first identified in humans in 2007. They are linked to the microbiota of healthy and diseased individuals, being present in the oral cavity, gastrointestinal, and reproductive tracts. CPR bacteria, such as Saccharibacteria, are associated with dysbiotic conditions like periodontitis and can act as pathogens and potential protectors against inflammatory damage caused by host-associated bacteria.This study aimed to assess the effect of a placebo on gut Saccharibacteria in healthy Armenian women and those with Familial Mediterranean Fever (FMF) disease, a condition with high prevalence in Armenia and often associated with oral microbiota disturbances. Stool samples were analyzed using a culture-independent, high-density DNA microarray method, and statistical analyses were performed with Multibase 2015 Excel Add-in program (NumericalDynamics, Tokyo, Japan). Results indicate that Saccharibacteria respond variably to placebo depending on health status, with some showing significant quantitative or qualitative changes while others remained unchanged. In conclusion, this study confirms the presence of CPR bacteria in the gut microbiota of both healthy women and those with FMF. The distinct responses of intestinal CPR bacteria to placebo highlight the importance of placebo-controlled trials in microbiota research. Furthermore, the findings emphasize the potential role of Saccharibacteria in gut-brain processes and their implications in health and disease.
Article
Background: For many years, scientists have accepted Darwin's conclusion that "Survival of the Fittest" involves successful competition with other organisms for life-endowing molecules and conditions. Summary: Newly discovered "partial" organisms with minimal genomes that require symbiotic or parasitic relationships for growth and reproduction suggest that cooperation in addition to competition was and still is a primary driving force for survival. These two phenomena are not mutually exclusive, and both can confer a competitive advantage for survival. In fact, cooperation may have been more important in the early evolution for life on Earth before autonomous organisms developed, becoming large genome organisms. Key messages: This suggestion has tremendous consequences with respect to our conception of the early evolution of life on Earth as well as the appearance of intercellular interactions, multicellularity and the nature of interactions between humans and their societies (e.g., Social Darwinism).
Article
Full-text available
Background Candidate Phyla Radiation (CPR) is a large monophyletic group encompassing about 25% of bacterial diversity. Among CPR, “Candidatus Saccharibacteria” is one of the most clinically relevant phyla. Indeed, it is enriched in the oral microbiota of subjects suffering from immune-mediated disorders and it has been found to have immunomodulatory activities. For these reasons, it is crucial to have reliable methods to detect and quantify this bacterial lineage in human samples, including saliva. Methods and results Four qPCR protocols for quantifying “Ca. Saccharibacteria” (one targeting the 23S rRNA gene and three the 16S) were tested and compared. The efficiency and coverage of these four protocols were evaluated in silico on large genomic datasets, and in vitro on salivary DNA samples, already characterized by amplicon sequencing on the V3-V4 regions of the 16S rRNA. In silico PCR analyses showed that all qPCR primers lose part of the “Ca. Saccharibacteria” genetic variability, even if the 23S qPCR primers matched more lineages than the 16S qPCR primers. In vitro qPCR experiments confirmed that all 16S-based protocols strongly underestimated “Ca. Saccharibacteria” in salivary DNA, while the 23S qPCR protocol gave quantifications more comparable to 16S amplicon sequencing. Conclusion Overall, our results show that the 23S-based qPCR protocol is more precise than the 16S-based ones in quantifying “Ca. Saccharibacteria”, although all protocols probably underestimate specific lineages. These results underline the current limits in quantifying “Ca. Saccharibacteria”, highlighting the needs for novel experimental strategies or methods. Indeed, the underestimation of “Ca. Saccharibacteria” in clinical samples could hide its role in human health and in the development of immune-mediated diseases.
Article
Full-text available
Candidate phyla (CP) are broad phylogenetic clusters of organisms that lack cultured representatives. Included in this fraction is the candidate Parcubacteria superphylum. Specific characteristics that have been ascribed to the Parcubacteria include reduced genome size, limited metabolic potential, and exclusive reliance on fermentation for energy acquisition. The study of new environmental niches, such as the marine versus terrestrial subsurface, often expands the understanding of the genetic potential of taxonomic groups. For this reason we analyzed twelve Parcubacteria single amplified genomes (SAGs) from sediment samples collected within the Challenger Deep of the Mariana Trench, obtained during the Deepsea Challenge (DSC) Expedition. Many of these SAGs are closely related to environmental sequences obtained from deep-sea environments based on 16S rRNA gene similarity and BLAST matches to predicted proteins. DSC SAGs encode features not previously identified in Parcubacteria obtained from other habitats. These include adaptation to oxidative stress, polysaccharide modification, and genes associated with respiratory nitrate reduction. The DSC SAGs are also distinguished by relative greater abundance of genes for nucleotide and amino acid biosynthesis, repair of alkylated DNA and the synthesis of mechanosensitive ion channels. These results present an expanded view of the Parcubacteria, among members residing in an ultra-deep hadal environment. This article is protected by copyright. All rights reserved.
Article
Full-text available
The Candidate Phyla Radiation (CPR) is a large group of bacteria, the scale of which approaches that of all other bacteria. CPR organisms are inferred to depend on other community members for many basic cellular building blocks and all appear to be obligate anaerobes. To date, there has been no evidence for any significant respiratory capacity in an organism from this radiation. Here we report a curated draft genome for ‘Candidatus Parcunitrobacter nitroensis’ a member of the Parcubacteria (OD1) superphylum of the CPR. The genome encodes versatile energy pathways, including fermentative and respiratory capacities, nitrogen and fatty acid metabolism, as well as the first complete electron transport chain described for a member of the CPR. The sequences of all of these enzymes are highly divergent from sequences found in other organisms, suggesting that these capacities were not recently acquired from non-CPR organisms. Although the wide respiration-based repertoire points to a different lifestyle compared to other CPR bacteria, we predict similar obligate dependence on other organisms or the microbial community. The results substantially expand the known metabolic potential of CPR bacteria, although sequence comparisons indicate that these capacities are very rare in members of this radiation.
Article
Full-text available
KEGG (http://www.kegg.jp/ or http://www.genome.jp/kegg/) is an encyclopedia of genes and genomes. Assigning functional meanings to genes and genomes both at the molecular and higher levels is the primary objective of the KEGG database project. Molecular-level functions are stored in the KO (KEGG Orthology) database, where each KO is defined as a functional ortholog of genes and proteins. Higher-level functions are represented by networks of molecular interactions, reactions and relations in the forms of KEGG pathway maps, BRITE hierarchies and KEGG modules. In the past the KO database was developed for the purpose of defining nodes of molecular networks, but now the content has been expanded and the quality improved irrespective of whether or not the KOs appear in the three molecular network databases. The newly introduced addendum category of the GENES database is a collection of individual proteins whose functions are experimentally characterized and from which an increasing number of KOs are defined. Furthermore, the DISEASE and DRUG databases have been improved by systematic analysis of drug labels for better integration of diseases and drugs with the KEGG molecular networks. KEGG is moving towards becoming a comprehensive knowledge base for both functional interpretation and practical application of genomic information.
Article
Full-text available
The subterranean world hosts up to one-fifth of all biomass, including microbial communities that drive transformations central to Earth's biogeochemical cycles. However, little is known about how complex microbial communities in such environments are structured, and how inter-organism interactions shape ecosystem function. Here we apply terabase-scale cultivation-independent metagenomics to aquifer sediments and groundwater, and reconstruct 2,540 draft-quality, near-complete and complete strain-resolved genomes that represent the majority of known bacterial phyla as well as 47 newly discovered phylum-level lineages. Metabolic analyses spanning this vast phylogenetic diversity and representing up to 36% of organisms detected in the system are used to document the distribution of pathways in coexisting organisms. Consistent with prior findings indicating metabolic handoffs in simple consortia, we find that few organisms within the community can conduct multiple sequential redox transformations. As environmental conditions change, different assemblages of organisms are selected for, altering linkages among the major biogeochemical cycles.
Article
Full-text available
We present an r package, ggtree , which provides programmable visualization and annotation of phylogenetic trees. ggtree can read more tree file formats than other softwares, including newick , nexus , NHX , phylip and jplace formats, and support visualization of phylo, multiphylo, phylo4, phylo4d, obkdata and phyloseq tree objects defined in other r packages. It can also extract the tree/branch/node‐specific and other data from the analysis outputs of beast , epa , hyphy , paml , phylodog , pplacer , r8s , raxml and revbayes software, and allows using these data to annotate the tree. The package allows colouring and annotation of a tree by numerical/categorical node attributes, manipulating a tree by rotating, collapsing and zooming out clades, highlighting user selected clades or operational taxonomic units and exploration of a large tree by zooming into a selected portion. A two‐dimensional tree can be drawn by scaling the tree width based on an attribute of the nodes. A tree can be annotated with an associated numerical matrix (as a heat map), multiple sequence alignment, subplots or silhouette images. The package ggtree is released under the artistic‐2.0 license . The source code and documents are freely available through bioconductor ( http://www.bioconductor.org/packages/ggtree ).
Article
The nirk gene encoding the copper-containing nitrite reductase (CuNiR), a key catalytic enzyme in the environmental denitrification process that helps to produce nitric oxide from nitrite. The molecular mechanism of denitrification process is definitely complex and in this case a theoretical investigation has been conducted to know the sequence information and amino acid composition of the active site of CuNiR enzyme using various Bioinformatics tools. 10 Fasta formatted sequences were retrieved from the NCBI database and the domain and disordered regions identification and phylogenetic analyses were done on these sequences. The comparative modeling of protein was performed through Modeller 9v14 program and visualized by PyMOL tools. Validated protein models were deposited in the Protein Model Database (PMDB) (PMDB id: PM0080150 to PM0080159). Active sites of nirk encoding CuNiR enzyme were identified by Castp server. The PROCHECK showed significant scores for four protein models in the most favored regions of the Ramachandran plot. Active sites and cavities prediction exhibited that the amino acid, namely Glycine, Alanine, Histidine, Aspartic acid, Glutamic acid, Threonine, and Glutamine were common in four predicted protein models. The present in silico study anticipates that active site analyses result will pave the way for further research on the complex denitrification mechanism of the selected species in the experimental laboratory
Article
Riverbed microbial communities play an oversized role in many watershed ecosystem functions, including the processing of organic carbon, cycling of nitrogen, and alterations to metal mobility. The structure and activity of microbial assemblages depends in part on geochemical conditions set by river-groundwater exchange, or hyporheic exchange. To assess how seasonal changes in river-groundwater mixing affect these populations in a snowmelt-dominated fluvial system, vertical sediment and pore water profiles were sampled at three time points at one location in the hyporheic zone of the Colorado River and analyzed using geochemical measurements, 16S rRNA gene sequencing, and ecological modeling. Oxic river water penetrated deepest into the subsurface during peak river discharge, while under base-flow conditions, anoxic groundwater dominated shallower depths. Over a 70-cm thick interval, riverbed sediments were therefore exposed to seasonally fluctuating redox conditions and hosted microbial populations statistically different from those at both shallower and deeper locations. Additionally, microbial populations within this zone were shown to be the most dynamic across sampling time points, underlining the critical role that hyporheic mixing plays in constraining microbial abundances. Given such mixing effects, we anticipate that future changes in river discharge in mountainous, semi-arid western US watersheds may affect microbial community structure and function in riverbed environments, with potential implications for biogeochemical processes in riparian regions.