Experimental parameters defining ultra-low biomass bioaerosol analysis

Authors:
  • Berkeley Education Alliances for Research in Singapore

Abstract

Investigation of the microbial ecology of terrestrial, aquatic and atmospheric ecosystems requires specific sampling and analytical technologies, owing to vastly different biomass densities typically encountered. In particular, the ultra-low biomass nature of air presents an inherent analytical challenge that is confounded by temporal fluctuations in community structure. Our ultra-low biomass pipeline advances the field of bioaerosol research by significantly reducing sampling times from days/weeks/months to minutes/hours, while maintaining the ability to perform species-level identification through direct metagenomic sequencing. The study further addresses all experimental factors contributing to analysis outcome, such as amassment, storage and extraction, as well as factors that impact on nucleic acid analysis. Quantity and quality of nucleic acid extracts from each optimisation step are evaluated using fluorometry, qPCR and sequencing. Both metagenomics and marker gene amplification-based (16S and ITS) sequencing are assessed with regard to their taxonomic resolution and inter-comparability. The pipeline is robust across a wide range of climatic settings, ranging from arctic to desert to tropical environments. Ultimately, the pipeline can be adapted to environmental settings, such as dust and surfaces, which also require ultra-low biomass analytics.
Irvan Luhung¹,³, Akira Uchida¹,³, Serene B. Y. Lim¹, Nicolas E. Gaultier¹, Carmon Kee¹, Kenny J. X. Lau¹, Elena S. Gusareva¹, Cassie E. Heinle¹, Anthony Wong¹, Balakrishnan N. V. Premkrishnan¹, Rikky W. Purbojati¹, Enzo Acerbi¹, Hie Lim Kim¹, Ana C. M. Junqueira¹,², Sharon Longford¹, Sachin R. Lohar¹, Zhei Hwee Yap¹, Deepa Panicker¹, Yanqing Koh¹, Kavita K. Kushwaha¹, Poh Nee Ang¹, Alexander Putra¹, Daniela I. Drautz-Moses¹ and Stephan C. Schuster¹
npj Biofilms and Microbiomes (2021) 7:37 ; https://doi.org/10.1038/s41522-021-00209-4
INTRODUCTION
Great naturalists of centuries past have catalogued planetary ecosystems at the macroscopic level, primarily for terrestrial and aquatic environments, where organisms were most accessible¹,². Microscopic life was subsequently given the same attention, again initially focusing on terrestrial and aquatic systems³,⁴. Microbial inhabitants of the third ecosystem of planetary scale, the atmosphere, proved much more difficult to assess due to technological challenges in regard to accessibility. These challenges are largely associated with the low-density gaseous state and resulting ultra-low biomass of air⁵⁻⁷. As a consequence, atmospheric research first described the physicochemical nature of the atmosphere, thereby generating a comprehensive understanding of inanimate components of the troposphere and stratosphere⁸. The origin of these components of air is typically categorised as either inorganic gases or volatile organic compounds (VOCs), the latter of which serve as proxies for the biological activity of organisms⁹,¹⁰. The following progression in the field involved the identification of airborne organisms via cultivation and microscopy¹¹,¹². This provided a foundation for understanding the composition of airborne microbial organisms via nucleic acid taxonomic identification. A large increase of the taxonomic resolution was subsequently achieved by the use of ITS and 16S rRNA gene markers. The ultra-low biomass nature of air posed major technical obstacles to using these molecular techniques, with inherent requirements such as long sampling duration and high amounts of gene marker amplification¹³⁻¹⁸.
The nascent field of bioaerosol studies was further progressed by employing metagenomics, which enabled direct nucleic acid analysis without the biases associated with gene amplification. However, to overcome issues associated with limited biomass, long sampling duration times (days to weeks) were unavoidable, which in turn impeded the temporal resolution and the number of required samples analysed¹⁹⁻²¹.
Advances in temporal and taxonomic resolution only became possible with the onset of new technologies involving high volumetric flow rate air samplers coupled with metagenomic data generated by next-generation sequencing platforms that had low biomass requirements²². This approach, which analyses the accessible spectrum of airborne community DNA, therefore enables assessment of the functional complement of airborne microorganisms.
Here, we detail optimisation of multiple stages of an ultra-low
biomass analysis pipeline for air samples, which can also be tailored
to studies of similarly ultra-low biomass environments such as dust
and surfaces. The versatility and robustness of the presented
pipeline enable analysis of a wide range of environmental settings,
both indoor and outdoor, encompassing a wide scope of climatic
settings including tropical, temperate, desert and arctic regions.
RESULTS
Environmental samples: soil, water, air
Ecosystems and habitats are highly variable and complex, and hence a universal approach is not always applicable. Using DNA concentration as a proxy, terrestrial, aquatic and atmospheric ecosystems can harbour up to a six-log difference in microbial biomass (Fig. 1a). This results in vastly different sampling requirements and volumes for molecular analysis (Fig. 1b). In addition, biomass concentrations might follow cyclic processes resulting in density fluctuations, as shown in marine environments²³ as well as atmospheric environments where higher bioaerosol concentrations are typically observed at night²² (Fig. 1c) or during haze events²⁴.

¹ Singapore Centre for Environmental Life Sciences Engineering (SCELSE), Nanyang Technological University, Singapore, Singapore. ² Present address: Departamento de Genética, Instituto de Biologia, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-590, Brazil. ³ These authors contributed equally: Irvan Luhung, Akira Uchida. email: scschuster@ntu.edu.sg
www.nature.com/npjbiofilms
Published in partnership with Nanyang Technological University

To address the challenges in analysing a wide range of biomass concentrations at different spatial and temporal settings, we developed a robust ultra-low biomass pipeline, comprising the four stages of amassment, storage, extraction and nucleic acid analysis. Parameters that impact upon the pipeline's efficacy were investigated, with the aim of enabling customisation (Fig. 1d). The summarised results are displayed in Fig. 2. The subsequent sections detail each investigated parameter individually.
Amassment
The ultimate success of sequencing and PCR-based analyses rests on sufficient quantities of nucleic acids being amassed, which for air sampling is a trade-off between sampling flow rate and sampling duration. While this study uses a filter-based sampler, other types of air samplers, such as liquid impingers, serve a similar function and produce comparable results (Supplementary Fig. 8)²⁵. For our purpose, ideal air samplers should be portable, battery-powered and have an acceptable noise emission (~50 dB).
The air sampling flow rate and duration were optimised to improve the temporal resolution of each sample from days, weeks or months to hours or even minutes, while still maintaining maximal taxonomic resolution. This was achieved by evaluating how these two factors directly impact the DNA quantity and metagenomic profile of the sample. Using a flow rate of 300 L/min, the minimal required sampling duration was investigated using different time-based sampling regimes (Fig. 3a). Sampling duration was segmented into sequentially doubling time intervals. For example, the first and second 15-min intervals (5:00–5:15 am and 5:15–5:30 am) were individually analysed and compared to a 30-min sample (5:00–5:30 am) taken in parallel. This process was undertaken for 15, 30 and 60-min intervals with a final sampling duration of up to 180 min. Quantitative analysis showed consistently increasing DNA yields as a function of sampling duration (Fig. 2a–c). No notable loss of DNA yield was observed within the tested range of duration (15 min–3 h). Within this range, combining two successive time segments resulted in DNA quantities similar to those of a single time segment of the combined duration, as quantified using Qubit and qPCR (Fig. 3b, Supplementary Fig. 1). Within the three investigated intervals (three duration groups each for Qubit, bacterial and fungal qPCR), the differences averaged 25%, with a median of 18%. Importantly, the microbial taxonomic profiles from comparable time intervals were not affected. This is demonstrated by the shift in relative abundances of taxa, such as Kocuria palustris and Leifsonia xyli, between the two subsequent 15-min samples (Fig. 3c). Averaging these species compositions from the two subsequent 15-min filter samples resulted in abundances that mirror those of the 30-min sample collected in parallel (Fig. 3d). This was consistent across all sampling duration regimes with three replicates each (Bray–Curtis and Jaccard p > 0.05).
The second experiment examined the impact of the air flow rate and the total volume of air sampled. With the sampling duration set at 2 h, airflow was varied between 100, 200 and 300 L/min, resulting in total air volumes of 12, 24 and 36 m³, respectively. The DNA yield and copy number of marker genes (16S and 18S rRNAs) increased as a function of air volume sampled (Fig. 2d–f). However, DNA concentration normalised per air volume diminished by up to 20% when the flow rate was increased from 100 to 300 L/min (Supplementary Fig. 2a). The diminishing return of amassment is likely due to decreasing particle retention efficiency at higher flow rates for extended periods of time²⁶. For the purpose of this study, optimal sampling efficiency is forfeited in favour of higher flow rates (300 L/min) because the gain in total biomass collected per unit of time still outweighs the decrease in amassment efficiency. This enables measurements with higher time resolution within a day for environmental time-series studies. The biological significance of this was demonstrated by the discovery of diel dynamics of outdoor airborne microbial communities²².
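The arithmetic behind this trade-off can be sketched in a few lines of Python. The function names are illustrative, and the 20% figure is used as the worst case quoted above ("up to 20%"), so the result is a lower bound on the gain rather than a measured value.

```python
def sampled_volume_m3(flow_l_per_min, duration_min):
    """Total air volume in cubic metres (1 m^3 = 1000 L)."""
    return flow_l_per_min * duration_min / 1000.0


def relative_collection_rate(flow, ref_flow, per_volume_loss):
    """Biomass collected per unit time at `flow`, relative to `ref_flow`,
    when the yield per m^3 drops by `per_volume_loss` (a fraction)."""
    return (flow / ref_flow) * (1.0 - per_volume_loss)


# Air volumes for the three tested flow rates over 2 h (120 min).
for flow in (100, 200, 300):
    print(flow, "L/min for 2 h ->", sampled_volume_m3(flow, 120), "m^3")

# Worst case from the text: 20% lower yield per m^3 at 300 vs 100 L/min.
gain = relative_collection_rate(300, 100, 0.20)
print("biomass per unit time at 300 vs 100 L/min:", round(gain, 2), "x")
```

Even with the largest reported loss in per-volume yield, tripling the flow rate still collects roughly 2.4 times more biomass per unit of time, which is why the higher flow rate is preferred for time-resolved sampling.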
[Figure 1: a, b Box plots for ocean, soil and air; y-axes: DNA yield (Log10) (ng/mass equivalent) and Log10 (unit of sample to yield 5 ng DNA). c DNA yield (ng) against time of day. d Pipeline schematic with the stages Amassment, Storage, Extraction and Nucleic Acid Analysis: air sampler (filter); PBS buffer + 0.1% Triton X-100; Anodisc, 0.02 µm; DNA extraction; DNA sequencing.]
Fig. 1 Challenges in air microbiome analysis. a Total DNA yield (ng/mass equivalent) for soil, ocean water and air samples collected from the same proximity and processed with the same method. b Estimated sample volume required to yield 5 ng of DNA. For box plots, the centre line, bound of box and whiskers represent median, 25th–75th percentile and min-to-max values, respectively. c Fluctuation of airborne biomass (ng) at different times of the day. The red dots and error bars are mean and standard deviation among the replicates. d Developed sampling and analysis pipeline for metagenomic analysis of ultra-low biomass environmental samples.
Further analysis demonstrated that flow rate does not impact the qualitative and quantitative assessment of metagenomic data (Supplementary Fig. 2b). The community structure (Bray–Curtis, p > 0.05) and richness (Jaccard, p > 0.05) were not significantly different for samples collected with different flow rates.
Storage
Analysis of the storage component in this pipeline evaluated the integrity (biomass quality and composition) of air filter samples stored under different conditions. The three conditions investigated were (i) instant processing (Fsh), (ii) 5-day storage at −20 °C (Frz), and (iii) 5-day storage at room temperature (RT, average 23 °C, RH 65%).
No significant differences were observed between fresh and freezer samples in terms of both absolute (Qubit, qPCR) and relative (metagenomic) abundances. This suggests temporary freezer storage is a viable alternative to immediate filter processing. However, RT samples were significantly different from the fresh and freezer regimes in regard to DNA quantities (20–30% loss) (Fig. 2g–i). Also, a minor decrease of relative abundance of certain taxa was observed (Bray–Curtis, p < 0.05) (Fig. 4a); however, there was no loss in the number of species detected (Jaccard, p > 0.05) (Supplementary Fig. 3). This outcome implies that microbial growth on the filter substrate is impeded within the course of several days, thus enabling sample collection for field surveys without the need for refrigeration²⁷.
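The difference between the two measures used here can be shown with a short Python sketch: an abundance-based measure (Bray–Curtis) responds to shifts in relative abundance, whereas a presence/absence measure (Jaccard) changes only when taxa are gained or lost. The taxon labels and counts are hypothetical and the function names are illustrative; the permutation-based p-values reported above are not computed.

```python
def bray_curtis(a, b):
    """Abundance-based dissimilarity between two count profiles."""
    taxa = set(a) | set(b)
    diff = sum(abs(a.get(t, 0) - b.get(t, 0)) for t in taxa)
    total = sum(a.get(t, 0) + b.get(t, 0) for t in taxa)
    return diff / total


def jaccard_distance(a, b):
    """Presence/absence distance: 1 - |shared taxa| / |all taxa|."""
    present_a = {t for t, n in a.items() if n > 0}
    present_b = {t for t, n in b.items() if n > 0}
    union = present_a | present_b
    return 1 - len(present_a & present_b) / len(union)


# Hypothetical profiles: same taxa detected, different proportions.
fresh = {"taxon A": 50, "taxon B": 30, "taxon C": 20}
room_temp = {"taxon A": 60, "taxon B": 35, "taxon C": 5}

print("Bray-Curtis:", bray_curtis(fresh, room_temp))      # non-zero: abundances shifted
print("Jaccard:", jaccard_distance(fresh, room_temp))     # zero: no taxa lost
```

This mirrors the pattern described for the RT samples: a change in community structure without a change in the number of species detected.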
Filter processing and DNA extraction
As library construction for DNA sequencing requires the removal of particle/biomass from the air filter substrate (referred to as filter processing), the protocol was optimised for efficient biomass retrieval. Importantly, the ultra-low biomass nature of the sample renders filter processing the most limiting, and hence the most critical, step across the entire pipeline for maximising yield.
In general, filter samples can be processed in one of two ways: either by direct DNA extraction on the filter, or by first removing the biomass from the filter prior to DNA extraction. Direct DNA extraction was deemed inefficient as the filter absorbs most of the lysis buffer, which consequently inhibits cell lysis. In contrast, first removing the biomass by washing the filter in a buffer (PBS) and then concentrating it on a thinner membrane with smaller mesh size (0.2 µm PES or Anodisc membrane)²⁸ resulted in significantly higher DNA recovery (Fig. 4b).
To further improve biomass recovery, additional steps such as water-bath sonication (RT, 1 min)²⁹,³⁰ and the use of detergent (Triton X-100) during filter wash were tested. For comparison of samples processed with and without sonication, no significant difference in either quantitative or metagenomic analyses was found (Fig. 2j–l and Supplementary Fig. 4). In contrast, adding detergent during the filter wash significantly improved DNA yield (Fig. 2m–o). The hydrophobic nature of the air sampling filter impeded wetting by the wash buffer. Hence, particles were not effectively suspended in the wash buffer when mechanically agitated. The addition of non-ionic detergent, Triton X-100, at varying concentrations (% v/v) (PBS-T) to the initial PBS buffer was effective in overcoming this challenge.
The detergent wash resulted in significant differences in absolute and relative abundance analyses, especially in the instance of bacteria. DNA yield, as well as copy number of bacterial 16S and fungal 18S rRNA genes, increased 2.4, 8.6 and 2.0-fold, respectively (Fig. 2m–o). The metagenomic analysis confirmed this finding (Bray–Curtis and Jaccard, p < 0.05, Fig. 4c, Supplementary Fig. 5). The number of detected bacterial taxa increased eight-fold compared to a 1.3-fold increase in fungal taxa. As expected, PBS-T treated samples also showed greater taxonomic diversity (Fig. 4c).
Varying concentrations of Triton X-100 (0.01, 0.1 and 0.5% (v/v)) in PBS were investigated, with no significant difference between the three concentrations for quantitative analyses (Fig. 2m–o). However, metagenomic analysis identified notable differences in microbiome composition (Bray–Curtis p < 0.05, Supplementary Fig. 5a, b) driven by an increase in bacterial taxa. Increasing Triton X-100 beyond 0.1% concentration yielded no significant further gains (Fig. 4d). Hence, Triton X-100 at 0.1% was deemed sufficient for wetting the filter and releasing attached bioaerosol particles into the buffer medium. Despite the 0.1% concentration of Triton X-100 being above the critical micelle concentration³¹, Triton X-100 did not trigger unwanted premature lysis of microbial cells, as there were no significant differences in DNA yield between the three concentrations. If premature lysis occurred, extracellular DNA would not have been retained on the subsequent Anodisc membrane, resulting in lower DNA recovery.

[Figure 2: box-plot panels a–s grouped by pipeline stage (Amassment, Storage, Extraction, NA Analysis). y-axes: DNA yield (ng), 18S rRNA CN (×10⁸) for fungi and 16S rRNA CN (×10⁵) for bacteria. x-axes: sampling duration (min), flow rate (L/min), storage condition (Fsh, Frz, RT), sonication (+/−), Triton X-100 (%, v/v) and incubation at 55 °C (1 h, 2 h, ON). Group sizes shown in the figure are n = 3 or n = 4.]
Fig. 2 Summary of quantitative analysis with DNA yield, 18S copy number (CN) and 16S copy number (CN). a–c Assessment of air sampling duration from 15 min to 3 h. d–f Assessment of air sampling flow rate from 100 L/min to 300 L/min. g–i The integrity of sampled biomass when processed fresh (Fsh), stored in freezer for 5 d (Frz) or stored at room temperature for 5 d (RT). j–l Impact of sonication on DNA yield. m–o Impact of detergent addition at different concentrations (0.01–0.5% v/v) during filter sample wash. p–r Impact of extended pre-incubation (1 h to overnight) at 55 °C during DNA extraction. The centre line, bound of box and whiskers represent median, 25th–75th percentile and min-to-max values, respectively. s Whole-genome shotgun (WGS) and amplicon (ITS/16S) sequencing approaches. * denotes statistical significance (p < 0.05) tested with Mann–Whitney tests.
Following filter processing, the recovered biomass was filtered through a 0.02 µm pore-sized Anodisc membrane (Whatman, USA) mounted on a vacuum manifold (Fig. 1d), with the Anodisc directly fitting into the DNA extraction kit bead tube. DNA extraction used the standard protocol of the extraction kit with slight modification to improve lysis²⁶. In this regard, the addition of overnight pre-incubation of the samples at 55 °C is recommended as it improves evenness among the samples, especially for the representation of fungal taxa, as shown by the quantitative and PERMDISP analysis (Fig. 2p–r, Fig. 4e).
Nucleic acid analysis of ultra-low biomass samples
The above sample processing pipeline yields double-stranded DNA samples (in the range of 0.1–7.1 ng DNA/m³ of air sampled). These can subsequently be analysed, not only by amplification-based techniques (16S/ITS), but also via direct DNA sequencing (shotgun), resulting in either gene-based or metagenomic profiles of airborne environmental communities. Both approaches, 16S/ITS amplicon and whole-genome shotgun metagenomic (WGS), produce sequence data that may be compared against publicly available data archives. For the remainder of this manuscript, the advantages and disadvantages of both techniques will be discussed in relation to ultra-low biomass analysis.

[Figure 3: a Timeline of sampling regimes starting at 5:00 am, with sampling times of 15 min, 30 min, 1 h, 2 h and 3 h. b Bar chart of DNA yield (ng) against sampling duration (min). c, d Horizontal bar charts of number of reads per species for the 1st 15-min, 2nd 15-min, 30-min and averaged 15-min samples.]
Fig. 3 Sampling duration assessment. a Illustration of different time-based sampling regimes. b Comparison of DNA yield (ng) between the corresponding sampling regimes, e.g. first 15-min yield (orange) + second 15-min yield (light blue) compared to first 30-min yield (orange). The bars represent mean values and the error bars were standard deviation among the replicates. c Taxonomic compositions of the top 30 species; the highlighted portion focuses on species which shifted in abundance between the first and second 15-min samples. d Comparison of relative abundances of the selected species; the first and second 15-min samples were averaged and compared to the taxonomic composition of the first 30-min sample. The bars represent mean values and the error bars were standard deviation among the replicates.
Amplicon-based sequencing approaches have been the method of choice in the majority of past bioaerosol studies¹³⁻¹⁸. This was due to the assumption that the low amount of amassable biomass from air was insufficient for shotgun metagenomics³². Our study shows that the above-described ultra-low biomass air sampling and processing pipeline is capable of robustly producing metagenomic datasets, as demonstrated (i) for a range of DNA input amounts, (ii) by the reproducibility among replicates, (iii) by the robustness of air sample analysis from various climatic conditions and (iv) by contamination control.
Required input amount: From the same DNA sample, a range of DNA input amounts for shotgun metagenomic sequencing and analysis (0.5–10 ng) was tested. Using our pipeline, taxonomic representation for each DNA input condition was visualised at the species level using bubble charts (Fig. 5a). For the tested range of 0.5–10 ng, no significant change of species-level composition was observed. The species-level metagenomic profile for each sample was consistent even when the PCR cycles required during DNA library construction were increased from 6 to 15 (Fig. 5a).
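To relate these input amounts to field sampling, the following Python sketch estimates how long a sampler would need to run to collect a given amount of DNA. It is a back-of-the-envelope calculation, not part of the study: it assumes a constant airborne DNA concentration, that all extracted DNA is available for library preparation, and the function name `min_sampling_minutes` is hypothetical. The concentration range of 0.1–7.1 ng DNA/m³ and the flow rate of 300 L/min are taken from the text.

```python
def min_sampling_minutes(target_ng, dna_ng_per_m3, flow_l_per_min=300.0):
    """Minutes of sampling needed to collect `target_ng` of DNA,
    assuming a constant DNA concentration in air and no processing loss."""
    if dna_ng_per_m3 <= 0 or flow_l_per_min <= 0:
        raise ValueError("concentration and flow rate must be positive")
    m3_per_min = flow_l_per_min / 1000.0  # 1 m^3 = 1000 L
    return target_ng / (dna_ng_per_m3 * m3_per_min)


# Lower and upper ends of the reported yield range, for the smallest
# and largest tested sequencing inputs.
for conc in (0.1, 7.1):
    for target in (0.5, 10.0):
        minutes = min_sampling_minutes(target, conc)
        print(f"{conc} ng/m^3, {target} ng input: {minutes:.1f} min")
```

Under these assumptions, the smallest tested input (0.5 ng) is reached in about 17 min even at the low end of the yield range, which is consistent with the short sampling durations explored in this work.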
Reproducibility: An experimental time series of outdoor air in a tropical setting²² was used to assess sample-to-sample variability. Over 24 h, air samples were collected at 2-h time intervals (12 time points) in triplicate. The metagenomic profiles of samples within the same replicate group were highly consistent, with an average similarity of 91% (87–95%, SIMPER analysis). The taxonomic profiles, however, were distinct between sampling time points (Fig. 5b). The higher variability observed for day-time samples can be attributed to increased atmospheric turbulence due to convection, while a narrower range was observed during night-time hours.
Robustness: The above-tested range of 0.5–10 ng of DNA templates, with their respective PCR cycles (15 to 6 cycles), was suitable for a global air microbiome survey that involved a wide range of environmental conditions. The pipeline presented here robustly produced metagenomic datasets from air samples collected in locations with a diverse range of temperature (−10 to 39 °C) and humidity (36–90%), within the four climatic zones (temperate, desert, sub-arctic and tropical) (Fig. 5c).
Contamination control: The negative controls consisted of filter blanks (clean, unused filters) mounted on the air samplers for 1 min without airflow, which were then transported and processed in an identical manner to air samples. The DNA yield from negative controls was not detectable (Supplementary Fig. 7a). The number of reads generated by Illumina sequencing was on average 1000-fold lower for the negative controls than for the air samples (Supplementary Fig. 7b), with taxonomic analysis indicating human contamination as the most likely source (Supplementary Fig. 7c). The number of reads from our air samples that could be mapped back to the filter blanks was very low, and these reads were removed by our statistical analysis threshold (<0.05% of assigned reads). We therefore conclude that, despite the ultra-low biomass nature of our analytical pipeline, contamination is not a concern (Supplementary Discussion 7).
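An abundance threshold of this kind can be sketched as follows. This is an illustration only: the function name `drop_low_abundance` and the read counts are hypothetical, and it assumes the 0.05% cut-off is applied per sample to each taxon's share of all assigned reads, which the text does not spell out.

```python
def drop_low_abundance(counts, min_fraction=0.0005):
    """Keep taxa holding at least `min_fraction` (0.05%) of assigned reads."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {t: n for t, n in counts.items() if n / total >= min_fraction}


# Hypothetical assigned-read counts for one air sample.
sample = {
    "Micrococcus luteus": 60000,
    "Trametes versicolor": 39970,
    "Homo sapiens": 30,  # 0.03% of assigned reads, below the cut-off
}

kept = drop_low_abundance(sample)
print(sorted(kept))
```

Taxa that contribute only a trace of the assigned reads, as would be expected for carry-over from filter blanks, fall below the cut-off and are excluded from downstream analysis.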
In a final step, extracted genomic DNA from the pipeline was analysed by both metagenomic and 16S/ITS amplicon sequencing, resulting in sets of distinct taxonomic profiles based on their respective databases (Fig. 6a). For fungi, results from both sequencing analysis methods concur with the observed trends for the specific abundances of microbial taxa during day/night at higher taxonomic levels, e.g., Ascomycota being prevalent during day-time and Basidiomycota during night-time. The 16S amplicon analysis, however, was less robust as three out of four samples resulted in no detectable PCR product, even with higher DNA input (4–46 ng) and additional PCR cycles (Fig. 6a). This was caused by low amounts of 16S rDNA gene template in tropical air samples (Supplementary Fig. 9). The only successfully analysed 16S sample resulted in a similar taxonomic profile to that of the WGS pipeline at the phylum level, with Firmicutes dominating over Actinobacteria and Proteobacteria.
The above analysis highlights biases in the success rate of fungal ITS and/or bacterial 16S amplification for air samples from a diverse range of environmental conditions. Numerous studies have reported similar challenges¹³,¹⁶. In contrast, regardless of potential inhibitor content and/or taxonomic composition of the air samples, the WGS pipeline consistently captured the biological diversity of airborne microbial communities in various climatic conditions (Figs. 5c, 6a). Moreover, unlike the single gene amplicon approach, the WGS pipeline directly compared DNA read abundances from a diverse set of taxa (bacteria, fungi, plants and others) at a single quantitative scale.

[Figure 4: a PCO plot for storage conditions (Fsh, Frz, RT). b DNA yield (ng) with (+) and without (−) filter processing. c Species richness for bacteria and fungi against Triton X-100 (%). d PCO plot for Triton X-100 concentrations (0%, 0.01%, 0.1%, 0.5%). e PERMDISP for fungi and bacteria against incubation at 55 °C (1 h, 2 h, ON). Group sizes shown in the figure are n = 3 or n = 4.]
Fig. 4 Storage and biomass extraction. a Principal coordinate analysis (Bray–Curtis) on genus level for samples processed fresh (Fsh), stored in freezer (Frz) and room temperature (RT). b Comparison of DNA yield (ng) with (+) and without (−) the filter wash step. c Total identified species for fungi (orange) and bacteria (blue) for samples processed with different concentration of detergent (0–0.5% v/v) during the wash step. The bars represent mean values and the error bars were standard deviation among the replicates. d Principal coordinate analysis (Bray–Curtis) on genus level for samples processed with different concentration of detergent (0–0.5% v/v) during the wash step. e PERMDISP analysis for samples processed with extended incubation at 55 °C prior to cell lysis. The centre line, bound of box and whiskers represent median, 25th–75th percentile and min-to-max values, respectively.
In contrast to phylum level analysis, WGS and amplicon analytical pipelines are substantially less congruent at the genus or species level, due to the respective database sizes. Our metagenomic reads were aligned to the non-redundant (nr) database and assigned to taxa using the MEGAN software³³, while the amplicon reads were aligned to the 16S SILVA database for bacteria³⁴ and ITS UNITE³⁵ database for fungi using blastn³⁶. The resulting taxonomic classifications from the two analysis approaches show significant agreement at higher taxonomic levels (e.g., up to phylum level). At genus and species levels, taxonomic concordance is diminished, as shown for the top 40 most abundant taxa for both analysis types (Fig. 6b, c). In this regard, the metagenomic and 16S amplicon approach agree in 72–78% of instances on the genus level. However, only one out of 40 taxa (2.5%) was in agreement on the species level. This concordance is even less for fungal taxonomy. On the genus level, 19 out of 40 taxa (47.5%) were in agreement when the WGS was used as a reference. When the ITS was chosen as a reference, seven out of 40 (17.5%) assignments were in agreement. As observed for bacteria, only 1 out of 40 taxonomic assignments was shared on a species level. In general, the amplicon databases possess a much larger representation of fungal and bacterial taxa. The higher overlap for bacteria was likely due to higher representation of bacterial genomes in the nr sequence database due to increasing accessibility for generating genome-wide data for small microbial genomes. In contrast, the accessibility does not extend to genome sizes exceeding 100 MB for some fungal organisms. In this regard, the sequencing, assembly and annotation of fungal genomes are still challenging.
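The reference-dependent agreement figures quoted above (for example 19/40 = 47.5% with WGS as reference against 7/40 = 17.5% with ITS as reference) can be illustrated with a small Python sketch. This is one plausible reading of how such a comparison works, not the procedure used in this study: the function name `directional_agreement` is hypothetical, the genus lists are invented, and it assumes that each taxon in the reference list is simply looked up in the other method's list.

```python
def directional_agreement(reference_top, other_taxa):
    """Fraction of the reference list that also appears in the other data set."""
    other = set(other_taxa)
    shared = [t for t in reference_top if t in other]
    return len(shared) / len(reference_top)


# Hypothetical genus-level lists (not data from the study).
wgs_top = ["Trametes", "Schizophyllum", "Aspergillus", "Coprinopsis"]
its_top = ["Aspergillus", "Cladosporium", "Trametes", "Penicillium", "Fusarium"]

# The result depends on which method supplies the reference list.
print("WGS as reference:", directional_agreement(wgs_top, its_top))
print("ITS as reference:", directional_agreement(its_top, wgs_top))
```

Because the two lists differ in length and content, the agreement is asymmetric, which is why the choice of reference is stated alongside each percentage.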
rDNA sequences generated by both sequencing methods concur when analysed for marker gene content. In this regard, the metagenomic datasets analysed in this study contain about 1% rDNA genes (ITS and 16S), which can be aligned to 16S SILVA and ITS UNITE databases. The metagenomics rDNA read analysis and the amplicon sequencing results produced highly overlapping taxonomic profiles for the top 40 most abundant taxa for fungi and the top 10 most abundant taxa for bacteria (Supplementary Fig. 6). With metagenomic sequencing becoming more accessible, it is therefore possible to combine the benefits of 16S- and ITS-based taxonomy to investigate understudied ultra-low biomass environments, while simultaneously enabling taxonomic and functional analyses³⁷.
DISCUSSION
The air sampling and analysis pipeline presented here enables
qualitative and quantitative assessment of microbial diversity in an
ultra-low biomass ecosystem. The 5–7 log difference in biomass
concentration of air samples, compared to seawater or soil,
requires sufficiently large volumes of air to be sampled. Based on
our optimisation results, we propose default sampling parameters
of 300 L/min for 2 h. This enables a DNA accumulation rate that is
~8–170-fold higher than reported in recent studies [26,28,38]
(Supplementary Table 1). This large improvement allows for shorter
sampling times (15 min), while still enabling WGS metagenomic
analysis with species-level taxonomic classification. Such high
temporal and taxonomic resolution is crucial for ecological
studies of air microbiomes, which rapidly respond to diel
dynamics or sudden environmental changes.
[Fig. 5 graphic not reproduced. Panel a: species-level profiles for DNA inputs of 10, 5, 2 and 0.5 ng (PCR cycles shown: 6, 8, 12 and 15). Panel b: principal coordinate plot (PCO1 59.7%, PCO2 23.7%) of day and night samples taken every 2 h, with bar charts of phylum-level composition in % of assigned reads; average SIMPER similarity among replicates: 91.3%. Panel c: bacterial, fungal and plant species at four sites: temperate (Germany, 12 to 15 °C, RH 80 to 83%), desert (Israel, 35 to 39 °C, RH 36 to 38%), arctic (Russia, -10 to -15 °C, RH 78 to 80%) and tropical (Singapore, 27 to 33 °C, RH 70 to 90%).]
Fig. 5 Whole genome shotgun (WGS) sequencing of air samples.
a Comparison of taxonomic profile at species level for the same air
sample that was subjected to WGS sequencing with different DNA
input amounts (10–0.5 ng). b Reproducibility of samples collected at
the same time and location (triplicates) illustrated in principal
coordinate analysis (Bray–Curtis) at species level. The bars show the
microbial community composition of the triplicates in % of assigned
reads. c Robustness of air sampling and processing pipeline tested
at locations with temperate, desert, sub-arctic and tropical climates.
I. Luhung et al.
6
npj Biofilms and Microbiomes (2021) 37 Published in partnership with Nanyang Technological University
It should be noted
that factors such as time of sampling within a day, sampling
duration and climatic settings of the sampling location impact the
analysis outcome, and therefore comparability between the above
studies. Our proposed method, however, has also been evaluated
for its robustness across a wide range of environmental settings
(arctic, desert, temperate and tropical climatic zones) (Fig. 5c).
Further, our results indicate that biomass amassed from air
samples using filter-based devices during remote fieldwork may
be stored at room temperature for extended periods of time with
tolerable loss of extractable DNA (20% in 5 d) and without
compromising microbial community structure. While these effects
could potentially be counteracted by nucleic acid stabilisation
methods [39], this approach is not recommended during sampling
campaigns, as it would require additional handling of the air filter
samples in the field.
[Fig. 6 graphic not reproduced. Panel a: bar charts of % assigned reads at phylum level for four samples (Day1, Night1, Day2, Night2) of extracted genomic DNA (DNA yield from 36 m³ of air), analysed by WGS sequencing, ITS amplicon sequencing (PCR with ITS1F - ITS2R) and 16S amplicon sequencing (PCR with 16S 338F - 806R); some amplicon reactions are marked "No Amplification". Panels b and c: presence–absence comparisons of the top 40 genera and species for bacteria and fungi, with a phylum legend and with WGS, 16S or ITS as the reference.]
Fig. 6 Comparison of taxonomic profiles between WGS and amplicon sequencing pipelines. a Taxonomic profile of WGS, ITS amplicon and
16S amplicon pipeline at phylum level of four independently collected air samples (two days and two nights). b Presence–absence
comparison of the top 40 most abundant genera for bacteria (WGS vs 16S) and fungi (WGS vs ITS). c Presence–absence comparison of the top
40 most abundant species for bacteria (WGS vs 16S) and fungi (WGS vs ITS).
This could result in contamination and
complicate transport due to the introduction of liquid materials
(e.g., commercial air travel). The advantage of dry storage and
transport also does not extend to other types of air samplers, such
as liquid impingers.
For nucleic acid extraction, it could be shown that the amassed
biomass should not be extracted directly on the filter, but rather
first be removed, owing to the adherence of the low quantities of
nucleic acids to the large surface area of the filter membrane.
Therefore, extraction and wash buffer conditions should be
optimised to enable the extraction of sub-nanomolar concentrations
of DNA/RNA. This optimisation includes the use of detergent
and extended incubation times. In particular, the addition of
non-ionic detergents, such as Triton X-100, significantly increases the
recovered biomass, while extended incubation times improve the
evenness of the large sets of samples. This observation is also
highly relevant in the context of sampling potentially infectious
biological materials, such as airborne retroviruses, which can
concurrently be inactivated with Triton X-100 [40].
Both metagenomic and amplicon sequencing methods can be
applied to air samples (Fig. 2s). The metagenomic approach
enables simultaneous functional and taxonomic analysis and has
the advantage that bacteria and fungi can be analysed on the same
quantitative scale. Further,
the rapid expansion of the public WGS databases continues to
enable species-level taxonomic identification at an increasing rate.
In contrast, the content of amplicon sequencing databases (ITS or
16S) is likely to grow at a slower rate, given the increased
accessibility of WGS.
While our study demonstrated that the extracted DNA from the
ultra-low biomass pipeline was sufficient for WGS and ITS
amplicon analyses, 16S amplicons did not perform equally well
for tropical air samples (Fig. 6a). This may be because
DNA library construction for WGS is less sensitive to inhibitors
and to the relative ratio of bacterial vs. fungal DNA. Both factors
affect the efficacy of the polymerase chain reaction. Nevertheless,
specific gene marker/amplicon analysis can be advantageous
for studies that target well characterised, less diverse
microbial communities.
Finally, due to database biases, both methods appear to
converge on the phylum level, but to a lesser degree at the
genus level. At the species level, the two methods do not produce
significant agreement. To harness the advantages of both
sequencing technologies, it is beneficial to combine both
approaches by also analysing the rDNA sequences from the
metagenomic data (Supplementary Fig. 6). The results from this
combined analysis enable data interpretation from a single data
source (metagenomic data), to inform both WGS and marker
gene analysis pipelines.
The methodology presented here is limited by the size range of
the chosen filter medium (0.5 to >10 µm, <50% efficiency for
particles <0.5 µm). As this study aims to reduce required sampling
times, total suspended (biological) particles (TSP) need to be
collected and analysed. While this study does not profile particle
size range, recent studies have demonstrated that the most
relevant airborne bacteria and fungi fall within the size range of
the filter medium [13,15].
In summary, the above-described ultra-low biomass analysis
pipeline provides detailed insights into the factors that influence
analysis outcomes for low-biomass microbial environments. High
volumetric air sampling techniques, in combination with the applied
nucleic acid analysis, result in high temporal and taxonomic
resolution of inherent airborne microbial communities. The
presented findings are potentially also applicable to other
low-biomass environments, such as dust and surfaces.
METHODS
Air sampling
Air samples for optimisation purposes were collected in Singapore at a
roof-top balcony of a university building (N1.346247, E103.679467). As the
study focuses on improving the time resolution of the analysis, a
high-flow-rate, filter-based air sampler (SASS3100, Research International, USA) was
used to collect total suspended particles (TSP) with no size cut-off. The
filter medium was a SASS Bioaerosol electret filter (6 cm diameter, expected
50% efficiency for 0.5 µm particle size, Research International, USA). For
sample collection, air samplers were attached upright on a tripod 1.5 m
above the concrete floor of the balcony.
In addition to Singapore, samples from different climatic settings were
collected in a consistent manner from sites in Germany, Russia and Israel to
test the robustness of the proposed pipeline. These international locations
showed contrasting settings for temperature (T) and relative humidity (RH).
After sampling, the filters were returned to their original filter pouches
and transported to the laboratory for direct processing or storage at
−20 °C. Information on exact sampling time, flow rate, duration and the
environmental parameter measurements of all sampling activities used in
this study can be found in Table 1.
Temperature and relative humidity
Temperature (T) and relative humidity (RH) at the sampling site were
measured using HOBO Temp/RH 2.5% Data Logger (Onset, USA).
Filter processing, DNA extraction, quantitation and
sequencing
All filter samples were subsequently processed for DNA extraction,
quantitation, qPCR, metagenomic sequencing and computational analysis
as described in our previous study [22]. In brief, the filter samples were first
washed 3 times using 2 mL of phosphate-buffered saline (pH 7.2) with
0.1% (v/v) Triton X-100, assisted by water-bath sonication at room
temperature for 1 min. After washing, the suspension liquid was
concentrated onto a 0.02 µm Anodisc filter (Whatman, UK) using a vacuum
manifold (DHI, Denmark). DNA was then extracted from the Anodisc with
the DNeasy PowerWater kit (Qiagen, USA) following the manufacturer's
standard protocol with modifications to increase DNA yield [26].
The final DNA solution was subjected to fluorometric quantification, qPCR
and shotgun metagenomic sequencing. Fluorometric quantitation was
performed with a Qubit 2.0 (Invitrogen, USA) using the High Sensitivity
double stranded DNA (HS dsDNA) kit. TaqMan qPCR assays with universal
bacterial (16S rRNA gene) [41] and fungal (18S rRNA gene) [42] primer sets and
probes were used to quantify the copy numbers of bacteria and fungi,
respectively. The complete list of primers can be found in Table 2.
For direct metagenomic sequencing, libraries were prepared using the Swift
Biosciences Accel-NGS 2S Plus DNA Library kit following the standard
protocol. All libraries were subsequently dual-barcoded with the Swift
Biosciences 2S Dual Indexing kit. PCR amplification selectively enriches
for library fragments that have adapters ligated on both ends. The number
of cycles was adjusted based on the starting amount of DNA (8–15
cycles). Upon pooling at equal volumes, libraries were sequenced on
Illumina HiSeq2500 Rapid runs at a final concentration of 10–11 pM and a
read-length of 251 bp paired-end (Illumina V2 Rapid sequencing reagents).
Each ultra-low biomass sample was sequenced to a depth of at least two
million paired-reads.
Raw reads from the sequencer were first trimmed of adapter
sequences and low quality bases (<20 score), and short reads (<30 bp)
were removed, using Cutadapt (v.1.8.1) [43]. The processed reads were then
aligned against NCBI's NR database (v.25-02-2016) using RAPSearch2 (v.2.15) [44].
Results from the RAPSearch2 alignment were finally converted to read-match
archive (rma) format to be visualised with the MEGAN5 software [33].
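As a rough illustration of the read filtering described above, the following sketch removes low-quality bases from the 3′ end of a read and discards reads that end up shorter than 30 bp. It is a deliberately simplified stand-in and not Cutadapt's actual algorithm; it assumes Phred+33 encoded quality strings, and the function name `trim_read` is hypothetical.

```python
MIN_QUALITY = 20  # Phred score threshold stated in the text
MIN_LENGTH = 30   # minimum read length (bp) stated in the text


def trim_read(sequence, quality, min_quality=MIN_QUALITY, min_length=MIN_LENGTH):
    """Trim trailing bases below min_quality; return None if the read is too short.

    Simplified illustration only: assumes Phred+33 quality encoding and
    trims from the 3' end until a base meeting the threshold is found.
    """
    end = len(sequence)
    while end > 0 and (ord(quality[end - 1]) - 33) < min_quality:
        end -= 1
    if end < min_length:
        return None
    return sequence[:end], quality[:end]


# Toy read: 40 bases, of which the last 5 have low quality ('#' = Q2, 'I' = Q40).
read = "ACGT" * 10
qual = "I" * 35 + "#" * 5
trimmed = trim_read(read, qual)
print(len(trimmed[0]) if trimmed else "discarded")
```

Adapter removal is not covered here, since it depends on the adapter sequences of the library kit.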
Experimental parameters optimisation
Important parameters for sampling, extraction and sequencing were tested
and optimised based on absolute (fluorometer and qPCR) and relative
abundance assessment (DNA sequencing). Importantly, only samples
collected at an identical time and location may be compared. The
experimental setup must therefore deploy multiple air samplers for each
set of parameter optimisation experiments. This is due to the high
volatility of biomass concentration and composition of air, particularly
when sampling at different time points throughout day and night. The
replicability and robustness of this study
was, therefore, enabled through simultaneous deployment of up to 12 air
samplers at any given time (n=12).
Comparison to other types of environmental samples: The ultra-low
concentration of airborne biomass was investigated relative to other types
of environmental samples. To negate possible differences due to sampling
location and/or processing method, soil (1 g per sample extraction),
water (1 mL per sample extraction) and air samples (300 L/min, 2 h
sampling duration) were collected in close proximity to each other (in Singapore)
and were subsequently processed with an identical protocol. Only DNA yield
(ng/unit mass or volume of the samples) was assessed for this experiment.
The amassment parameters are sampling duration and sampling flow
rate. Sampling duration experiment: With a fixed air flow rate (300 L/min,
n = 3), sampling duration was varied at 15, 30, 60, 120 and 180 min. Further,
multiple shorter duration samples were also compared to longer duration
samples with matching time segments, i.e. the first and second 15 min samples
were compared to the matching 30 min sample. Air flow rate experiment:
With a fixed duration (2 h), three groups of air samplers (n = 4) were run at
the same time with varying flow rates of 100, 200 and 300 L/min. The
experiments were assessed based on the impact of sampling duration and
airflow variations on DNA quantity and microbial composition.
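For orientation, the air volume behind each amassment setting follows directly from flow rate and duration. The short sketch below tabulates these volumes for the settings listed above; the function name is illustrative and the only assumption is the unit conversion 1 m³ = 1000 L.

```python
def sampled_air_volume_m3(flow_rate_l_per_min, duration_min):
    """Volume of air drawn through the filter, in cubic metres (1 m^3 = 1000 L)."""
    return flow_rate_l_per_min * duration_min / 1000.0


# Sampling duration experiment: fixed flow rate of 300 L/min.
for minutes in (15, 30, 60, 120, 180):
    volume = sampled_air_volume_m3(300, minutes)
    print(f"{minutes:>3} min at 300 L/min -> {volume:.1f} m^3")

# Air flow rate experiment: fixed duration of 2 h (120 min).
for flow in (100, 200, 300):
    volume = sampled_air_volume_m3(flow, 120)
    print(f"{flow} L/min for 120 min -> {volume:.1f} m^3")
```

The default setting of 300 L/min for 2 h corresponds to 36 m³ of air, and a 15 min sample at the same flow rate to 4.5 m³.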
Table 2. List of primers and probes applied in the study.

| Name | Sequence | Notes |
| --- | --- | --- |
| 16S 341F | 5′-CCTACGGGDGGCWGCA-3′ | Bacteria qPCR |
| 16S 805R | 5′-GGACTACHVGGGTMTCTAATC-3′ | Bacteria qPCR |
| Taqman probe | 6FAM-5′-CAGCAGCCGCGGTA-3′-BBQ | Bacteria qPCR probe |
| FungiQuant-F | 5′-GGRAAACTCACCAGGTCCAG-3′ | Fungi qPCR |
| FungiQuant-R | 5′-GSWCTATCCCCAKCACGA-3′ | Fungi qPCR |
| FungiQuant-PrbLNA | 6FAM-5′-TGGTGCATGGCCGTT-3′-BBQ | Fungi qPCR probe |
| 16S 341F Illumina | 5′-TCG TCG GCA GCG TCA GAT GTG TAT AAG AGA CAG CCT ACG GGN BGC ASC AG-3′ | Amplicon for bacteria |
| 16S 805R Illumina | 5′-GTC TCG TGG GCT CGG AGA TGT GTA TAA GAG ACA GGG ACT ACH VGG GTW TCT AAT-3′ | Amplicon for bacteria |
| ITS1F Illumina | 5′-TCG TCG GCA GCG TCA GAT GTG TAT AAG AGA CAG CTT GGT CAT TTA GAG GAA GTA A-3′ | Amplicon for fungi |
| ITS2R Illumina | 5′-GTC TCG TGG GCT CGG AGA TGT GTA TAA GAG ACA GGC TGC GTT CTT CAT CGA TGC-3′ | Amplicon for fungi |
Table 1. Details of sampling activities.

| Location | Sample set | Sampling date | Sampling time, duration and flow rate | Temperature (°C) | RH (%) | Rain | Sample size | No. of samples |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Singapore | 1 | 29-Nov-17 | 01:00–03:00 (2 h, 100 L/min, 200 L/min, 300 L/min) | 24.8–25.3 | 98–100 | No | 4 | 12 |
| Singapore | 2 | 15-Dec-17 | 05:05–08:05 (15 min, 30 min, 1 h, 2 h, 3 h, 300 L/min) | 24.7–25.7 | 99–100 | No | 3 | 27 |
| Singapore | 3 | 29-Nov-17 | 06:15–08:15 (2 h, 300 L/min) | 24.6–25.4 | 99–100 | No | 4 | 12 |
| Singapore | 4 | 23-Feb-17 to 24-Feb-17 | 17:00–17:00 (2 h, 300 L/min) | 24.0–34.0 | 63–100 | Yes | 3 | 36 |
| Singapore | 5 | 8-May-16 to 9-May-16 | 17:00–17:00 (2 h, 300 L/min) | 28.0–33.0 | 59–89 | No | 3 | 36 |
| Singapore | 6 | 29-Nov-17 | 03:50–05:50 (2 h, 300 L/min) | 24.5–24.8 | 99–100 | Yes | 3 | 12 |
| Singapore | 7 | 28-Nov-17 | 20:40–22:40 (2 h, 300 L/min) | 24.9–25.3 | 98–99 | No | 3 | 12 |
| Singapore | 8 | 24-Nov-17 | 05:00–07:00 (2 h, 300 L/min) | 23.9–24.5 | 99–100 | No | 4 | 12 |
| Singapore | 9 | 22-Nov-17 | 23:00–01:00 (2 h, 300 L/min) | 26.0–26.5 | 97–99 | No | 3 | 3 |
| Singapore | 10 | 23-Nov-17 | 11:00–13:00 (2 h, 300 L/min) | 29.0–30.0 | 77–80 | No | 2 | 2 |
| Singapore | 11 | 27-Nov-17 | 23:00–01:00 (2 h, 300 L/min) | 24.5–25.5 | 93–96 | No | 3 | 3 |
| Singapore | 12 | 28-Nov-17 | 11:00–13:00 (2 h, 300 L/min) | 28.5–29.5 | 75–77 | No | 2 | 2 |
| Singapore | 13 | 29-Aug-16 | 13:00–15:00 (2 h, 300 L/min) | 31.0–32.0 | 70–80 | No | 1 | 1 |
| Singapore | 14 | 30-Aug-16 | 13:00–15:00 (2 h, 300 L/min) | 31.0–32.0 | 70–80 | No | 1 | 1 |
| Singapore | 15 | 21-Sep-15 | 13:30–15:30 (2 h, 300 L/min) | 31.0–32.0 | 63–70 | Yes | 1 | 1 |
| Germany | 16 | 30-Jul-17 | 12:00–14:00 (2 h, 300 L/min) | 12.0–15.0 | 80–83 | No | 2 | 2 |
| Israel | 17 | 4-Jul-17 | 08:30–10:30 (2 h, 300 L/min) | 35.0–39.0 | 36–38 | No | 2 | 2 |
| Russia | 18 | 2-Dec-17 | 09:00–11:00 (2 h, 300 L/min) | −10.0 to −15.0 | 78–80 | No | 1 | 1 |
| Russia | 19 | 3-Dec-17 | 15:00–17:00 (2 h, 300 L/min) | −10.0 to −15.0 | 78–80 | No | 1 | 1 |

Total no. of samples (including blanks): 183
Sample storage experiment. Three sets of air samples collected
simultaneously (300 L/min, 2 h, n = 4) were subjected to the following storage
regimes: direct processing (fresh), −20 °C storage for 5 days (freezer) and
room temperature storage for 5 days (RT). The sets were compared for both DNA
quantity and microbial profiles.
Parameters optimised for filter processing and DNA extraction were the
use of sonication, the use of detergent and the duration of pre-incubation.
Sonication experiment: Two sets of air samples collected at the same time
(300 L/min, 2 h, n = 3) were subjected to filter washing with the room
temperature water-bath sonication step included and excluded. Detergent
experiment: Four sets of air samples collected at the same time (300 L/min,
2 h, n = 3) were washed with buffer containing four different concentrations
of the non-ionic detergent Triton X-100 (% v/v): no detergent (0%), 0.01, 0.1
and 0.5%. Pre-incubation experiment: Three sets of air samples collected at
the same time (300 L/min, 2 h, n = 4) were subjected to three different
durations of pre-incubation in a 55 °C water bath prior to proceeding with the
subsequent lysis step of the DNA extraction. The durations were 1 h, 2 h
and overnight (14–16 h). These durations were selected to enable the
completion of the entire extraction process (filter washing and DNA
extraction) within a standard working day (~8 h).
All the above experiments were assessed based on DNA quantity and
microbial profiles of the resulting analysis.
The DNA sequencing result was evaluated for the DNA input amount,
reproducibility, robustness and taxonomic classification difference
between metagenomics and amplicon. DNA input experiment: From a
given extracted DNA sample, four different DNA input amounts for direct
metagenomic sequencing were tested: 10 ng, 5 ng, 2 ng and 0.5 ng. The
number of PCR cycles during library construction was adjusted based on
the DNA amount. The final result was assessed based on the taxonomic
composition of the sequencing analysis. Reproducibility between replicates:
A set of time series samples was analysed to investigate the similarity of
the metagenomic profiles between the replicates. The time-series data
contain twelve sets of time points with three replicates each. Each set was
collected with a 300 L/min flow rate and 2-hour sampling duration, spanning
24 h. Robustness across a range of climatic settings: Air samples
collected from locations with different climates (highly variable T and RH)
were analysed regarding the success rate of DNA sequencing library
construction due to varying amounts and quality of DNA input. A 300 L/min
flow rate and 2 h sampling duration were used to collect samples in
Germany (temperate), Israel (desert) and Russia (sub-arctic). Comparison of
shotgun metagenomic and amplicon marker gene sequencing: The two
sequencing approaches were evaluated using taxonomic assignments
from identical sets of extracted air samples. DNA samples were split for
shotgun metagenomic, 16S bacterial amplicon and ITS fungal amplicon
sequencing. The sequencing and analysis methods for the bacterial and
fungal amplicon sequencing are detailed in the following section.
PCR-based amplicon sequencing and analysis
A subset of our ultra-low biomass samples was also subjected to amplicon
sequencing for direct comparison with the shotgun metagenomic
sequencing approach. For these samples, the first stage PCR was
performed with the extracted genomic DNA as a template and the
ITS1F-ITS2R [45] primers for fungi and 16S 341F-805R [46] primers for bacteria.
Details of these primer sequences can be found in Table 2. KAPA HiFi
HotStart master mix was used with a total reaction volume of 25 µL. For
DNA input amount, 3 µL and 10 µL of DNA templates were used for fungi
and bacteria, respectively. The cycling conditions were 95 °C for 3 min,
followed by amplification cycles of 95 °C for 30 s, 65 °C for 30 s and
72 °C for 30 s, and a final extension at 72 °C for 5 min. The fungal samples
were amplified with 15 cycles and the bacterial samples were amplified with
25 cycles. The PCR products were then purified with AMPure XP beads
(Beckman Coulter) before performing the second stage PCR.
The second stage PCR (Indexing PCR) was performed according to the
recommendations in Illumina's "16S Metagenomic Sequencing Library
Preparation" application note. This step uses a limited cycle PCR to
complete the Illumina sequencing adapters and add dual-index barcodes
to the amplicon target. Five microliters of the intermediate PCR product
from the first stage PCR (Amplicon PCR) were used as template for the
indexing PCR and samples were amplified with eight PCR cycles. Nextera
XT v2 indices were used for dual-index barcoding to allow pooling of the
amplicon targets for sequencing.
Finished amplicon libraries were quantitated using Promega's QuantiFluor
dsDNA assay and the average library size was determined on an Agilent
Tapestation 4200. Library concentrations were then normalised to 4 nM and
validated by qPCR on a QuantStudio-3 real-time PCR system (Applied
Biosystems), using the Kapa library quantification kit for Illumina platforms
(Kapa Biosystems). The libraries were then pooled at equimolar concentrations
and sequenced on the Illumina MiSeq platform with 20% PhiX spike-in and at
a read-length of 300 bp paired-end (MiSeq V3 reagents).
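Normalising to 4 nM requires converting the measured mass concentration and average library size into molarity. The sketch below uses the widely used approximation of 660 g/mol per base pair of double-stranded DNA; the function names, the example values and the 20 µL final volume are assumptions for illustration and are not taken from the study.

```python
AVG_BP_MASS_G_PER_MOL = 660.0  # approximate molar mass of one dsDNA base pair


def library_molarity_nm(conc_ng_per_ul, mean_fragment_bp):
    """Convert a dsDNA mass concentration (ng/uL) to molarity (nM)."""
    return conc_ng_per_ul * 1e6 / (AVG_BP_MASS_G_PER_MOL * mean_fragment_bp)


def dilution_to_target(molarity_nm, target_nm=4.0, final_volume_ul=20.0):
    """Return (library_ul, diluent_ul) needed to reach target_nm in final_volume_ul."""
    if molarity_nm < target_nm:
        raise ValueError("library is already below the target concentration")
    library_ul = final_volume_ul * target_nm / molarity_nm
    return library_ul, final_volume_ul - library_ul


# Hypothetical library: 2.64 ng/uL with an average size of 500 bp.
molarity = library_molarity_nm(2.64, 500)
library_ul, diluent_ul = dilution_to_target(molarity)
print(f"{molarity:.2f} nM; mix {library_ul:.1f} uL library with {diluent_ul:.1f} uL diluent")
```

Libraries that are already below the target concentration raise an error instead of being silently "diluted", since they would need to be concentrated or pooled at a lower molarity.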
After sequencing, raw reads were first trimmed of adapter sequences,
low-quality bases and short reads using Cutadapt (v.1.8.1) [43]. After trimming,
the R1 and R2 reads were paired with a minimum overlap of 10 bp and
subsequently aligned against the UNITE ITS database (v.7.1) for the ITS sequences
and the SILVA 16S database (release 132) for the 16S sequences using command
line blastn [36] (version 2.2.28+). Results from blastn alignments were also
converted to read-match archive (rma) format for visualisation with the
MEGAN5 software to facilitate direct comparison with the metagenomic
sequencing analysis. The default LCA parameters were used.
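The pairing of R1 and R2 reads with a minimum overlap can be pictured with the minimal sketch below. The text does not name the tool used for this step, so this is only an illustration under simplifying assumptions: exact matching in the overlap, no use of base qualities, and hypothetical function names.

```python
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    return seq.translate(COMPLEMENT)[::-1]


def merge_pair(r1, r2, min_overlap=10):
    """Merge R1 with the reverse complement of R2 if they overlap exactly.

    Returns the merged sequence, or None when no exact overlap of at least
    min_overlap bases exists. Real merging tools also tolerate mismatches
    and use base qualities; this sketch does not.
    """
    r2_rc = reverse_complement(r2)
    # Try the longest possible overlap first, then shorter ones.
    for k in range(min(len(r1), len(r2_rc)), min_overlap - 1, -1):
        if r1[-k:] == r2_rc[:k]:
            return r1 + r2_rc[k:]
    return None


# Toy 30 bp fragment sequenced from both ends with a 12 bp overlap.
fragment = "ACGTTGCAAGGCTTAACCGGTTAGCATCGA"
r1 = fragment[:20]
r2 = reverse_complement(fragment[8:])
print(merge_pair(r1, r2) == fragment)
```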
Statistical analysis
For quantitative analysis from Qubit 2.0 Fluorometer and qPCR, all
statistical tests were conducted with the Mann–Whitney test. As mentioned
previously, we acknowledge the limitations of these tests due to the
relatively low number of observations (n = 3 or n = 4) for each set of
samples. Due to the volatile nature of air samples, only samples collected at
the same time and location can be directly compared. Thus, the number of
replications was limited by the number of samplers which could be
deployed at a given time (n = 12).
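The limitation of small group sizes can be made concrete. Assuming an exact two-sided Mann–Whitney test without ties, the smallest attainable p-value occurs when the two groups are completely separated and equals 2 divided by the number of ways to choose one group from the pooled observations. The helper name below is illustrative.

```python
from math import comb


def min_two_sided_p(n1, n2):
    """Smallest exact two-sided Mann-Whitney p-value for group sizes n1 and n2 (no ties)."""
    return 2 / comb(n1 + n2, n1)


for n in (3, 4):
    print(f"n = {n} per group: minimum p = {min_two_sided_p(n, n):.4f}")
```

Under these assumptions, two groups of three cannot reach p < 0.05 (minimum 0.1), whereas two groups of four can only do so when the groups do not overlap at all (minimum about 0.029).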
For metagenomic analysis, significant differences between groups of
samples were mainly determined by the ANOSIM test based on distance
matrices between the samples compared. Distance matrices were created
with the PRIMER7 software based on taxa (genus level, cut-off at 0.05% of
total assigned reads) read counts of each sample generated by MEGAN5.
The distance matrix calculated with the Bray–Curtis algorithm was used to
evaluate proportional difference (community structure) of the microbial
communities between samples, while the distance matrix calculated with
the Jaccard algorithm was used to determine presence–absence difference
(community membership/richness) of different taxa detected in the
compared group of samples. For reproducibility assessment among
replicates, environmental time series data were used in which air samples
with two-hourly time resolution were collected in 24 h with three replicates
each. Similarity percentage (SIMPER) analysis was conducted with
PRIMER7 software with the samples grouped based on the replicates.
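The two distance measures can be illustrated on toy genus-level read counts. This is a minimal sketch and not PRIMER7's implementation: it assumes that Bray–Curtis is computed on relative abundances, that the 0.05% cut-off is applied per sample with an inclusive threshold, and the genus counts are invented for illustration.

```python
def filter_by_abundance(counts, cutoff_percent=0.05):
    """Keep taxa whose share of total assigned reads is at least cutoff_percent."""
    total = sum(counts.values())
    return {t: c for t, c in counts.items() if 100.0 * c / total >= cutoff_percent}


def to_proportions(counts):
    """Convert read counts to relative abundances that sum to 1."""
    total = sum(counts.values())
    return {taxon: n / total for taxon, n in counts.items()}


def bray_curtis(counts_a, counts_b):
    """Bray-Curtis dissimilarity on relative abundances (community structure)."""
    pa, pb = to_proportions(counts_a), to_proportions(counts_b)
    shared = sum(min(pa.get(t, 0.0), pb.get(t, 0.0)) for t in set(pa) | set(pb))
    return 1.0 - shared


def jaccard(counts_a, counts_b):
    """Jaccard dissimilarity on presence-absence (community membership)."""
    a, b = set(counts_a), set(counts_b)
    return 1.0 - len(a & b) / len(a | b)


# Hypothetical read counts for two samples.
sample_a = {"Acinetobacter": 600, "Pseudomonas": 300, "Bacillus": 100}
sample_b = {"Acinetobacter": 500, "Pseudomonas": 500}
print(f"Bray-Curtis: {bray_curtis(sample_a, sample_b):.2f}")
print(f"Jaccard:     {jaccard(sample_a, sample_b):.2f}")
```

The example shows why both measures are reported: the two samples share most of their reads (low Bray–Curtis dissimilarity) even though one of three genera is absent from the second sample (higher Jaccard dissimilarity).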
Blank sample collection and analysis
Five filter blank samples were collected and analysed. Filter blank samples
were collected by attaching a clean, unused filter onto the air sampler at
the sampling location and collecting them after 1 min without running the
sampler. They were subjected to the same extraction methods and
metagenomic analysis pipeline as the actual air samples.
Reporting summary
Further information on research design is available in the Nature Research
Reporting Summary linked to this article.
DATA AVAILABILITY
All raw unprocessed reads have been submitted to NCBI under the bio-project
accession number PRJNA638794.
Received: 6 November 2020; Accepted: 19 March 2021;
REFERENCES
1. Darwin, C. The Voyage of the Beagle (Cosimo Inc., 2008).
2. Von Humboldt, A. & Aimé B. Personal Narrative of Travels to the Equinoctial
Regions of America: During the Years 1799-1804 (Cosimo Inc., 2013).
3. Gilbert, J. A., Jansson, J. K. & Knight, R. The Earth Microbiome project: successes
and aspirations. BMC Biol. 12, 1–4 (2014).
4. Silvia, C. M. & Stal, J. L. The Marine Microbiome (Springer International, 2016).
5. Burrows, S. M., Elbert, W. & Lawrence, M. G. Bacteria in the global atmosphere.
Atmos. Chem. Phys. 9, 9263–9280 (2009).
6. Bauer, H. et al. The contribution of bacteria and fungal spores to the organic carbon
content of cloud water, precipitation and aerosols. Atmos. Res. 64, 109–119 (2002).
7. Prussin, A. J., Garcia, E. B. & Marr, L. C. Total concentrations of virus and bacteria in
indoor and outdoor air. Environ. Sci. Technol. Lett. 2, 84–88 (2015).
8. Jones, A. M. & Harrison, R. M. The effects of meteorological factors on atmospheric
bioaerosol concentrations – a review. Sci. Total Environ. 326, 151–180 (2004).
9. Schulz-Bohm, K., Martín-Sánchez, L. & Garbeva, P. Microbial volatiles: small
molecules with an important role in intra- and inter-kingdom interactions. Front.
Microbiol. 8, 1–10 (2017).
10. Misztal, P. K. et al. Emission factors of microbial volatile organic compounds from
environmental bacteria and fungi. Environ. Sci. Technol. 52, 8272–8282 (2018).
11. Bourdillon, B. Y. R. B., Lidwell, M. & Thomas, J. C. A slit sampler for collecting and
counting air-borne bacteria. Epidemiol. Infect. 41, 197–224 (1941).
12. Palmgren, U., Ström, G., Blomquist, G. & Malmberg, P. Collection of airborne
micro-organisms on Nuclepore filters, estimation and analysis-CAMNEA method.
J. Appl. Bacteriol. 61, 401–406 (1986).
13. Yamamoto, N. et al. Particle-size distributions and seasonal diversity of allergenic
and pathogenic fungi in outdoor air. ISME J. 6, 1801–1811 (2012).
14. Lang-Yona, N. et al. Annual distribution of allergenic fungal spores in atmospheric
particulate matter in the eastern mediterranean; A comparative study between
ergosterol and quantitative PCR analysis. Atmos. Chem. Phys. 12, 2681–2690
(2012).
15. Hospodsky, D. et al. Human occupancy as a source of indoor airborne bacteria.
PLoS ONE 7, e34867 (2012).
16. Fu, X. et al. Indoor microbiome, environmental characteristics and asthma among
junior high school students in Johor Bahru, Malaysia. Environ. Int. 138, 105664 (2020).
17. Luhung, I. et al. Exploring temporal patterns of bacterial and fungal DNA
accumulation on a ventilation system filter for a Singapore university library. PLoS ONE
13, e0200820 (2018).
18. Amend, A. S., Seifert, K. A., Samson, R. & Bruns, T. D. Indoor fungal composition is
geographically patterned and more diverse in temperate zones than in the
tropics. Proc. Natl Acad. Sci. 107, 13748–13753 (2010).
19. Tringe, S. G. et al. The airborne metagenome in an indoor urban environment.
PLoS ONE 3, e1862 (2008).
20. Yooseph, S. et al. A metagenomic framework for the study of airborne microbial
communities. PLoS ONE 8, e81862 (2013).
21. Cao, C. et al. Inhalable microorganisms in Beijing's PM2.5
and PM
10
pollutants
during a severe smog event. Environ. Sci. Technol. 48, 14991507 (2014).
22. Gusareva, E. S. et al. Microbial communities in the tropical air ecosystem follow a
precise diel cycle. Proc. Natl Acad. Sci. 116, 2329923308 (2019).
23. Ottesen, E. A. et al. Multispecies diel transcriptional oscillations in open ocean
heterotrophic bacterial assemblages. Science 345, 207212 (2014).
24. Kai, W. et al. Ambient bioaerosol particle dynamics observed during haze and
sunny days in Beijing. Sci. Total Environ. 550, 751759 (2016).
25. Dybwad, M., Skogan, G. & Blatny, J. M. Comparative testing and evaluation of nine
different air samplers: end-to-end sampling efciencies as specicperformance
measurements for bioaerosol applications. Aerosol Sci. Technol. 48,282295 (2014).
26. Luhung, I. et al. Protocol improvements for low concentration DNA-based
bioaerosol sampling and analysis. PLoS ONE 10, e0141158 (2015).
27. Spring, A. M. et al. A method for collecting atmospheric microbial samples from
set altitudes for use with next-generation sequencing techniques to characterize
communities. Air Soil Water Res. https://doi.org/10.1177/1178622118788871
(2018).
28. Jiang, W. et al. Optimized DNA extraction and metagenomic sequencing of air-
borne microbial communities. Nat. Protoc. 10, 768779 (2015).
29. Kim, H., Park, K. & Lee, M. Biocompatible dispersion methods for carbon black.
Toxicol. Res. 28, 209216 (2012).
30. Muthukumaran, S. et al. The optimisation of ultrasonic cleaning procedures for
dairy fouled ultraltration membranes. Ultrasonic Sonochem. 12,2935 (2005).
31. Cragg, M. S. et al. Complement-mediated lysis by anti-CD20 mAb correlates with
segregation into lipid rafts. Blood 101, 10451052 (2003).
32. Núñez, A. et al. Monitoring of airborne biological particles in outdoor atmosphere
Part 2: metagenomics applied to urban environments. Int. Microbiol. 19,6980
(2016).
33. Huson, D. H., Auch, A. F., Qi, J. & Schuster, S. C. MEGAN analysis of metagenomic
data. Genome Res. 17, 377386 (2007).
34. Quast, C. et al. The SILVA ribosomal RNA gene database project: improved data
processing and web-based tools. Nucleic Acids Res. 41, D590D596 (2012).
35. Abarenkov, K. et al. The UNITE database for molecular identication of
fungirecent updates and future perspectives. N. Phytol. 186, 281285 (2010).
36. Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local align-
ment search tool. J. Mol. Biol. 215, 403410 (1990).
37. Urich, T. et al. Simultaneous assessment of soil microbial community structure and
function through analysis of the meta-transcriptome. PLoS ONE 3, e2527 (2008).
38. Dommergue, A. et al. Methods to investigate the global atmospheric micro-
biome. Front. Microbiol. 10, 243 (2019).
39. Spens, J. et al. Comparison of capture and storage methods for aqueous mac-
robial eDNA using an optimized extraction protocol: advantage of enclosed lter.
Methods Ecol. Evolution 8, 635645 (2017).
40. Patterson, E. I. et al. Methods of inactivation of SARS-CoV-2 for downstream
biological assays. J. Infect. Dis. 222, 14621467 (2020).
41. Liu, C. M. et al. BactQuant: an enhanced broad-coverage bacterial quantitative
real-time PCR assay. BMC Microbiol.12, 56 (2012).
42. Liu, C. M. et al. FungiQuant: a broad-coverage fungal quantitative real-time PCR
assay. BMC Microbiol.12, 1 (2012).
43. Martin, M. Cutadapt removes adapter sequences from high-throughput
sequencing reads. EMBnet J. 17, 10 (2011).
44. Zhao, Y., Tang, H. & Ye, Y. RAPSearch2: a fast and memory-efcient protein
similarity search tool for next-generation sequencing data. Bioinformatics 28,
125126 (2012).
45. Bokulich, N. A. & Mills, D. A. Improve d selection of Internal Transcribed Spacer-
specic primers enables quantitative, ultra-high-throughput proling of fungal
communities. Appl Environ. Microbiol. 79, 25192526 (2013).
46. Takahashi, S., Tomita, J., Nishioka, K., Hisada, T. & Nishijima, M. Development of a
prokaryotic universal primer for simultaneous analysis of bacteria and archaea
using next-generation sequencing. PLoS ONE 9, e105592 (2014).
ACKNOWLEDGEMENTS
The work was supported by Singapore Ministry of Education Academic Research
Fund Tier 3 grant (grant MOE2013-T3-1-013).
AUTHOR CONTRIBUTIONS
I.L., A.U., S.C.S.: Designed the study, conducted experiments, analysed the data, wrote
the manuscript. S.B.Y.L.: Conducted experiments, analysed the data. D.I.D.-M., S.L.:
Analysed the data, wrote the manuscript. N.E.G., C.K., K.L.: Conducted experiments.
B.N.V.P., R.W.P., E.G., E.A., C.E.H., A.W., H.L.K., A.C.M.J.: Analysed the data. S.R.L., Y.Z.H.,
D.P., K.Y., K.K.K., A.P.N., A.P.: Conducted experiments. I.L. and A.U. contributed equally to
this manuscript.
COMPETING INTERESTS
The authors declare no competing interests.
ADDITIONAL INFORMATION
Supplementary information The online version contains supplementary material
available at https://doi.org/10.1038/s41522-021-00209-4.
Correspondence and requests for materials should be addressed to S.C.S.
Reprints and permission information is available at http://www.nature.com/reprints
Publisher's note Springer Nature remains neutral with regard to jurisdictional claims
in published maps and institutional affiliations.
Open Access This article is licensed under a Creative Commons
Attribution 4.0 International License, which permits use, sharing,
adaptation, distribution and reproduction in any medium or format, as long as you give
appropriate credit to the original author(s) and the source, provide a link to the Creative
Commons license, and indicate if changes were made. The images or other third party
material in this article are included in the article's Creative Commons license, unless
indicated otherwise in a credit line to the material. If material is not included in the
article's Creative Commons license and your intended use is not permitted by statutory
regulation or exceeds the permitted use, you will need to obtain permission directly
from the copyright holder. To view a copy of this license, visit
http://creativecommons.org/licenses/by/4.0/.
© The Author(s) 2021
... During these visits, filter-based SASS3100 air samplers (Research International, Monroe, WA, USA) were placed in participant bedrooms and balconies (or other outdoor air sources adjacent to the home) to sample indoor and outdoor air sources, respectively. Samplers were set to run for eight consecutive hours (overnight) at a flow rate of 100 L·min −1 according to previously described protocols for metagenomic assessment of indoor and outdoor air [14,15]. Surface swabs of fans and air-conditioning filters were performed using 4N6Floq (Copan, Murrieta, CA, USA) swabs pre-moistened with 0.1% PBS-Triton-X100 [16][17][18]. ...
... Surface swabs of fans and air-conditioning filters were performed using 4N6Floq (Copan, Murrieta, CA, USA) swabs pre-moistened with 0.1% PBS-Triton-X100 [16][17][18]. Biomass was removed from filters by washing in 0.1% PBS-Triton-X100 with sonication at room temperature for 1 min, after which samples were concentrated using a 0.02 µm Anodisc filter with vacuum manifold (Whatman, Little Chalfont, UK) [15]. ...
... To ensure consistency, all library preparation and metagenomic sequencing were performed at a single site (Nanyang Technological University, Singapore). The DNeasy PowerWater kit (Qiagen, Germantown, MD, USA) was employed for DNA extraction from concentrated Anodisc filters in parallel with filter and reagent blanks [14,15]. Library preparation and sequencing was performed using established low-biomass sequencing protocols optimised as described previously on an Illumina HiSeq 2500 platform (Illumina, San Diego, CA, USA) [8,14,15]. ...
Article
Background Sensitisation to Aspergillus fumigatus is linked to worse outcomes in patients with chronic obstructive pulmonary disease (COPD), however, its prevalence and clinical implications in domestic (residential) settings remains unknown. Methods Individuals with COPD (n=43) recruited in Singapore had their residences prospectively sampled and assessed by shotgun metagenomic sequencing including indoor air, outdoor air, and touch surfaces (total: 126 specimens). The abundance of environmental A. fumigatus and the occurrence of A. fumigatus (Asp f) allergens in the environment were determined and immunological responses to A. fumigatus allergens determined in association with clinical outcomes including exacerbation frequency. Findings were validated in 12 individuals (31 specimens) with COPD in Vancouver, Canada, a climatically different region. Results 157 metagenomes from 43 homes were assessed. Eleven and nine separate Aspergillus spp. were identified in Singapore and Vancouver respectively. Despite climatic, temperature, and humidity variation, A. fumigatus was detectable in the environment from both locations. The relative abundance of environmental A. fumigatus was significantly associated with exacerbation frequency in both Singapore (r=0.27, p=0.003) and Vancouver (r=0.49, p=0.01) and individuals with higher Asp f 3 sensitisation responses lived in homes with a greater abundance of environmental Asp f 3 allergens (p=0.037). Patients exposed and sensitised to Asp f 3 allergens demonstrated a higher rate of COPD exacerbations at 1-year follow-up (p=0.021). Conclusion Environmental A. fumigatus exposure in the home environment including air and surfaces with resulting sensitisation carries pathogenic potential in individuals with COPD. Targeting domestic A. fumigatus abundance may reduce COPD exacerbations.
... Such metagenomic approaches have also recently been applied for low biomass bioaerosol analysis [5] and have revealed the complex nature and diverse origins of the air microbiome [4], including vertical-altitudinal stratification of microbial abundance and distribution [6], and substantial diurnal, seasonal, temperature-, and humidity-dependent f luctuations [7]. ...
... Metagenomic approaches have provided unprecedented insights into the nature, origin, and complexity of the air microbiome [4][5][6][7]. While past studies have relied on traditional short-read sequencing, we here describe the first long-read nanopore sequencing technology-based approaches to robustly assess the air microbiome. ...
... While past studies have relied on traditional short-read sequencing, we here describe the first long-read nanopore sequencing technology-based approaches to robustly assess the air microbiome. Although nanopore sequencing has been applied to various environmental samples, such as water and soil [15,34,35], its applicability to air samples was expected to pose a particular challenge due to the ultra-low biomass of air and the amplification-free nature of nanopore sequencing [5]. We here showed that nanopore shotgun sequencing in combination with active air sampling through liquid impingement and tailored computational analyses can reproducibly describe the air microbiome of different environments (Fig. 1) while leveraging the latest nanopore chemistry improvements, which offer high sequencing, accuracy and reduced minimum DNA input requirements [10,11]. ...
Article
While the air microbiome and its diversity are essential for human health and ecosystem resilience, comprehensive air microbial diversity monitoring has remained rare, so that little is known about the air microbiome’s composition, distribution, or functionality. Here we show that nanopore sequencing-based metagenomics can robustly assess the air microbiome in combination with active air sampling through liquid impingement and tailored computational analysis. We provide fast and portable laboratory and computational approaches for air microbiome profiling, which we leverage to robustly assess the taxonomic composition of the core air microbiome of a controlled greenhouse environment and of a natural outdoor environment. We show that long-read sequencing can resolve species-level annotations and specific ecosystem functions through de novo metagenomic assemblies despite the low amount of fragmented DNA used as an input for nanopore sequencing. We then apply our pipeline to assess the diversity and variability of an urban air microbiome, using Barcelona, Spain, as an example; this randomized experiment gives first insights into the presence of highly stable location-specific air microbiomes within the city’s boundaries, and showcases the robust microbial assessments that can be achieved through automatable, fast, and portable nanopore sequencing technology.
... Studying the atmospheric microbiome presents unique challenges due to its low-biomass nature (26). Overcoming this challenge requires sophisticated bioinformatic algorithms and high sequencing depth for accurate analysis. ...
... While traditional methods like amplicon sequencing, which target specific genetic markers such as the 16S ribosomal RNA gene and internal transcribed spacer (ITS), have been utilized to identify taxonomic features, they primarily limited to the genus level for bacteria and fungi (7,8,10,27). Recent studies have employed shotgun metagenomic and metatranscriptomic sequencing in other ecologies to explore the atmospheric microbiome, utilizing advanced aerosol sampling devices and genomic technologies (26,28,29). However, most investigations of the dust microbiome have focused on shallow taxonomic characterization or have taken a generalized approach to functional characterization (7,8,10,27,30,31), rather than tailoring bioinformatic tools that are specific to each scientific research question. ...
Preprint
Full-text available
The atmosphere hosts a microbiome that connects distant ecosystems yet remains relatively unexplored. In this study, we tested the hypothesis that dust storms enhance the spread of pathogenic microorganisms and whether these microorganisms carry antibiotic resistance and virulence-related genes in the Eastern Mediterranean. We collected air samples during a seasonal transition period, capturing data from 13 dusty days originating from Middle Eastern sources and 32 clear days, with temperatures ranging from 16.5 to 27.1 °C. Using metagenomic analysis, we identified several facultative pathogens like Klebsiella pneumoniae , Stenotrophomonas maltophilia , and Aspergillus fumigatus , which are linked to human respiratory diseases, and others like Zymoseptoria tritici , Fusarium poae , and Puccinia striiformis , which are harmful to wheat. The abundance of these pathogens increased during dust storms and with rising temperatures. Although we did not find strong evidence that these species harbored antibiotic resistance or virulence-related genes, which could be linked to their pathogenic potential, dust storms transported up to 125 times more total antibiotic resistance genes, as measured by RPKM abundance, compared to clear conditions. These levels during dust storms far exceeded those found in other ecosystems. While further research is needed to determine whether dust storms and temperature variations pose an immediate threat to public health and the environment, our findings underscore the importance of continuous monitoring of atmospheric microbiomes. This surveillance is crucial for assessing potential risks to human health and ecosystem stability, particularly in the face of accelerating global climate change.
... Opting for DNA extraction from filter paper posed a risk of the filter paper absorbing most of the lysis buffer, potentially reducing cell lysis efficiency. Additionally, a liquid medium was preferred for air sampling when extensive extraction was required [22]. ...
Article
Full-text available
Background Microorganisms in dental unit water (DUW) play a significant role in dental bioaerosols. If the methods used to decontaminate DUW also help improve air quality in dental clinics is worth exploring. In this study, we aim to identify the source of bacteria in dental bioaerosols and investigate the impact of waterline disinfectants on the quantity and composition of bacteria in DUW and bioaerosols. Methods Two dental chair units in a separate treatment room are installed with two different waterline decontamination systems, a plasma or iodine cartridge. The experiment was performed in two phases, before and after installing the decontamination systems. Aerosol is generated via running airotor in the subject’s mouth. Before and after the procedure, the air samples were collected with an active air sampling machine onto agar plate and filter paper for genomic DNA extraction. The subject’s saliva and DUW samples were also collected. The samples were analyzed further with bacterial counting and metataxonomics analysis. Results The bacteria present in the air sample after the aerosol-generating procedure were confirmed to be derived from the air-before, saliva, and DUW in 51.43%, 6.38%, and 18.60%, respectively. The saliva samples demonstrated the highest alpha diversity (within the sample), whereas the air samples had the least. Both waterline disinfectants effectively controlled bacteria in DUW but did not affect the bacterial number and composition in the air. Conclusions Dental bioaerosols are composed of bacteria from saliva and DUW. Plasma and iodine showed a trend in controlling bacterial contamination in DUW but did not alter the bacterial count and composition in dental bioaerosols.
... Ceci n'est qu'un angle d'approches plus globales visant à définir « l'exposome occidental » et son impact sur les écosystèmes humains, animaux et végétaux. En milieu urbain, des approches se développent en parallèle afin de fournir des données prospectives sur la flore aérienne, en intérieur et en extérieur, avec la difficulté d'analyse métagénomique fiable d'échantillons de très faible biomasse [14]. On voit aujourd'hui très légitimement se développer une approche « une santé » (One health) pour laquelle des progrès sont indispensables, non tant dans les technologies « omiques » que dans le traitement standardisé des « big data » générées par ces approches. ...
Article
Full-text available
Si les effets délétères des activités humaines sur la biodiversité du monde végétal et animal et sur le climat sont un fait acquis, leur impact sur la biodiversité microbienne doit être urgemment considéré, particulièrement sur le microbiome humain. La révolution métagénomique permet une large analyse et un suivi spatio-temporels jusqu’à présent inenvisageables. Une réduction de la richesse et de la diversité des microbiotes humains, en particulier intestinaux, est maintenant avérée, surtout dans les aires industrialisées de la planète. Utilisation inconsidérée des antibiotiques, changements drastiques des régimes alimentaires et éléments restant à déterminer de l’exposome environnemental sont le plus souvent incriminés. En découlent des situations de dysbioses caractérisées par une érosion du cœur d’espèces microbiennes communes à tous les individus et une prolifération de pathobiontes opportunistes, sans doute due à un affaiblissement de l’effet de barrière du microbiome. Le défi actuel est d’établir un lien de causalité entre ces dysbioses et des maladies en émergence épidémique, bien que non transmissibles, comme l’asthme, l’allergie, les maladies auto-immunes, l’obésité, le diabète et certains cancers. Modèles expérimentaux et études cliniques contrôlées prospectives et interventionnelles sont indispensables pour consolider cette causalité, d’autant que dans le déchiffrage des altérations de la symbiose homme-microbiome se profile un nouveau chapitre de la médecine : la « médecine microbienne »
... Other improvements of the system that need to be considered include the reduce of the cost, the shortening of the analytical time of each test and the completion of automated system cleanup of RIAMs. Furthermore, the effects of adjustable flow mode on sampling efficiency, cutoff size, spatial virus resolution, etc., as well as the changes in spatial virus depiction caused by different sampling efficiencies can be studied 49 . Besides, regarding aerosol particle collection, this study focused on the direct collection of particles within the cyclone pipe. ...
Article
Full-text available
Highly sensitive airborne virus monitoring is critical for preventing and containing epidemics. However, the detection of airborne viruses at ultra-low concentrations remains challenging due to the lack of ultra-sensitive methods and easy-to-deployment equipment. Here, we present an integrated microfluidic cartridge that can accurately detect SARS-COV-2, Influenza A, B, and respiratory syncytial virus with a sensitivity of 10 copies/mL. When integrated with a high-flow aerosol sampler, our microdevice can achieve a sub-single-copy spatial resolution of 0.83 copies/m³ for airborne virus surveillance with an air flow rate of 400 L/min and a sampling time of 30 minutes. We then designed a series of virus-in-aerosols monitoring systems (RIAMs), including versions of a multi-site sampling RIAMs (M-RIAMs), a stationary real-time RIAMs (S-RIAMs), and a roaming real-time RIAMs (R-RIAMs) for different application scenarios. Using M-RIAMs, we performed a comprehensive evaluation of 210 environmental samples from COVID-19 patient wards, including 30 aerosol samples. The highest positive detection rate of aerosol samples (60%) proved the aerosol-based SARS-CoV-2 monitoring represents an effective method for spatial risk assessment. The detection of 78 aerosol samples in real-world settings via S-RIAMs confirmed its reliability for ultra-sensitive and continuous airborne virus monitoring. Therefore, RIAMs shows the potential as an effective solution for mitigating the risk of airborne virus transmission.
... Worth mentioning that there is, a very limited number of studies (and consequently, few datasets) on the microbial characterization at high altitude in the troposphere, as the majority of studies were performed at only a few meters above the ground level over either terrestrial or oceanic locations ( 10 , 33 ) ( 34 -36 ). One of the reasons for this lack of studies is possibly the extremely low biomass content of air samples, which presents a challenge for the extraction of sufficient genetic material of high enough quality for efficient sequencing analysis ( 37 ). The main airborne bacterial genera detected both in flight and surface samples are Sphingomonas and Methylobacterium , which echoes other studies where they were reported as dominant airborne taxa in outdoor environments, particularly in the lower troposphere across different geographic regions ( 3 , 38 , 39 ) particularly in Japan ( 40 ). ...
Article
The existence of viable human pathogens in bioaerosols which can cause infection or affect human health has been the subject of little research. In this study, data provided by 10 tropospheric aircraft surveys over Japan in 2014 confirm the existence of a vast diversity of microbial species up to 3,000 m height, which can be dispersed above the planetary boundary layer over distances of up to 2,000 km, thanks to strong winds from an area covered with massive cereal croplands in Northeast (NE) Asia. Microbes attached to aerosols reveal the presence of diverse bacterial and fungal taxa, including potential human pathogens, originating from sewage, pesticides, or fertilizers. Over 266 different fungal and 305 bacterial genera appeared in the 10 aircraft transects. Actinobacteria, Bacillota, Proteobacteria, and Bacteroidetes phyla dominated the bacteria composition and, for fungi, Ascomycota prevailed over Basidiomycota. Among the pathogenic species identified, human pathogens include bacteria such as Escherichia coli, Serratia marcescens, Prevotella melaninogenica, Staphylococcus epidermidis, Staphylococcus haemolyticus, Staphylococcus saprophyticus, Cutibacterium acnes, Clostridium difficile, Clostridium botulinum, Stenotrophomonas maltophilia, Shigella sonnei, Haemophillus parainfluenzae and Acinetobacter baumannii and health-relevant fungi such as Malassezia restricta , Malassezia globosa , Candida parapsilosis and Candida zeylanoides, Sarocladium kiliense, Cladosporium halotolerans, and Cladosporium herbarum . Diversity estimates were similar at heights and surface when entrainment of air from high altitudes occurred. Natural antimicrobial-resistant bacteria (ARB) cultured from air samples were found indicating long-distance spread of ARB and microbial viability. This would represent a novel way to disperse both viable human pathogens and resistance genes among distant geographical regions.
Article
Full-text available
The scientific community has responded to the COVID-19 pandemic by rapidly undertaking research to find effective strategies to reduce the burden of this disease. Encouragingly, researchers from a diverse array of fields are collectively working towards this goal. Research with infectious SARS-CoV-2 is undertaken in high containment laboratories, however, it is often desirable to work with samples at lower containment levels. To facilitate the transfer of infectious samples from high containment laboratories, we have tested methods commonly used to inactivate virus and prepare the sample for additional experiments. Incubation at 80°C, a range of detergents, Trizol reagents and UV energies were successful at inactivating a high titre of SARS-CoV-2. Methanol and paraformaldehyde incubation of infected cells also inactivated the virus. These protocols can provide a framework for in house inactivation of SARS-CoV-2 in other laboratories, ensuring the safe use of samples in lower containment levels.
Article
Full-text available
Indoor microbial diversity and composition are suggested to affect the prevalence and severity of asthma by previous home microbiome studies, but no microbiome-health association study has been conducted in a school environment, especially in tropical countries. In this study, we collected floor dust and environmental characteristics from 21 classrooms, and health data related to asthma symptoms from 309 students, in junior high schools in Johor Bahru, Malaysia. The bacterial and fungal composition was characterized by sequencing 16s rRNA gene and internal transcribed spacer (ITS) region, and the absolute microbial concentration was quantified by qPCR. In total, 326 bacterial and 255 fungal genera were characterized. Five bacterial (Sphingobium, Rhodomicrobium, Shimwellia, Solirubrobacter, Pleurocapsa) and two fungal (Torulaspora and Leptosphaeriaceae) taxa were protective for asthma severity. Two bacterial taxa, Izhakiella and Robinsoniella, were positively associated with asthma severity. Several protective bacterial taxa including Rhodomicrobium, Shimwellia and Sphingobium have been reported as protective microbes in previous studies, whereas other taxa were first time reported. Environmental characteristics, such as age of building, size of textile curtain per room volume, occurrence of cockroaches, concentration of house dust mite allergens transferred from homes by the occupants, were involved in shaping the overall microbial community but not asthma-associated taxa; whereas visible dampness and mold, which did not change the overall microbial community for floor dust, was negatively associated with the concentration of protective bacteria Rhodomicrobium (β = -2.86, p = 0.021) of asthma. The result indicates complex interactions between microbes, environmental characteristics and asthma symptoms. 
Overall, this is the first indoor microbiome study to characterize the asthma-associated microbes and their environmental determinant in the tropical area, promoting the understanding of microbial exposure and respiratory health in this region.
Article
Full-text available
The atmosphere is vastly underexplored as a habitable ecosystem for microbial organisms. In this study, we investigated 795 time-resolved metagenomes from tropical air, generating 2.27 terabases of data. Despite only 9 to 17% of the generated sequence data currently being assignable to taxa, the air harbored a microbial diversity that rivals the complexity of other planetary ecosystems. The airborne microbial organisms followed a clear diel cycle, possibly driven by environmental factors. Interday taxonomic diversity exceeded day-to-day and month-to-month variation. Environmental time series revealed the existence of a large core of microbial taxa that remained invariable over 13 mo, thereby underlining the long-term robustness of the airborne community structure. Unlike terrestrial or aquatic environments, where prokaryotes are prevalent, the tropical airborne biomass was dominated by DNA from eukaryotic phyla. Specific fungal and bacterial species were strongly correlated with temperature, humidity, and CO 2 concentration, making them suitable biomarkers for studying the bioaerosol dynamics of the atmosphere.
Article
Full-text available
The interplay between microbes and atmospheric physical and chemical conditions is an open field of research that can only be fully addressed using multidisciplinary approaches. The lack of coordinated efforts to gather data at representative temporal and spatial scales limits aerobiology to help understand large scale patterns of global microbial biodiversity and its causal relationships with the environmental context. This paper presents the sampling strategy and analytical protocols developed in order to integrate different fields of research such as microbiology, –omics biology, atmospheric chemistry, physics and meteorology to characterize atmospheric microbial life. These include control of chemical and microbial contaminations from sampling to analysis and identification of experimental procedures for characterizing airborne microbial biodiversity and its functioning from the atmospheric samples collected at remote sites from low cell density environments. We used high-volume sampling strategy to address both chemical and microbial composition of the atmosphere, because it can help overcome low aerosol and microbial cell concentrations. To account for contaminations, exposed and unexposed control filters were processed along with the samples. We present a method that allows for the extraction of chemical and biological data from the same quartz filters. We tested different sampling times, extraction kits and methods to optimize DNA yield from filters. Based on our results, we recommend supplementary sterilization steps to reduce filter contamination induced by handling and transport. These include manipulation under laminar flow hoods and UV sterilization. In terms of DNA extraction, we recommend a vortex step and a heating step to reduce binding to the quartz fibers of the filters. These steps have led to a 10-fold increase in DNA yield, allowing for downstream omics analysis of air samples. 
Based on our results, our method can be integrated into pre-existing long-term monitoring field protocols for the atmosphere, in terms of both atmospheric chemistry and biology. We recommend using standardized air volumes and developing standard operating protocols for field users to better control operational quality.
Article
Dispersal of airborne microorganisms is an important ecological process, resulting in the distribution of bacteria to all habitats on Earth. Investigation of this process is limited by the ability to collect uncontaminated high-altitude microbial samples for use with next-generation sequencing approaches. Here, we describe the design of a Remote Airborne Microbial Passive sampling system. Troubleshooting experiments demonstrate that the samplers collect adequate DNA for bacterial 16S rRNA (ribosomal RNA) amplicon–based MiSeq sequencing at 2 and 150 m from the ground. When samplers are closed, they retain only a low number of sequences, and may be used as a negative control. We also demonstrate that the optimal number of collection dishes to include in the sampler is 8, and that freezing collection dishes at −80°C is an alternative to immediate DNA extraction. Samplers may be used to address a variety of ecological and human health–related questions.
Article
Introduction: Ventilation system filters process recirculated indoor air along with outdoor air. This function inspires the idea of using the filter as an indoor bioaerosol sampler. While promising, there remains a need to investigate several factors that could limit the accuracy of such a sampling approach. Among the important factors are the dynamics of microbial assemblages on filter surfaces over time and the differential influence of outdoor versus recirculated indoor air. Methods: This study collected ventilation system filter samples from an air handling unit on a regular schedule over a 21-week period and analyzed the accumulation patterns of biological particles on the filter both quantitatively (using fluorometry and qPCR) and in terms of microbial diversity (using 16S rDNA and ITS sequencing). Results: The quantitative result showed that total and bacterial DNA accumulated monotonically, rising to 41 ng/cm² for total DNA and to 2.8 ng/cm² for bacterial DNA over the 21-week period. The accumulation rate of bacterial DNA correlated with indoor occupancy level. Fungal DNA first rose to 4.0 ng/cm² before showing a dip to 1.4 ng/cm² between weeks 6 and 10. The dip indicated a possible artifact of this sampling approach for quantitative analysis as DNA may not be conserved on the filter over the months-long service period. The sequencing results indicate major contributions from outdoor air for fungi and from recirculated indoor air for bacteria. Despite the quantitative changes, the community structure of the microbial assemblages was stable throughout the 21-week sampling period, highlighting the robustness of this sampling method for microbial profiling.
Conclusion: This study supports the use of ventilation system filters as indoor bioaerosol samplers, but with caveats: 1) an outdoor reference is required to properly understand the contribution of outdoor bioaerosols; and 2) there is a need to better understand the persistence and durability of the targeted organisms on ventilation system filters.
Article
Over recent decades, research on the function of volatile organic compounds has focused primarily on the interactions between plants and insects. However, microorganisms can also release a plethora of volatiles, and it appears that microbial volatile organic compounds (mVOCs) can play an important role in intra- and inter-kingdom interactions. So far, most studies have focused on aboveground volatile-mediated interactions, and much less information is available about the function of volatiles belowground. This minireview summarizes the current knowledge on the biological functions of mVOCs with a focus on mVOC-mediated interactions belowground. We pinpointed mVOCs involved in microbe–microbe and microbe–plant interactions, and highlighted the ecological importance of microbial terpenes as a largely underexplored group of mVOCs. We indicated challenges in studying belowground mVOC-mediated interactions and opportunities for further studies and practical applications.
Article
Summary. The air we breathe contains microscopic biological particles such as viruses, bacteria, fungi and pollen, some of them of clinical importance. These organisms and/or their propagules have traditionally been studied by different disciplines and with diverse methodologies such as culture and microscopy. These techniques require time and expertise, and also have some important biases. As a consequence, our knowledge of the total diversity and the relationships between the different biological entities present in the air is far from complete. Currently, metagenomics and next-generation sequencing (NGS) may resolve this shortage of information and have recently been applied to metropolitan areas. Although the procedures and methods are not yet fully standardized, the first studies from urban air samples confirm the previous results obtained by culture and microscopy regarding abundance and variation of these biological particles. However, DNA-sequence analyses call into question some preceding ideas and also provide interesting new insights into diversity and its spatial distribution within cities. Here, we review the procedures, results and perspectives of the recent works that apply NGS to study the main biological particles present in the air of urban environments.
Article
Knowledge of the factors controlling the diverse chemical emissions of common environmental bacteria and fungi is crucial because they are important signal molecules for these microbes that also could influence humans. We show here not only a high diversity of mVOCs but also that their abundance can differ greatly in different environmental contexts. Microbial volatiles exhibit dynamic changes across microbial growth phases, resulting in variance of composition and emission rate of species-specific and generic mVOCs. In vitro experiments documented emissions of a wide range of mVOCs (> 400 different chemicals) at high time resolution from diverse microbial species grown under different controlled conditions on nutrient media or residential structural materials (N = 54, N_control = 23). Emissions of mVOCs varied not only between microbial taxa at a given condition but also as a function of life stage and substrate type. We quantify emission factors for total and specific mVOCs normalized for respiration rates to account for the microbial activity during their stationary phase. Our VOC measurements of different microbial taxa indicate that a variety of factors beyond temperature and water activity, such as substrate type, microbial symbiosis, growth phase, and lifecycle, affect the magnitude and composition of mVOC emission.