Porcine cytomegalovirus detection by nanopore-based metagenomic sequencing in a Hungarian pig farm

Adrienn Gréta Tóth1, Regina Fiam2,3, Ágnes Becsei2, Sándor Spisák4, István Csabai2, László Makrai5, Tamás Reibling1, and Norbert Solymosi*1
1Centre for Bioinformatics, University of Veterinary Medicine, 1078 Budapest, Hungary
2Department of Physics of Complex Systems, Eötvös Loránd University, 1117 Budapest, Hungary
3Saint Ignatius Jesuit College of Excellence, 1085 Budapest, Hungary
4Institute of Enzymology, Research Centre for Natural Sciences, 1117 Budapest, Hungary
5Department of Microbiology and Infectious Diseases, University of Veterinary Medicine, 1143 Budapest, Hungary
*solymosi.norbert@gmail.com
ABSTRACT
The rapid diagnosis of infectious diseases has an essential impact on their control, treatment and recovery. Oxford Nanopore
Technologies (ONT) sequencing opens up a new dimension in applying clinical metagenomics. In a large-scale pig farm in
Hungary, four fattening and one piglet nasal swab pooled samples were sequenced using ONT for metagenomic analysis. Long
reads covering 53.69% of the porcine cytomegalovirus genome were obtained in the piglet sample. The 650 bp long read
matching the glycoprotein B gene of the virus is most similar in sequence to Japanese, Chinese and Spanish isolates.
Introduction
In both human and animal health, the rapid diagnosis of diseases, including infectious diseases, has a crucial impact on their treatment and recovery. Oxford Nanopore Technologies (ONT) sequencing opens up a new dimension in the application of clinical metagenomics [1] in veterinary medicine [2]. This third-generation sequencing technique can provide information on the microbial components of a sample within a few hours to a few days. While the preceding and parallel NGS (next-generation sequencing) methodologies provide higher sequence detection reliability, their sequencing time does not allow rapid microbial diagnostics in practice.
The results presented here are from a study aimed at gaining experience in the clinical metagenomic applicability of ONT in veterinary medicine. Here, we present only the main virological result of veterinary relevance: the detection of porcine cytomegalovirus (PCMV) sequences. The infection occurs in almost all pig populations, but clinical disease is rare, except in young piglets, where it can be fatal [3]. However, since xenotransplantation from PCMV-infected pigs affects the recovery and survival of human patients, screening these donor animals has become an important issue [4–6].
Although the virus has already been identified by PCR in Hungary [7], a phylogenetic comparison of its sequence has not yet been performed. In addition to ONT-based viral detection, the similarity of its genome sequence to the available genomes is studied in this work.
Materials and Methods
Nasal swab samples were collected from 5-week-old piglets housed in the same stable and from 16-week-old (sample id: 3, 4) and 19-week-old (id: 1, 2) fattening pigs from two boxes in each of two stables at a large-scale Hungarian swine farm located near the town of Szekszárd on 21 November 2022. After sample collection, the nasal swabs were transported on ice and stored at -20 °C before the laboratory procedures. Porcine nasal swabs were pooled in nuclease-free molecular biology water as follows: five fattening pig samples from the same stable and box were combined into each fattening pool, and two piglet pools of four piglet samples each were created.
DNA extraction and metagenomics library preparation
DNA extraction was performed with the QIAamp Fast DNA Stool Mini Kit from Qiagen. The concentrations of the extracted DNA solutions were measured with an Invitrogen Qubit 4 Fluorometer using the Qubit dsDNA HS (High Sensitivity) Assay Kit. The concentrations of the two piglet samples were insufficient for library preparation. Thus, the two extracted piglet-derived DNA solutions were pooled and concentrated with a vacuum concentrator. Consequently, the library preparation was conducted on DNA derived from four fattening pig nasal swab samples and one piglet nasal swab sample. The metagenomic long-read library was prepared with the Ligation Sequencing Kit (SQK-LSK110) combined with the PCR-free Native Barcoding Expansion 1-12 (EXP-NBD104) from ONT. The sequencing was performed with a MinION Mk1C sequencer using an R9.4.1 flow cell from ONT.
bioRxiv preprint doi: https://doi.org/10.1101/2022.12.28.522123; this version posted December 29, 2022. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Bioinformatic analysis
From the generated FAST5 files, one fast (configuration file: dna_r9.4.1_450bps_fast_mk1c.cfg) and one high-accuracy (configuration file: dna_r9.4.1_450bps_hac_mk1c.cfg) basecalling was performed by ONT's Guppy basecaller (v6.4.2, https://nanoporetech.com/community). The further analytical steps were performed in parallel on the sequences from both basecalling runs. The raw reads were adapter-trimmed and quality-filtered by Porechop (v0.2.4, https://github.com/rrwick/Porechop) and Nanofilt (v2.6.0) [8], respectively. The resulting reads were taxonomically classified using Kraken2 [9] with the NCBI non-redundant nucleotide database [10]. After evaluating the taxon hits, the cleaned reads were mapped to the reference genome of Suid betaherpesvirus 2 (KF017583.1) by minimap2 [11,12]. The sequence matching the glycoprotein B gene (gB) was used in the phylogenetic analysis. For that purpose, we used the available (19/12/2022) sequences with complete or partial gB CDSs. Pairwise alignments were performed by blastn (BLAST v2.13.0+) [13] with default settings to identify the matching range and strand of the subject sequences. Multiple sequence alignment was performed on the cropped subject and the query sequences by the MUSCLE aligner (v3.8.1551) [14]. To construct a maximum-likelihood tree [15], the functions pml and optim.pml (model='JC', optNNi=TRUE, optBf=TRUE, optQ=TRUE, optInv=TRUE, optGamma=TRUE, optEdge=TRUE) from the phangorn (v2.10) package were applied [16]. All data processing and visualization were performed in the R environment [17].
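The JC (Jukes-Cantor) substitution model named above assumes equal base frequencies and equal substitution rates. As an illustration only, and not the phangorn implementation, the following Python sketch computes the pairwise Jukes-Cantor distance d = -(3/4) ln(1 - (4/3) p) between two aligned sequences, where p is the proportion of differing sites. The function name and the decision to skip columns containing gaps or N are assumptions made for this example.

```python
import math


def jc69_distance(seq_a: str, seq_b: str) -> float:
    """Jukes-Cantor (JC69) distance between two aligned DNA sequences.

    Columns where either sequence has a gap ('-') or an ambiguous base
    (for example 'N') are skipped; this handling is an assumption of the sketch.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    valid = "ACGT"
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in valid or b not in valid:
            continue
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    p = mismatches / compared  # observed proportion of differing sites
    if p >= 0.75:
        return math.inf  # the correction is undefined at saturation
    return -0.75 * math.log(1.0 - (4.0 / 3.0) * p)
```

A maximum-likelihood tree search such as the one described above optimizes branch lengths under this model jointly over all sequences, so pairwise distances like these are only a simplified view of the same assumption.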
Results
In total, 5.94 and 5.97 gigabases of data were generated during the 72 hours of sequencing by fast and high-accuracy basecalling, respectively. The descriptive statistics of the sequences generated by the two basecalling procedures are summarized in Figure 1. The high-accuracy basecalling generated slightly more nucleotides and longer reads, while the number of reads was equal.
Figure 1. Descriptive statistics of raw reads. The high-accuracy (hac) basecalling, compared to the fast one, generated slightly more nucleotides and longer reads, while the number of reads was equal. [Panels: gigabases, median read length and read count (M) per barcode (fattening 1, fattening 2, fattening 3, fattening 4, piglet, unclassified), for fast and hac basecalling.]
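The three statistics summarized in Figure 1 (total yield, median read length and read count) can be derived from read lengths alone. The Python sketch below shows one way to compute them from a FASTQ stream; it assumes the simple four-lines-per-record layout, and the function names and the tiny in-memory example are hypothetical and are not data from this study.

```python
import io
from statistics import median


def read_lengths(fastq_handle):
    """Yield the sequence length of each record in a 4-line-per-record FASTQ stream."""
    for i, line in enumerate(fastq_handle):
        if i % 4 == 1:  # the second line of every record holds the bases
            yield len(line.strip())


def summarize(lengths):
    """Return read count, total yield in gigabases and median read length."""
    lengths = list(lengths)
    if not lengths:
        return {"read_count": 0, "gigabases": 0.0, "median_length": 0}
    return {
        "read_count": len(lengths),
        "gigabases": sum(lengths) / 1e9,
        "median_length": median(lengths),
    }


# Tiny in-memory example with made-up reads (not data from the study).
example = io.StringIO(
    "@r1\nACGT\n+\n!!!!\n@r2\nACGTACGT\n+\n!!!!!!!!\n@r3\nAC\n+\n!!\n"
)
stats = summarize(read_lengths(example))
```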
In samples fattening 1, 2, 3 and 4, the number of reads matching the Suid betaherpesvirus 2 reference genome was 2, 5, 8 and 1, respectively. None of the four fattening samples had a read that matched gB. Of the unclassified (without barcode) reads, 25 matched the genome, and none hit the gB gene. In the piglet sample, 315 reads aligned to the reference genome, covering 53.69% of it.
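The share of the reference genome covered by aligned reads is the breadth of coverage: the length of the union of all aligned intervals divided by the genome length. The following Python sketch shows this calculation under stated assumptions; the interval convention (0-based, half-open), the function name and the example numbers are invented for illustration and do not reproduce the alignments of this study.

```python
def coverage_breadth(intervals, genome_length):
    """Fraction of a genome covered by at least one aligned interval.

    intervals: iterable of (start, end) pairs, 0-based and half-open.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    covered = 0
    current_start = current_end = None
    for start, end in sorted(intervals):
        # Clip each interval to the genome boundaries.
        start = max(start, 0)
        end = min(end, genome_length)
        if start >= end:
            continue
        if current_end is None or start > current_end:
            # A new, disjoint block begins: close the previous one first.
            if current_end is not None:
                covered += current_end - current_start
            current_start, current_end = start, end
        else:
            # Overlapping or adjacent interval: extend the current block.
            current_end = max(current_end, end)
    if current_end is not None:
        covered += current_end - current_start
    return covered / genome_length


# Made-up alignments on a hypothetical 1,000 bp reference.
example = [(0, 300), (250, 400), (600, 700)]
breadth = coverage_breadth(example, 1000)
```

In this toy case the first two intervals merge into one 400 bp block and the third adds 100 bp, so half of the hypothetical reference is covered.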
One sequence overlaps the gB gene and covers the region between nucleotides 49507 and 50172 of the representative reference genome. The identity was 631/671 (94%) and 641/665 (96%), with gaps of 32/671 (4%) and 18/665 (2%), for the fast and high-accuracy basecalled sequence, respectively. The similarity of the high-accuracy basecalled sequence to other gB sequences is shown by a cladogram in Fig 2. Based on sequence similarity, the multiple sequence alignment of the closest strains is shown in Fig 3. The deletions identified in our sample are not found in the other gB genes. Apart from the deletions, only two of the sequence variations shown in Fig 3 (insertions at positions 87 and 139) occur at sites where no other gB gene is modified.
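The identity and gap percentages quoted above follow from simple ratios over the alignment length. The short Python sketch below recomputes them from the reported counts. Truncating to a whole percent is an assumption of this example, chosen because it reproduces the reported figures (rounding 32/671 to the nearest integer would give 5% rather than 4%); the function and variable names are hypothetical.

```python
def whole_percent(part: int, total: int) -> int:
    """Percentage truncated to a whole number (assumed reporting convention)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return (100 * part) // total


# Counts reported in the text for the fast and high-accuracy basecalled reads.
alignments = {
    "fast": {"identical": 631, "gaps": 32, "length": 671},
    "hac": {"identical": 641, "gaps": 18, "length": 665},
}

summary = {
    name: {
        "identity_pct": whole_percent(a["identical"], a["length"]),
        "gap_pct": whole_percent(a["gaps"], a["length"]),
    }
    for name, a in alignments.items()
}
```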
Figure 2. Cladogram based on gB gene sequence similarity. The strain_RT is the sequence obtained by high-accuracy basecalling. [Legend (Origin): China, Hungary, Japan, South Korea, Spain, United Kingdom, NA. Tip labels, in plotted order: AF394056.1, FJ870561.1, FJ844360.1, KX575702.1, KX575699.1, KX575694.1, KX575686.1, KX575684.1, KX575682.1, KX575680.1, KX575679.1, KX575678.1, KX575676.1, KX575675.1, KX575674.1, KX575672.1, KX575671.1, KX575670.1, KX575668.1, KX575667.1, KX575666.1, KX575701.1, KC342285.1, KC342275.1, KC342269.1, KC342268.1, KC342284.1, KC342274.1, KC342283.1, KC342279.1, KC342282.1, KC342280.1, KC342286.1, KC342267.1, KC342288.1, KC342287.1, KC342281.1, KC342278.1, KC342276.1, KC342273.1, KC342272.1, KC342270.1, JN701021.1, KC342266.1, KC342271.1, KX575691.1, KX575688.1, strain_RT, KX575698.1, KX575693.1, AF268040.2, KX575690.1, KX575689.1, LC064808.1, KF017583.1, AF268041.2, FJ870563.1, KX575692.1, KX575708.1, KX575707.1, KX575706.1, KX575705.1, KX575704.1, KX575703.1, KX575700.1, KX575695.1, KX575687.1, KX575685.1, KX575683.1, KX575681.1, KX575677.1, KX575673.1, KX575669.1, KX575697.1, KX575665.1, KX575696.1, EF460488.1, HQ686080.1, FJ595497.1, FJ870562.1, FJ870564.1, AB771707.1, KC342277.1, KC342289.1, AB771708.1, AF268039.2, AF394057.1, HQ686081.1, AB771706.1.]
strain_RT_fast TCTGTTCATTAAATACCTCTTTAGAAGTTCCATTAAAAATGGCTGATATCTCCAACCGATTCACCTGTACCGAGTATANAATTATN 87
strain_RT_hac TCTGTTCATTAAATACCTCTTTAGAAGTTCCATTAAAAAATGGTGATATCTNCAACCGATTCACCTGTACCGAGTATAAAATTATC 87
AF268040.2 TCTGTTCATTAAATACCTCTTTAGAAGTTCCATTAAAAAATGGTGATATCTNCAACCGATTCACCCGTACCGAGTATAAAATTATN 87
FJ870563.1 TCTGTTCATTAAATACCTCTTTAGAAGTTCCATTAAAAAATGGTGATATCTNCAACCGATTCACCTGTACCGAGTATAAAATTATN 87
LC064808.1 TCTGTTCATTAAATACTTCTTTAGAAGTTCCATTAAAAAATGGTGATATCTNCAACCGATTCACCTGTACCGAGTATAAAATTATN 87
KF017583.1 TCTGTTCATTAAATACTTCTTTAGAAGTTCCATTAAAAAATGGTGATATCTNCAACCGATTCACCTGTACCGAGTATAAAATTATN 87
AF268041.2 TCTGTTCATTAAATACTTCTTTAGAAGTTCCATTAAAAAATGGTGATATCTNCAACCGATTCACCTGTACCGAGTATAAAATTATN 87
strain_RT_fast CATATGGGTAATCAGATTTACCAATAGTATCAGTTACTATGCAATTAATAGAAAGATGAGCTTTTATATAACCAATGGGTTCCATA 173
strain_RT_hac CATATGGGTAATCAGATTTACCAATAGTATCAGTTACTATGCAATTAATAGAAAGATGAGCTTTTATATAACCAATGGGTTCCATA 173
AF268040.2 CATATGGGTAATCAGATTTACCCATAGTATCAGTTATTATGCAATTAATAGNAAGATGAGCTTTTATATAACCAATGGGTTCCATA 173
FJ870563.1 CATATGGGTAATCAGATTTACCCATAGTATCAGTTATTATGCAATTAATAGNAAGATGAGCTTTTATATAACCAGTGGGTTCCATA 173
LC064808.1 CATATGGGTAATCAGATTTACCCATAGTATCAGTTACTATGCAATTAATAGNAAGATGAGCTTTTATATAACCAATGGGTTCCATA 173
KF017583.1 CATATGGGTAATCAGATTTACCCATAGTATCAGTTACTATGCAATTAATAGNAAGATGAGCTTTTATATAACCAATGGGTTCCATA 173
AF268041.2 CATATGGGTAATCAGATTTACCCATAGTATCAGTTACTATGCAATTAATAGNAAGATGAGCTTTTATATAACCAATGGGTTCCATA 173
strain_RT_fast TGTGAACTGAAAATCAGGCGTGGATATGTAACGAGTATTAATAGTAGATCCAAATTTCAAACGATATATTTGTATTGTNNNNTTTG 259
strain_RT_hac TGTGAACTGAAAATCAGGCGTGGATATGTAACGAGTATTAATAGTAGATCCAAATTTCAAACGATATATNNNNNTTGTATTGTTTG 259
AF268040.2 TGTGAACTGAAAATCAGGCGTGGATATGTAACGAGTATTAATAGTAGATCCAAATTTCAAACGATATAATCTCATTGTATTGTTTG 259
FJ870563.1 TGTGAACTGAAAATCAGGCGTGGATATGTAACGAGTATTAATAGTAGATCCAAATTTCAAACGATATAATCTCATTGTATTGTTTG 259
LC064808.1 TGTGAACTGAAAATCAGGCGTGGATATGTAACGAGTATTAATAGTAGATCCAAATTTCAAACGATATAATCTCATTGTATTGTTTG 259
KF017583.1 TGTGAACTGAAAATCAGGCGTGGATATGTAACGAGTATTAATAGTAGATCCAAATTTCAAACGATATAATCTCATTGTATTGTTTG 259
AF268041.2 TGTGAACTGAAAATCAGGCGTGGATATGTAACGAGTATTAATAGTAGATCCAAATTTCAAACGATATAATCTCATTGTATTGTTTG 259
strain_RT_fast TATTATCATCTTTATGATATACCCGATAATTTATTCCCTGGTTTCTAATCTCGGCAGCGGAAAAAACATTGACCATTCAAATTTAT 345
strain_RT_hac TATTATCATCTTTATGATATACCCGATAATTTATTCCCTGGTTTCTAATCTCGGCAGCGGNAAAAACATTGACCATTCAAATTTAT 345
AF268040.2 TATTATCATCTTTATGATATACACGATAATTTATTCCCTGGTTTCTAATCTCGGCAGCGGNAAAAACATTGACCATTTAAATTTAT 345
FJ870563.1 TATTATCATCTTTATGATATACCCGATAATTTATTCCCTGGTTTCTAATCTCGGCAGCGGNAAAAACATTGACCATTCAAATTTAT 345
LC064808.1 TATTATCATCTTTATGATATACCCGATAATTTATTCCCTGGTTTCTAATCTCGGCAGCGGNAAAAACATTGACCATTCAAATTTAT 345
KF017583.1 TATTATCATCTTTATGATATACCCGATAATTTATTCCCTGGTTTCTAATCTCGGCAGCGGNAAAAACATTGACCATTCAAATTTAT 345
AF268041.2 TATTATCATCTTTATGATATACCCGATAATTTATTCCCTGGTTTCTAATCTCGGCAGCGGNAAAAACATTGACCATTCAAATTTAT 345
strain_RT_fast ATATCCAGCTTCATCTACAGGAACGGGCACTTTATATGAAGACCTATCAACGAGATAGATGACATGAACGTCTCTATACGTCGTTT 431
strain_RT_hac ATATCCAGCTTCATCTACAGGAACGGGCACTTTATATGAAGACCTATCAACGAGATAGATGACATGAACGTCTCTATACGTCGTTT 431
AF268040.2 ATATCCAGCTTCATCTACAGGAACGGGCACTTTATATGAAGACCTATCAACAAGATAGATGACATGAACGTCTCTATACGTCGTTT 431
FJ870563.1 ATATCCAGCTTCATCTACAGGAACGGGCACTTTATATGAAGACCTATCAACAAGATAGATGACATGAACGTCTCTATACGTCGTTT 431
LC064808.1 ATATCCAGCTTCATCTACAGGAACGGGCACTTTATATGAAGACCTATCAACGAGATAGATGACATGAACGTCTCTATACGTCGTTT 431
KF017583.1 ATATCCAGCTTCATCTACAGGAACGGGCACTTTATATGAAGACCTATCAACAAGATAGATGACATGAACGTCTCTATACGTCGTTT 431
AF268041.2 ATATCCAGCTTCATCTACAGGAACGGGCACTTTATATGAAGACCTATCAACAAGATAGATGACATGAACGTCTCTATACGTCGTTT 431
strain_RT_fast GAAACGACAATTCCTTGGTATACGTCTAACAAAAAAAAATTATTGCGGAACTATANTTTTCTTAAACAATAACAAAATACCTTCCA 517
strain_RT_hac GAAACGACAATTCCTTGGTATACATCCTAACAAAAAAAGTNNGTGCGGAACTATATTTTTCTTAAACAATAACNAAATACCTTNCA 517
AF268040.2 GAAACGACAATTCCTTGGTATACGTCCTAACAAAAAAAGTNNGTGCGGAACTATATTTTTCTTAAACAATAACAAAATACCTTNCA 517
FJ870563.1 GAAACGACAATTCCTTGGTATACGTCCTAACAAAAAAAGTNNGTGCGGAACTATATTTTTCTTAAACAATAACAAAATACCTTNCA 517
LC064808.1 GAAACGACAATTCCTTGGTATACGTCCTAACAAAAAAAGTNNGTGCGGAACTATATTTTTCTTAAACAATAACAAAATACCTTNCA 517
KF017583.1 GAAACGACAATTCCTTGGTATACGTCCTAACAAAAAAAGTNNGTGCGGAACTATATTTTTCTTAAACAATAACAAAATACCTTNCA 517
AF268041.2 GAAACGACAATTCCTTGGTATACGTCCTAACAAAAAAAGTNNGTGCGGAACTATATTTTTCTTAAACAATAACAAAATACCTTNCA 517
strain_RT_fast GAATACTGANNNNNNNNNNNTTACTTATTACAAGTGATATAATTATNAAATCTATATAGATCTGTTCCAACGACACATATTGCATA 603
strain_RT_hac GAATACTGAAGNNNNNNNNNATACTTATTACAAGTGATATAATTATNAAATCTATATAGATCTGTTCCAACGGCNCATATTGCATA 603
AF268040.2 GAATACTGAGTCTCCGTATCATACTTATTACAAGTGATATAATTATCAAATCTATATAGATCTGTTCCAACGGCNCATATTGCATA 603
FJ870563.1 GAATACTGAGTCTCCGTATCATACTTATTACAAGTGATATAATTATCAAATCTATATAGATCTGTTCCAACGGCNCATATTGCATA 603
LC064808.1 GAATACTGAGTCTCCGTATCATACTTATTACAAGTGATATAATTATCAAATCTATATAGATCTGTTCCAACGGCNCATATTGCATA 603
KF017583.1 GAATACTGAGTCTCCGTATCATACTTATTACAAGTGATATAATTATCAAATCTATATAGATCTGTTCCAACGGCNCATATTGCATA 603
AF268041.2 GAATACTGAGTCTCCGTATCATACTTATTACAAGTGATATAATTATCAAATCTATATAGATCTGTTCCAACGGCNCATATTGCATA 603
strain_RT_fast CACGAAAAGGATANTTTTCATCATCGCTTGCTTCAGTGTATTCCTGTGAAGAATTTCCGGTAATGTNNN 672
strain_RT_hac CACGAAAAGGATATTTTTCATCATCGCTTGCTTCAGTGTATTCCTGTGAAAAATTTCCGGTAATGTTCA 672
AF268040.2 CACGAAAAGGATATTTTTCATCATCGCTTGCTTCAGTGTATTCCTGTGAAAAATTTCCGGTAATGTTCA 672
FJ870563.1 CACGAAAAGGATATTTTTCATCATCGCTTGCTTCAGTGTATTCCTGTGAAAAATTTCCGGTAATGTTCA 672
LC064808.1 CACGAAAAGGATATTTTTCATCATCGCTTGCTTCAGTGTATTCCTGTGAAAAATTTCCGGTAATGTTCA 672
KF017583.1 CACGAAAAGGATATTTTTCATCATCGCTTGCTTCAGTGTATTCCTGCGAAAAATTTCCGGTAATGTTCA 672
AF268041.2 CACGAAAAGGATATTTTTCATCATCGCTTGCTTCAGTGTATTCCTGTGAAAAATTTCCGGTAATGTTCA 672
Figure 3. Multiple sequence alignment with the most similar strains. The sequence matching the gB gene, basecalled with the fast and high-accuracy configurations, is labelled strain_RT_fast and strain_RT_hac, respectively. Strains FJ870563.1 and KF017583.1 (representative reference genome) originated from China, AF268041.2 and LC064808.1 from Japan, and AF268040.2 from Spain. The unique insertions are highlighted in blue.
Discussion
In an ONT-based metagenomic study, we identified sequences of PCMV, including a 650 bp long one that matched the gB gene. It is the second report of the presence of the virus in Hungary but the first with a comparable genomic sequence.
High-accuracy basecalling resulted in fewer polymorphisms and shorter deletions compared to fast basecalling. Although we cannot know with certainty the exact sequence of the virus in our sample, we suppose that polymorphisms identified by the high-accuracy approach may be reliable. Not only do the deletions represent a difference from the reference genome, but none of the other gB genes has similar ones. A closer look at the pattern of the reference genome at the beginning of the deletions in Figure 3 reveals the sequence TCTC. This may be a short tandem repeat (STR or microsatellite), which Delahaye and Nicolas [18] associate with the appearance of deletions originating from ONT basecalling. Unfortunately, only one read in our samples matched the gB gene. Perhaps if we had sequenced the samples more deeply and had more overlapping reads, these deletions could have been filled in.
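The observation about the TCTC motif can be checked mechanically by scanning a sequence for short tandem repeats. The Python sketch below is a minimal illustration using a regular expression; the function name, the default unit length of 2 and the minimum of two copies are assumptions of this example, not parameters used in the study. The test fragment is copied from the KF017583.1 row of Figure 3.

```python
import re


def find_tandem_repeats(seq: str, unit_len: int = 2, min_copies: int = 2):
    """Return (start, unit, copies) for non-overlapping tandem repeats.

    The scan runs left to right, so overlapping candidates are reported once.
    Units made of a single repeated base (homopolymers) are also reported.
    """
    if unit_len < 1 or min_copies < 2:
        raise ValueError("unit_len must be >= 1 and min_copies >= 2")
    # A unit of unit_len bases followed by at least (min_copies - 1) copies of itself.
    pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_copies - 1))
    hits = []
    for m in pattern.finditer(seq.upper()):
        unit = m.group(1)
        hits.append((m.start(), unit, len(m.group(0)) // unit_len))
    return hits


# Fragment taken from the KF017583.1 row of Figure 3.
fragment = "GATATAATCTCATTGTATTG"
repeats = find_tandem_repeats(fragment)
```

On this fragment the scan reports an AT dinucleotide repeat and the TC repeat that forms the TCTC motif mentioned above, each with two copies.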
The vast majority of PCMV sequences were found in the piglet sample, which can be explained by the age-specificity of infection and disease. Sneezing and rhinitis in piglets up to 6 weeks of age have been a persistent but mild problem on the farm for several years. Several pathogens have been identified in past investigations of this problem, but their treatment has not resulted in a solution. The small number of reads in the fattening pigs may be a consequence of barcode cross-talk, in which case they are, in fact, derived from the piglet sample [19]. However, it is also possible that minimal levels of the virus are present in the fatteners. This is supported by our metagenomic analysis of nasal swabs from fattening pigs on the same farm, using Illumina sequencing in 2019, in which two pooled samples were analyzed. One had 205, and the other 118 reads matching the PCMV reference genome, but no reads for the gB gene.
The experience of our study suggests that ONT metagenomics can be a promising tool for rapidly detecting pathogens in farm animals. However, instead of using multiplex sequencing as we have done, we should consider using smaller, single-use flow cells to sequence samples individually, avoiding barcode cross-talk.
Declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Availability of data and material
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
Competing interests
The authors declare that they have no competing interests.
Funding
The research was supported by the European Union’s Horizon 2020 research and innovation program under Grant Agreement
No. 874735 (VEO).
Author contributions statement
NS takes responsibility for the data’s integrity and the data analysis’s accuracy. AGT, NS, and TR conceived the concept
of the study. AGT, LM, NS, RF, and TR performed sample collection. AGT, ÁB, RF, and SS did the DNA extraction and
metagenomics library preparation. NS participated in the bioinformatic analysis. AGT and NS participated in the drafting of
the manuscript. AGT, IC, and NS completed the manuscript’s critical revision for important intellectual content. All authors
read and approved the final manuscript.
References
1. Chiu, C. Y. & Miller, S. A. Clinical metagenomics. Nat. Rev. Genet. 20, 341–355 (2019).
2. Karlsson, O. E., Norling, M., Granberg, F., Belák, S. & Bongcam-Rudloff, E. Viral metagenomics–new applications for the broad-range detection of viromes in veterinary and public health settings. EMBnet.journal 19, 21–22 (2013).
3. Mettenleiter, T. C., Ehlers, B., Müller, T., Yoon, K.-J. & Teifke, J. P. Herpesviruses, chap. 35, 548–575 (John Wiley & Sons, Ltd, 2019).
4. Denner, J. Xenotransplantation and porcine cytomegalovirus. Xenotransplantation 22, 329–335 (2015).
5. Denner, J. et al. Impact of porcine cytomegalovirus on long-term orthotopic cardiac xenotransplant survival. Sci. Rep. 10, 1–14 (2020).
6. Halecker, S. et al. How, where and when to screen for porcine cytomegalovirus (PCMV) in donor pigs for xenotransplantation. Sci. Rep. 12, 1–10 (2022).
7. Deim, Z., Glávits, R., Biksi, I., Dencső, L. & Ráczné, A. Inclusion body rhinitis in pigs in Hungary. The Vet. Rec. 158, 832 (2006).
8. De Coster, W., D’hert, S., Schultz, D. T., Cruts, M. & Van Broeckhoven, C. NanoPack: visualizing and processing long-read sequencing data. Bioinformatics 34, 2666–2669 (2018).
9. Wood, D. E., Lu, J. & Langmead, B. Improved metagenomic analysis with kraken 2. Genome biology 20, 1–13 (2019).
10. Pruitt, K. D., Tatusova, T. & Maglott, D. R. NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic acids research 33, D501–4 (2005).
11. Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
12. Li, H. New strategies to improve minimap2 alignment accuracy. Bioinformatics 37, 4572–4574 (2021).
13. Zhang, Z., Schwartz, S., Wagner, L. & Miller, W. A greedy algorithm for aligning DNA sequences. J. Comput. biology 7, 203–214 (2000).
14. Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic acids research 32, 1792–1797 (2004).
15. Yu, G. Using ggtree to visualize data on tree-like structures. Curr. protocols bioinformatics 69, e96 (2020).
16. Schliep, K. P. phangorn: phylogenetic analysis in R. Bioinformatics 27, 592–593 (2011).
17. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2022).
18. Delahaye, C. & Nicolas, J. Sequencing DNA with nanopores: Troubles and biases. PLoS ONE 16, 1–29 (2021).
19. Xu, Y. et al. Detection of viral pathogens with multiplex nanopore minion sequencing: be careful with cross-talk. Front. microbiology 9, 2225 (2018).