Technical ReportPDF Available

Automated Library Construction Using KAPA Library Preparation Kits on the Agilent NGS Workstation Yields High-Quality Libraries for Whole-Genome Sequencing on the Illumina Platform

Authors:

Abstract

A new method was developed to automate the KAPA HTP Library Preparation kit for microbial whole genome sequencing. This method uses the Agilent NGS Workstation, consisting of the NGS Bravo liquid handling platform with its accessories for heating, cooling, shaking, and magnetic bead manipulations in a 96-well format. User intervention in multistep protocols is minimized through the use of other components of the workstation such as the BenchCel 4R Microplate Handler and Labware MiniHub for labware storage and movement. This method has been validated for sequencing on the Illumina platform and consists of three protocols: the first is for end repair to post-ligation cleanup; the second is used for library amplification setup; and the third is for the post-amplification cleanup. The modular design provides the end-user with the flexibility to complete library construction over two days, and is suitable for the construction of high-quality libraries from bacteria of various GC content. This combined solution produced a workflow that is suitable for production-scale sequencing projects such as the 100K Pathogen Genome Project.
Automated Library Construction
Using KAPA Library Preparation Kits
on the Agilent NGS Workstation
Yields High-Quality Libraries for
Whole-Genome Sequencing on the
Illumina Platform
Authors
Nguyet Kong, Kao Thao, Carol Huang,
and Bart C. Weimer
Population Health and Reproduction
Department, School of Veterinary
Medicine, University of
California-Davis, Davis, CA, USA
Maryke Appel
Kapa Biosystems, Inc.
Wilmington, MA USA
Stephen Lappin, Lisa Knapp, and
Lenore Kelly
Agilent Technologies, Inc.
Santa Clara, CA, USA
Application Note
Abstract
A new method was developed to automate the KAPA HTP Library Preparation kit for
microbial whole genome sequencing. This method uses the Agilent NGS
Workstation, consisting of the NGS Bravo liquid handling platform with its acces-
sories for heating, cooling, shaking, and magnetic bead manipulations in a 96-well
format. User intervention in multistep protocols is minimized through the use of
other components of the workstation such as the BenchCel 4R Microplate Handler
and Labware MiniHub for labware storage and movement. This method has been
validated for sequencing on the Illumina platform and consists of three protocols:
the first is for end repair to post-ligation cleanup; the second is used for library
amplification setup; and the third is for the post-amplification cleanup. The modular
design provides the end-user with the flexibility to complete library construction
over two days, and is suitable for the construction of high-quality libraries from bac-
teria of various GC content. This combined solution produced a workflow that is
suitable for production-scale sequencing projects such as the 100K Pathogen
Genome Project.
2
Introduction
Reduced costs and higher throughput have rendered microbial
whole genome sequencing (WGS) accessible to many appli-
cations in infectious disease, food safety, and public health to
produce genomes on an unprecedented scale. The 100K
Pathogen Genome Project hosted at UC Davis and founded by
Agilent Technologies, the FDA, and UC Davis, is a novel
public/private/government consortium to sequence 100,000
bacterial zoonotic and food-borne pathogens that are of sig-
nificant importance to the public, using next generation
sequencing (NGS) technologies. The project has sufficient
sequencing capacity to produce as many as 25,000 bacterial
genomes per year using automated workflows. This capacity
in sequencing created a challenge to establish an automated
library construction workflow to fully enable the reliable and
robust sequencing pipeline.
NGS library construction comprises repetitive processes,
making it amenable to automation. Automation offers many
advantages, including sample throughput, reduced hands-on
time, greater reproducibility, and improved process control.
However, the development and validation of automated proto-
cols is not trivial. Optimal workflows require robust chemistry,
robotics, and meticulous optimization of many parameters to
achieve acceptable libraries that yield sequence data of qual-
ity similar to those produced by skilled, experienced, and
attentive technicians in a low-throughput setting.
The NGS Workstation platform offers reliable and repro-
ducible automation of library preparation processes. With
minimal user-intervention to produce robust libraries, the
graphical user interface of the Agilent VWorks Automation
Control Software makes the system easy to operate in a pro-
duction setting, while the open architecture of the software
facilitates new method development and customization to
optimize specific steps of the protocol.
Library preparation kits from Kapa Biosystems offer robust
library construction methods for a range of DNA inputs
(100 pg–5 µg) for a variety of sequencing applications, includ-
ing WGS, targeted sequencing, ChIP-Seq, RNA-Seq, and
Methyl-Seq. Reagents are formulated for optimal activity and
stability, and exhibit excellent conversion rates of input DNA
to adapter-ligated libraries through the use of a highly opti-
mized, automation-friendly protocol using beads [1]. Kits con-
tain the engineered KAPA HiFi DNA Polymerase, which has
become widely accepted for high-efficiency, high-fidelity,
low-bias NGS library amplification [2,3,4,5]. These reagents
were selected by the 100K Pathogen Genome Project for auto-
mated library construction due to the excellent sequence
results in the initial phase of the project using manual
approaches. However, increased capacity required automation
of this protocol to meet the high-throughput demands. This
application note details the automated production of high
quality bacterial libraries that are reproducible and provide
excellent sequencing results using the KAPA HTP Library
Preparation Kit on the Agilent NGS Workstation for use in the
100K Pathogen Genome Project.
Materials and Methods
The 100K Pathogen Genome Project sample preparation work-
flow for multiplexed, paired-end (2 × 100 bp) Illumina sequenc-
ing libraries uses accepted methods for bacterial sequencing
projects (Figure 1). Collection of high molecular weight
genomic DNA is critical for successful library construction.
Optimization of the cell lysis and DNA isolation is described in
previous documentation [6]. High-throughput QC of the high
molecular weight genomic DNA using the Agilent 2200
TapeStation System is also documented elsewhere [7].
Figure 1. 100K Pathogen Genome Project sample preparation workflow for
multiplexed, short-read Illumina sequencing.
DNA extraction
QC: DNA quality and quantity
Shear DNA
QC: fragmentation
Library construction
Library normalization
and pooling
Submit for sequencing
3
To validate the automated method, DNA for NGS library con-
struction was extracted from selected bacterial isolates with
different GC content. These microbes are listed in Table 1.
Organisms were lysed using the KAPA Express Extract Kit
(KK7102) [8], after which the DNA was purified with a Qiagen
QIAamp DNA Mini Kit (51306) using the manufacturer’s
instructions [9]. Before shearing, the extracted DNA was
analyzed using an Agilent 2200 TapeStation system with the
Genomic DNA ScreenTape assay for integrity of high
molecular weight DNA [7,10,11]. Fragmented DNA was quanti-
fied with the method described by Jeannotte et al. [7] using an
Agilent 2200 TapeStation system as the 96-well plate
high-throughput workflow for quantitation and sizing of gDNA
samples for library construction [9].
Table 1. Library Construction Metrics for Libraries Prepared from Different
Bacteria
Bacterium
Gram
reaction
Approximate
genome size
(MB)
GC
content
(%)
Average
library
size (bp)
Final
library
yield (ng)
Lactococcus Positive 2 35 430 679
Listeria Positive 2 38 307 949
Vibrio Negative 5 (two
chromosomes)
41 304 198
Escherichia Negative 5 51 347 361
Salmonella Negative 5 52 299 287
DNA was sheared in batches of 96 samples using microtubes
with the Covaris E220 Focused Ultrasonicator [12]. The frag-
mented DNA size was determined with the Agilent 2100
Bioanalyzer system and High Sensitivity DNA Kit, to confirm a
normal size distribution around a ~300 bp peak.
The input into library construction with the KAPA HTP Library
Preparation Kit (KK8234, Figure 2) was normalized to 1–5 µg
for all samples [13]. The standard KAPA protocol with
dual-SPRI size selection after adapter-ligation was used to
achieve libraries with a fragment distribution in the range of
250–450 bp. Library amplification was done for eight cycles
using the KAPA HiFi HotStart ReadyMix, followed by a final 1X
SPRI bead cleanup step. The size distributions of amplified
libraries were confirmed to be in the range of 200–500 bp,
using the Agilent 2100 Bioanalyzer system with the High
Sensitivity DNA Kit [14,15]. Libraries were quantified with the
qPCR-based KAPA Library Quantification Kit (KK4824) prior to
normalization and pooling for sequencing [16] at BGI@UCD
(Sacramento, CA) on the Illumina HiSeq 2000.
Input DNA
Fragmentation
Adaptor ligation
Library amplification
End repair
A-tailing
Size selection
Figure 2. Detailed KAPA HTP Library Preparation protocol. The input into
library construction is fragmented DNA or cDNA. Each enzymatic
reaction is followed by a SPRI-bead cleanup (1.7X after End
Repair; 1.8X after A-Tailing, and two consecutive 1X cleanups
after adapter ligation). The “with-bead” protocol uses a single
aliquot of SPRI beads for all cleanups prior to library amplification,
and significantly reduces the loss of library fragments associated
with the physical transfer of material between enzymatic reac-
tions. This results in higher yields of adapter-ligated libraries, and
reduces the number of amplification cycles required to generate
sufficient material for Library QC and sequencing.
4
The entire library construction workflow, from fragmentation
to the post-amplification cleanup, can be completed in as little
as 9 hours. The validated, automated KAPA HTP Library
Preparation method for the NGS Workstation (p/n G5522A;
Figure 3) consists of three protocols, each with a graphical
user interface or Form (Figure 4A–4C). Three variations of the
first protocol (end repair to post-ligation cleanup) are avail-
able to provide the end user with the option of preparing
libraries without size selection, employing a SPRI-bead based
size selection, or performing off-deck size selection. The
second protocol is used for library amplification setup, and the
third for the post-amplification cleanup. This modular design
provides the end-user with the flexibility to complete library
construction over two days, or implement physical separation
of pre- and post-PCR procedures. The VWorks Forms are sup-
plemented by a Microsoft Excel workbook (Figure 4D) to facili-
tate reagent preparation and instrument setup, and can be
used to keep a complete record of each experiment.
Consumables used on the NGS Workstation were selected for
optimal performance. Labware movements and pipetting para-
meters for the KAPA HTP Library Preparation protocols were
carefully optimized to minimize reagent and consumable
waste and ensure equal or better library yields and quality
compared to manual preparation methods.
Figure 3. The Agilent NGS Workstation. This instrument comprises the
highly reliable Agilent Bravo Automated Liquid Handling Platform,
Agilent Bravo accessories for heating, cooling, shaking, and mag-
netic bead manipulations. User intervention in multistep protocols
is minimized through the use of the BenchCel 4R Microplate
Handler and Labware MiniHub for labware storage and movement.
5
Figure 4. VWorks Forms and a Microsoft Excel workbook for the KAPA HTP Library Preparation method. A) End Repair to Post-ligation Cleanup Form, B) Library
Amplification Setup Form, C) Post-amplification Cleanup Form, D) Microsoft Excel workbook. Each run is configured by selecting the number of plate
sample columns (1, 2, 3, 4, 6, or 12 columns, and 8, 16, 24, 32, 48, or 96 samples), after which, the interface displays the deck setup and workstation
setup for the protocol. The Microsoft Excel workbook is designed to guide reagent preparation and instrument setup, and can be used to create a
record of each experiment. The VWorks software has many additional, convenient features, including various ways of tracking the progress during a
run, and intuitive manual control of over instrument components to facilitate error recovery.
A B
C D
6
Results and Discussion
Prior to shearing, the extracted DNA was analyzed using the
Agilent 2200 TapeStation system with the Genomic DNA
ScreenTape assay (Figure 5). A typical electropherogram for
fragmented bacterial DNA used for library construction is
given in Figure 6. Electropherograms and virtual gel images
for representative libraries prepared from Salmonella,
Escherichia, Vibrio, Listeria and Lactococcus isolates using
the KAPA HTP Library Preparation method on an Agilent NGS
Workstation are given in Figure 7. Interestingly, libraries
generated from different lactococcal isolates displayed
different apparent library fragment sizes, but all of these
libraries produced adequate sequence results. The higher
molecular weight peak in the electropherograms for Listeria
libraries are typical of over-amplification (primer depletion
during library amplification). Since the average yield of
adapter-ligated library was higher for Listeria than for the
other bacteria, the number of amplification cycles could have
been reduced. The enteric pathogens also produced similar
sized libraries that produced excellent sequence results.
Library construction metrics (average library size and final
library yields) for libraries prepared from different bacteria are
summarized in Table 1.
Figure 5. Typical electropherogram generated with the Agilent 2200
TapeStation system and Genomic DNA ScreenTape assay to
assess integrity of high molecular weight genomic DNA used for
library construction. Each line represents an individual Listeria
isolate.
600
500
400
300
[bp]
200
100
600
900
1,200
1,500
Sample intensity [FU]
2,000
2,500
3,000
4,000
7,000
15,000
48,500
0
Figure 6. A typical electropherogram (generated on the Agilent 2100 Bioanalyzer system and
High Sensitivity DNA Kit) of Covaris-sheared DNA used for library construction.
Fragmentation parameters were selected to produce a fragment size distribution with
a peak in the range of ~300 bp, which is ideal for 2 x 100 bp paired-end sequencing on
the Illumina HiSeq 2000.
250
[FU]
200
150
100
50
35 100 150 200 300 400 500 600 1,000 2,000 10,380 [bp]
D1
0
7
300
250
200
150
100
50
0
40 50 60 70 80 90 100 110 120 130 sec
300
250
200
150
100
50
0
40 50 60 70 80 90 100 110 120 130 sec
Ladder
200
150
100
50
0
40 50 60 70 80 90 100 110 120 sec
7,000
[bp]
2,000
1,000
600
500
400
300
200
150
100
35
Ladder
7,000
[bp]
2,000
1,000
600
500
400
300
200
150
100
35
Ladder
7,000
[bp]
2,000
1,000
600
500
400
300
200
150
100
35
150
200
100
50
0
40 50 60 70 80 90 100 110 120 130 sec
200
250
150
100
50
0
40 50 60 70 80 90 100 110 120 130 sec
Ladder
7,000
[bp]
2,000
1,000
600
500
400
300
200
150
100
35
Ladder
7,000
[bp]
2,000
1,000
600
500
400
300
200
150
100
35
Lactococcus, 35% GC
The average library size for
Lactococcus is 430 bp
Listeria, 38% GC
The average library size for
Listeria is 307 bp.
Vibrio, 41% GC
The average library size for
Vibrio is 304 bp.
Escherichia, 51% GC
The average library size for
Escherichia is 347 bp.
Salmonella, 52% GC
The average library size for
Salmonella is 299 bp
Figure 7. Representative electropherograms and virtual gel images (generated on the Agilent 2100 Bioanalyzer system with the High Sensitivity DNA Kit) of
bacterial libraries prepared for whole genome sequencing with the KAPA HTP Library Preparation Kit on the Agilent NGS Workstation. The average
library size for each genus was as indicated. Peaks at 35 and 10381 bp are internal standards used for alignment and quantitation determination with
the Agilent 2100 Bioanalyzer system.
www.agilent.com/chem
Agilent shall not be liable for errors contained herein or for incidental or consequential
damages in connection with the furnishing, performance, or use of this material.
Information, descriptions, and specifications in this publication are subject to change
without notice.
© Agilent Technologies, Inc., 2014
Printed in the USA
May 19, 2014
5991-4296EN
Conclusion
The KAPA HTP Library Preparation Kit and Agilent NGS
Workstation provide a robust, high-throughput automation
solution for the construction of high-quality libraries for
microbial whole genome sequencing on the Illumina plat-
form. This combined solution produced a workflow that is
suitable for the construction of high-quality libraries for
microbial whole genome sequencing in a production scale
sequencing project like the 100K Pathogen Genome Project.
Acknowledgements
We gratefully acknowledge the technical assistance provided
by Winnie Ng, Poyin Chen, Narine Arabyan, Soraya Foutouhi,
Elizabeth Marta, Kerry Le, Sum Leung, Lucy Cai, Alan Truong,
Christina Kong, Vivian Lee, Alvin Leonardo, Alex Weimer,
Giovanni Tenorio, Patrick Ancheta, Azarene Foutouhi, and
Kendra Liu from the laboratory of Dr. Bart Weimer at
University of California, Davis.
References
1. S. Fisher, et al., Genome Biology 12, R1 (2011).
2. M.A. Quail, et al., Nature Methods 9, 10–11 (2012).
3. S.O. Oyola, et al., BMC Genomics 13, 1 (2012).
4. M.A. Quail, et al., BMC Genomics 13: 341 (2012).
5. M.G. Ross, et al., Genome Biology 14: R51 (2013).
6. N. Kong, et al. Agilent Technologies Application Note
(5991-3722EN): Production and Analysis of High
Molecular Weight Genomic DNA for NGS Pipelines
Using Agilent DNA Extraction Kit (p/n 200600).
7. R. Jeannotte, et al. Agilent Technologies Application
Note (5991-4003EN): High-Throughput Analysis of
Foodborne Bacterial Genomic DNA Using Agilent 2200
TapeStation and Genomic DNA ScreenTape System.
8. KAPA Express Extract Kits: http://www.kapabiosys-
tems.com/products/name/kapa-express-extract-kits
9. Qiagen QIAamp DNA Mini Kit:
http://www.qiagen.com/products/catalog/sample-
technologies/dna-sample-technologies/genomic-
dna/qiaamp-dna-mini-kit
10. Agilent 2200 TapeStation User Manual, Agilent
Technologies (p/n G2964-90001).
11. Agilent Genomic DNA ScreenTape System Quick Guide,
Agilent Technologies (p/n G2964-90040 rev.B).
12. Covaris E220 Focused-ultrasonicators: http://covaris-
inc.com/products/afa-ultrasonication/e-series/
13. KAPA High Throughput Library Preparation Kit with
SPRI solution and Standard PCR Library
Amplification/Illumina series (96 libraries):
http://www.kapabiosystems.com/products/name/kap
a-library-preparation-kits
14. Agilent 2100 Bioanalyzer User Manual, Agilent
Technologies (p/n G2946-90003).
15. Agilent High Sensitivity DNA Kit Guide (Rev. B), Agilent
Technologies (p/n G2938-90321).
16. KAPA Library Quantification Kit - Illumina/Universal:
http://www.kapabiosystems.com/products/name/kap
a-library-quant-kits
For More Information
These data represent typical results. For more information
on our products and services, visit our Web site at
www.agilent.com/chem.
... The DNA samples were subjected to three quality control (QC) procedures using a Nanodrop spectrophotometer (Thermo Fisher, DE, USA) at A260/280 UV length, gel electrophoresis profiling, and Qubit fluorometric quantitation using fluorescent dye, to ensure the intactness of the samples [61]. DNA samples that passed the QC were purified using magnetic beads and agent court to remove tannin and undergo a mechanical shearing procedure using a Covaris Sonicator (Woburn, MA, USA) to ensure a standardized DNA sample length between 300 and 350 bp measured by an Agilent Bioanalyzer 2100 (Santa Clara, CA, USA) [62][63][64]. The samples were measured by Qubits to ensure that the total recovery after purification was more than 70%, and the total DNA per 50 µ L volume was 1 µ g before adenylation of the 3′ end. ...
... The DNA samples were subjected to three quality control (QC) procedures using a Nanodrop spectrophotometer (Thermo Fisher, DE, USA) at A260/280 UV length, gel electrophoresis profiling, and Qubit fluorometric quantitation using fluorescent dye, to ensure the intactness of the samples [61]. DNA samples that passed the QC were purified using magnetic beads and agent court to remove tannin and undergo a mechanical shearing procedure using a Covaris Sonicator (Woburn, MA, USA) to ensure a standardized DNA sample length between 300 and 350 bp measured by an Agilent Bioanalyzer 2100 (Santa Clara, CA, USA) [62][63][64]. The samples were measured by Qubits to ensure that the total recovery after purification was more than 70%, and the total DNA per 50 µL volume was 1 µg before adenylation of the 3 end. ...
Article
Full-text available
: Soil ecosystems are home to a diverse range of microorganisms, but they are only partially understood because no single-cell sequencing or whole-community sequencing provides a complete picture of these complex communities. Using one of such metagenomics approaches, we succeeded in monitoring the microbial diversity and stress-response gene in the soil samples. This study aims to test whether known differences in taxonomic diversity and composition are reflected in functional gene profiles by implementing whole gene sequencing (WGS) metagenomic analysis of geographically dispersed soils from two distinct pristine forests. The study was commenced by sequencing three rainforest soil samples and three peat swamp soil samples. Soil richness effects were assessed by exploring the changes in specific functional gene abundances to elucidate physiological constraints acting on different soil systems and identify variance in functional pathways relevant to soil biogeochemical cycling. Proteobacteria shows abundances of microbial diversity for 52.15% in Royal Belum Reserved Forest and 48.28% in Raja Musa; 177 out of 1,391,841 and 449 out of 3,586,577 protein coding represent acidic stress-response genes for Royal Belum and Raja Musa, respectively. Raja Musa indicates pH 2.5, which is extremely acidic. The analysis of the taxonomic community showed that Royal Belum soils are dominated by bacteria (98% in Sungai Kooi (SK), 98% in Sungai Papan (SP), and 98% in Sungai Ruok (SR), Archaea (0.9% in SK, 0.9% in SP, and 1% in SR), and the remaining were classed under Eukaryota and viruses. Likewise, the soils of Raja Muda Musa are also dominated by bacteria (95% in Raja Musa 1 (RM1), 98% in Raja Musa 2 (RM2), and 96% in Raja Musa 3 (RM3)), followed by Archaea (4% in RM1, 1% in RM2, and 3% in RM3), and the remaining were classed under Eukaryota and viruses. This study revealed that RBFR (Royal Belum Foresr Reserve) and RMFR (Raja Musa Forest Reserve) metagenomes contained abundant stress-related genes assigned to various stress-response pathways, many of which did not show any difference among samples from both sites. Our findings indicate that the structure and functional potential of the microbial community will be altered by future environmental potential as the first glimpse of both the taxonomic and functional composition of soil microbial communities.
... Genomic analysis. These genomes were part of the 100K Pathogen Genome Project using previously published methods (37)(38)(39)(40)(41)(42)(43)(44)(45). ...
... Library quantification was done using the KAPA library quantification kit (KK4824; Kapa Biosystems) to ensure the final library concentration prior to normalization and pooling for sequencing (40). Sequencing was performed with BGI@UCDavis (BGI@UCDavis, Sacramento, CA, USA) using the Illumina HiSeq 2000 platform with PE100 (Illumina Inc., San Diego, CA, USA) or the Illumina HiSeq 3000 platform with PE150 at the UC Davis Genome Center (Davis, CA, USA) (43,45). ...
Article
Full-text available
Importance: This study examined the link between public health and genomic variation of Campylobacter in relation to disease in humans, primates, and livestock. Use of large-scale whole genome sequencing enabled population level assessment to find new genes that are linked to livestock disease. With 184 Campylobacter genomes we assessed virulence traits, antibiotic resistance susceptibility, and potential for zoonotic transfer to observe there is a 'generalist' genotype that may move between host species.
... Four chitinase deletions (⌬STM0018, ⌬STM0233, ⌬STM0907 , and ⌬STM1869A) were constructed in the Weimer laboratory (University of California, Davis) (7) as described by Datsenko and Wanner (17). Cultures were prepared for sequencing as described previously (18)(19)(20)(21)(22)(23)(24)(25). Genome sequences were de novo assembled using CLC Workbench version 6.5.1 with default parameters (18). ...
Article
Full-text available
Chitinases are glycosyl hydrolases that catalyze the hydrolysis of the β-1,4 linkages in complex carbohydrates and those that contain GlcNAc. These enzymes are considered emerging virulence factors during infection because the host glycan changes. This is the release of four single chitinase deletion mutants in Salmonella enterica serovar Typhimurium LT2.
... Library amplification was done for eight cycles using the Kapa HiFi HotStart ReadyMix, followed by a 1ϫ SPRI bead cleanup step. Size distribution of libraries was confirmed using Agilent 2100 Bioanalyzer system with high-sensitivity DNA kit (15,16), and indexed libraries were quantified with quantitative PCR (qPCR)based Kapa library quantification kit (catalog no. KK4824) prior to pooling for sequencing either on the Illumina HiSeq 2000 with PE100 plus index read at BGI@UCD (Sacramento, CA) or PE150 on the Illumina HiSeq 3000 at the UC Davis Genome Center (Davis, CA). ...
Article
Full-text available
Campylobacter is a food-associated bacterium and a leading cause of foodborne illness worldwide, being associated with poultry in the food supply. This is the initial public release of 202 Campylobacter genome sequences as part of the 100K Pathogen Genome Project. These isolates represent global genomic diversity in the Campylobacter genus.
Article
Full-text available
We report the draft genome sequence for Enterococcus plantarum strain TRW2, isolated from the phyllosphere of romaine lettuce. The draft sequence consists of 3,383,441 bp, with a G+C content of 35.8% and 3,218 protein-coding genes. None of the 22,190 known antibiotic resistance genes were detected.
Article
Full-text available
The 100K Pathogen Genome Project is producing draft and closed genome sequences from diverse pathogens. This project expanded globally to include a snapshot of global bacterial genome diversity. The genomes form a sequence database that has a variety of uses from systematics to public health.
Article
Full-text available
Lysozyme enzymes hydrolyze the β-1,4-glycosidic bond in oligosaccharides. These enzymes are part of a broad group of glucoside hydrolases that are poorly characterized; however, they are important for growth and are being recognized as emerging virulence factors. This is the release of four lysozyme-encoding-gene-deletion mutants in Salmonella enterica serovar Typhimurium LT2.
Article
Full-text available
Sialidases, which are widely distributed in nature, cleave the α-ketosidic bond of terminal sialic acid residue. These emerging virulence factors degrade the host glycan. We report here the release of seven sialidase and one sialic acid transporter deletion in Salmonella enterica serovar Typhimurium strain LT2, which are important in cellular invasion during infection.
Article
Full-text available
Shigella is a major foodborne pathogen that infects humans and non-human primates and is the major cause of dysentery and reactive arthritis worldwide. This is the initial public release of 16 Shigella genome sequences from four species sequenced as part of the 100K Pathogen Genome Project.
Article
Full-text available
Campylobacter jejuni is an intestinal bacterium that can cause abortion in livestock. This publication announces the public release of 15 Campylobacter jejuni genome sequences from isolates linked to abortion in livestock. These isolates are part of the 100K Pathogen Genome Project and are from clinical cases at the University of California (UC) Davis.
Technical Report
Full-text available
The Agilent DNA Extraction Kit (p/n 200600) was compared to standard methods such as beadbeating and enzyme treatment for preparation of genomic DNA from the prokaryote Listeria monocytogenes. Using this extraction kit, with modifications, to lyse the bacteria and isolate high molecular weight DNA reproducibly yielded high quality DNA suitable for further applications such as polymerase chain reactions to produce amplicons, or for next-generation DNA sequencing. The quality of the high molecular weight DNA, and the comparison of extraction methods, was shown on the Agilent 2200 TapeStation with the Agilent Genomic DNA ScreenTape (p/n 5067-5365) and Agilent Genomic DNA Reagents (p/n 5067-5366). 2
Technical Report
Full-text available
The initial step in Next Generation Sequencing is to construct a library from genomic DNA. To gain the optimum result, extracted DNA must be of high molecular weight with limited degradation. High-throughput sequencing projects, such as the 100K Pathogen Genome Project, require methods to rapidly assess the quantity and quality of genomic DNA extracts. In this study, assessment of the applicability of the Agilent 2200 TapeStation was done using genomic DNA from nine foodborne pathogens using several accepted high-throughput methods. The Agilent 2200 TapeStation System with Genomic DNA ScreenTape and Genomic DNA Reagents was easy to use with minimal manual intervention. An important advantage of the 2200 TapeStation over other high-throughput methods was that high molecular weight genomic DNA quality and quantity can be quantified apart from lower molecular weight size ranges, providing a distinct advantage in the library construction pipeline and over other methods available for this important step in the Next Generation Sequencing process.
Article
Full-text available
Next generation sequencing (NGS) technology has revolutionized genomic and genetic research. The pace of change in this area is rapid with three major new sequencing platforms having been released in 2011: Ion Torrent's PGM, Pacific Biosciences' RS and the Illumina MiSeq. Here we compare the results obtained with those platforms to the performance of the Illumina HiSeq, the current market leader. In order to compare these platforms, and get sufficient coverage depth to allow meaningful analysis, we have sequenced a set of 4 microbial genomes with mean GC content ranging from 19.3 to 67.7%. Together, these represent a comprehensive range of genome content. Here we report our analysis of that sequence data in terms of coverage distribution, bias, GC distribution, variant detection and accuracy. Sequence generated by Ion Torrent, MiSeq and Pacific Biosciences technologies displays near perfect coverage behaviour on GC-rich, neutral and moderately AT-rich genomes, but a profound bias was observed upon sequencing the extremely AT-rich genome of Plasmodium falciparum on the PGM, resulting in no coverage for approximately 30% of the genome. We analysed the ability to call variants from each platform and found that we could call slightly more variants from Ion Torrent data compared to MiSeq data, but at the expense of a higher false positive rate. Variant calling from Pacific Biosciences data was possible but higher coverage depth was required. Context specific errors were observed in both PGM and MiSeq data, but not in that from the Pacific Biosciences platform. All three fast turnaround sequencers evaluated here were able to generate usable sequence. However there are key differences between the quality of that data and the applications it will support.
Article
Full-text available
Massively parallel sequencing technology is revolutionizing approaches to genomic and genetic research. Since its advent, the scale and efficiency of Next-Generation Sequencing (NGS) has rapidly improved. In spite of this success, sequencing genomes or genomic regions with extremely biased base composition is still a great challenge to the currently available NGS platforms. The genomes of some important pathogenic organisms like Plasmodium falciparum (high AT content) and Mycobacterium tuberculosis (high GC content) display extremes of base composition. The standard library preparation procedures that employ PCR amplification have been shown to cause uneven read coverage particularly across AT and GC rich regions, leading to problems in genome assembly and variation analyses. Alternative library-preparation approaches that omit PCR amplification require large quantities of starting material and hence are not suitable for small amounts of DNA/RNA such as those from clinical isolates. We have developed and optimized library-preparation procedures suitable for low quantity starting material and tolerant to extremely high AT content sequences. We have used our optimized conditions in parallel with standard methods to prepare Illumina sequencing libraries from a non-clinical and a clinical isolate (containing ~53% host contamination). By analyzing and comparing the quality of sequence data generated, we show that our optimized conditions that involve a PCR additive (TMAC), produces amplified libraries with improved coverage of extremely AT-rich regions and reduced bias toward GC neutral templates. We have developed a robust and optimized Next-Generation Sequencing library amplification method suitable for extremely AT-rich genomes. The new amplification conditions significantly reduce bias and retain the complexity of either extremes of base composition. This development will greatly benefit sequencing clinical samples that often require amplification due to low mass of DNA starting material.
/www.qiagen.com/products/catalog/sample- technologies/dna-sample-technologies/genomic- dna/qiaamp-dna-mini-kit 10
  • Qiagen Qiaamp
  • Dna Kit
Qiagen QIAamp DNA Mini Kit: http://www.qiagen.com/products/catalog/sample- technologies/dna-sample-technologies/genomic- dna/qiaamp-dna-mini-kit 10. Agilent 2200 TapeStation User Manual, Agilent Technologies (p/n G2964-90001).
Agilent Technologies (p/n G2964-90040 rev.B)
  • Agilent Genomic
  • Dna Screentape System Quick Guide
Agilent Genomic DNA ScreenTape System Quick Guide, Agilent Technologies (p/n G2964-90040 rev.B).
  • M A Quail
M.A. Quail, et al., Nature Methods 9, 10-11 (2012).
  • S Fisher
S. Fisher, et al., Genome Biology 12, R1 (2011).
  • M G Ross
M.G. Ross, et al., Genome Biology 14: R51 (2013).