Technical ReportPDF Available

Abstract

The use of molecular assays to assess the presence of species using environmental DNA (eDNA) and RNA (eRNA) as analytes has diversified as the field of environmental surveillance advances. These methods can be used, for example, to screen for pest species as part of biosecurity measures and risk management, to screen for threatened species as part of development requirements, or for biodiversity monitoring purposes. To ensure that testing results are reliable, it is imperative to develop and validate molecular assays using stringent quality standards that can minimise the potential for false negative and false positive results.
Environmental
DNA test
validation
guidelines
© University of Canberra 2022
Ownership of intellectual property rights
Unless otherwise noted, copyright (and any other intellectual property rights) in this publication is owned
by the University of Canberra.
Creative Commons licence
All material in this publication is licensed under a Creative Commons Attribution 4.0 International Licence
except content supplied by third parties and logos.
Inquiries about the licence and any use of this document should be emailed to EcoDNA@Canberra.edu.au.
Citation
De BrauwerM, CharitonA, ClarkeLJ, CooperMK, DiBattistaJ, FurlanE, Giblot-DucrayD, GleesonD, HarfordA,
HerbertS, MacDonaldAJ, MillerA, MontgomeryK, MooneyT, NobleLM, RourkeM, ShermanCDH, StatM,
SuterL, WestKM, WhiteN, Villacorta-RathC, Zaiko A &Trujillo-GonzalezA(2022).Environmental DNA test
validation guidelines.National eDNA Reference Centre, Canberra.
ISBN 978-1-74088-533-1
Disclaimer
The Australian Government, acting through the Department of Agriculture, Water and the Environment,
has exercised due care and skill in preparing and compiling the information and data in this publication.
Notwithstanding, the Department of Agriculture, Water and the Environment, its employees and advisers
disclaim all liability, including liability for negligence and for any loss, damage, injury, expense or cost
incurred by any person as a result of accessing, using or relying on any of the information or data in this
publication to the maximum extent permitted bylaw.
Acknowledgements
This publication was developed as a collaborative project, funded by the Australian Government
Department of Agriculture, Fisheries and Forestry, and led by Alejandro Trujillo-Gonzalez (ATG) of the
University of Canberra and Maarten De Brauwer (MDB) of CSIRO. Anastasija Zaiko ‘s contribution was
supported by theNew Zealand Ministry of Business, Innovation and Employment funding (CAWX1904—A
toolbox to underpin and enable tomorrow’s marine biosecurity system). We would like to acknowledge
and thank all co-authors who helped develop and write the guidelines, as well as the 53 experts from
across Australia and New Zealand who provided feedback during various stages of the process. The
authors thank the four external expert reviewers of the guidelines for their valuable feedback. ATG
and MDB would also like to thank Biotext for their services and Paula Doyle for her invaluable role as
knowledge broker.
Editing and design: Biotext Pty Ltd
Contents
Glossary  5
Introduction  7
Assaypurposeandselection  10
Species-specicassaydevelopmentandvalidation  12
1 Dene the intended purpose of the assay   13
2 Design and test the assay   13
3 Validate and optimise the assay using reference samples   15
4 Check analytical specicity   16
5 Check analytical sensitivity   17
6 Check repeatability   17
7 Check reproducibility   18
8 Determine thresholds (cut-os)   19
Summary of key steps in species-specic qPCR assay development
and validation   19
Resources   21
Metabarcodingassaydevelopmentandvalidation  22
1 Dene the intended purpose of the assay   23
2 Design and test the assay   23
3 Validate and optimise the assay using reference samples   27
4 Check reproducibility   28
Summary of key steps in metabarcoding assay development and
validation   28
Resources   29
References  30
Environmental DNA test validation guidelines 5
Glossary
Assay: The laboratory workow from DNA extraction to sequence outputs.
Often refers to the target gene and taxonomic group (e.g.16S_Fish,
18Suniversal, COI).
Endogenous control: Either exogenous DNA (i.e.DNA that is spiked in) or
endogenous DNA (i.e.DNA that is naturally occurring) that can be targeted
in environmental samples as a positive control to monitor method success.
Exogenous DNA templates can be generated from custom-synthesised DNA
fragments, DNA extract, plasmids, competent cells or viral particulates, and
be added to samples during any stage of the eDNA workow after sample
collection. Endogenous controls use the fact that DNA is ubiquitous in the
environment, such that every environmental sample will contain DNA from
multiple sources. In this context, a generic primer assay can be designed
to amplify abundant, non-target DNA that will be simultaneously sampled,
captured, extracted and amplied with the target species DNA (Furlan &
Gleeson 2016).
Environmental DNA/RNA (eDNA/eRNA): DNA or RNA directly extracted
from environmental samples (e.g.soil, sediment, water) without any
knowledge of the original organism. DNA carries genetic information,
whereas RNA transfers information to produce specic proteins and is only
shed by physiologically active (living) organisms.
Haplotype: A group of alleles that are inherited together from a single
parent – for example, mitochondrial haplotypes. A haplogroup consists
of haplotypes that shared a common ancestor with a single nucleotide
polymorphism mutation.
High-throughput sequencing (HTS): A technique able to determine the
nucleotide composition of millions of nucleic acid sequences. Dierent
platforms of sequencing are available including sequencing by synthesis
(e.g. Illumina), single molecule real time (e.g. PacBio), and nanopore (e.g.
Oxford Nanopore Technologies).
HTS library: Amplicons amplied for the purpose of HTS, cleaned and
pooled following indexing.
Inhibitory substances: Substances in a sample or extract that have a
negative eect on PCR, reducing assay sensitivity and increasing the risk of
false negative results.
Environmental DNA test validation guidelines 6
Limit of detection (LOD): The lowest concentration of target DNA that can
be detected with a dened level of condence (usually 95% detection rate).
Limit of quantication (LOQ): The lowest amount of DNA in a sample
that can be quantitatively determined with a stated precision, under stated
experimental conditions.
Metabarcoding: Simultaneous taxonomic identication of Operational
Taxonomic Units (OTUs) or Amplicon Specic Variances (ASVs) in eDNA
samples with millions of sequences, generated by PCR amplication using
one of the HTS techniques.
Monitoring: Systematic collection of data over time to detect changes in
a system (Gerber etal. 2005). Data can include information on a range of
factors, such as environmental, ecological, biological and social.
Polymerase chain reaction (PCR): A molecular technique that allows
exponential amplication of a target fragment/region of DNA from a mixture
of DNA fragments. The desired fragment to amplify is recognised from the
other fragments in the mixture by specic primers (small single-stranded
oligonucleotides) complementary to the desired sequence.
Primer: Short DNA fragments used in PCR amplication that bind adjacent
to the target region/gene.
Quantitative PCR (qPCR): A variant of PCR. The main dierence between
the two is that qPCR can quantify how many fragments of DNA are amplied
during each step in the reaction, leading to quantitative data.
Reference library: Database with DNA barcodes of specic species.
Sequencing: Determining the order of nucleotides in DNA or RNA; this can
be done using various methods.
Environmental DNA test validation guidelines 7
Introduction
The use of molecular assays to assess the presence of species
using environmental DNA (eDNA) and RNA (eRNA) as analytes
has diversied as the eld of environmental surveillance
advances. These methods can be used, for example, to
screen for pest species as part of biosecurity measures and
risk management, to screen for threatened species as part
of development requirements, or for biodiversity monitoring
purposes. To ensure that testing results are reliable, it is
imperative to develop and validate molecular assays using
stringent quality standards that can minimise the potential for
false negative and false positive results.
What is the aim of these guidelines?
These Environmental DNA test validation guidelines provide harmonised quality
control and minimum standard considerations for developing or validating
eDNA/eRNA assays for the purpose of single-species or multi-species
detection.
This document is a comprehensive guide for the development and use
of eDNA/eRNA tests, recommended and curated by eDNA experts,
stakeholders and end users in Australia and New Zealand. The guidelines
are designed to support a consistent and best-practice approach to eDNA/
eRNA testing to help detect species of interest. This approach ensures that
surveillance and resource managers are provided with robust scientic
evidence to support decision making.
Environmental DNA test validation guidelines 8
Environmental DNA–based methods exist for a broad variety of organisms;
however, assay performance has only been supercially tested for most
available assays, and few assays are validated to ensure reproducibility
(Thalinger etal. 2021). This document outlines minimal quality requirements
to develop and validate eDNA/eRNA molecular assays and standard
protocols for operational use in Australia and New Zealand.
Who are they for?
The Environmental DNA test validation guidelines provide minimum quality
requirements and assurances to design and validate eDNA assays for
biosecurity and surveillance applications. Although these guidelines will help
improve the accuracy and reliability of eDNA assays, they are not explicitly
designed to provide results for use in compliance and legal situations.
For researchers
The Environmental DNA test validation guidelines detail key steps to be
used in assay development and validation for species-specic testing
andmetabarcoding.
For clients
The Environmental DNA test validation guidelines provide quality assurance for
any contracted eDNA work. They tell end users what services and standards
can be expected, and may also be used to inform sta collecting samples or
involved in other areas of the project.
How have they been developed?
The Environmental DNA test validation guidelines were developed in a
collaborative process with input from eDNA experts and end users from
across Australia and New Zealand. Initial draft frameworks were developed
and led by members of the Standards and Best Practices Committee of
the Southern eDNA Society, after which multiple consultation rounds
with experts, end users and stakeholders from private entities and public
agencies were held to adapt the frameworks to meet Australian and New
Zealand needs. Three consultation periods with eDNA experts, private
stakeholders, government ocials and end users were held in 2021–22 to
ensure that the guidelines were t for purpose and met the highest quality
standards in the eld.
Environmental DNA test validation guidelines 9
Guidelines for single-species detection assays use a framework based
on the current principles and methods of validation for diagnostic
assays as approved by the World Organisation for Animal Health and the
International Standard for Phytosanitary Measures from the International
Plant Protection Convention. These principles and methods dene the
minimum requirements needed to determine the tness of an assay,
including analytical sensitivity, assay specicity, limit of detection, limit of
quantication, and eDNA-based ex situ testing.
Guidelines for multi-species detection assays use a framework based on
current recommendations from existing eDNA metabarcoding guidelines,
including the Environmental DNA sampling and experiment manual from the
Japanese eDNA Society (Minamoto etal. 2021), A practical guide to DNA-based
methods for biodiversity assessment (Bruce etal. 2021, Zaiko etal. 2021), and
existing protocols from experts in Australia and New Zealand.
Updates
It is anticipated that the Environmental DNA test validation guidelines will be
updated and expanded over time, with review and update as required.
The guidelines will be reviewed and updated by the Australian National
eDNA Reference Centre, with input from leading experts in the eld of
environmental DNA.
Environmental DNA test validation guidelines 10
Assay purpose and selection
It is important that the intended purpose of an assay is dened
at the outset. This will help to determine whether a species-
specic, metabarcoding or combined approach is appropriate.
The characteristics of the assay can also be checked against its
purpose throughout the development and validation process,
to ensure that the assay is t for purpose. This ensures that
eDNA and eRNA assays and related procedures are appropriate
and results are relevant to management.
Environmental DNA assay performance is aected by
environmental factors that must be considered for assay
optimisation. After initial optimisation for an intended purpose,
characteristics of the performance of the assay must be tested
and checked against the intended purpose and overall project
principles.
The most common purposes of species-specic eDNA/eRNA assays (see
Species-specic assay development and validation) are to:
contribute to the detection of a targeted species in
an environment
waste products of a host organism
a compartment or storage area
assess the presence of metabolically active species in
an environment
waste products of a host organism
a compartment or storage area
contribute to the detection and eradication of invasive species and pests
in dened areas.
Environmental DNA test validation guidelines 11
The most common purposes of metabarcoding eDNA/eRNA assays (see
Metabarcoding assay development and validation) are to:
characterise and survey biodiversity in
an environment
waste products of a host organism
stomach contents of an organism
contribute to the monitoring of species assemblages
contribute to the detection and eradication of multiple invasive species
and pests in dened areas.
For both species-specic assays and metabarcoding, there may be many
more specic purposes for which assays can be developed. Such specic
applications and their unique purposes need to be clearly dened within the
context of a fully validated assay.
In addition, consider whether a combined approach may be useful. For
example, you may decide to include metabarcoding as your rst step in a
screening tool to determine the range of species at the test site, to then
guide the use of more targeted, species-specic assays.
Environmental DNA test validation guidelines 12
Species-specic assay
development and validation
Species-specic eDNA assays are currently PCR-based assays
that involve amplication of a target DNA sequence present in
environmental matrices, and can be validated and designed
within a diagnostic framework. However, the availability of
suitable eDNA samples and experimental set-ups to formally
assess diagnostic sensitivity and specicity is rare. Validating
eDNA/eRNA assays for the purpose of diagnostic assessments
requires a quantitative analysis of amplication data with
minimal variation to conrm diagnostic capability. Therefore,
the robustness of eDNA/eRNA assays must be considered and
explored extensively todetermine how inhibiting factors and
environmental factors aect the amplication of known eDNA/
eRNA tracesincontrolledmatrices.
Steps for species-specic assay development and validation are:
1 Dene the intended purpose of the assay
2 Design and test the assay
3 Validate and optimise the assay using reference samples
4 Check analytical specicity
5 Check analytical sensitivity
6 Check repeatability
7 Check reproducibility
8 Determine thresholds (cut-os)
Environmental DNA test validation guidelines 13
1 Dene the intended purpose of the assay
The purpose of the assay must be dened at the outset, to act as a
benchmark for testing and validation (see Assay purpose and selection).
The purpose should be dened in terms of:
the high-level purpose (e.g.to detect the target species in a certain area,
to contribute to managing invasive species in an area)
the target organism
what aspect of the target organism will be measured (e.g.presence of live
organisms, presence of excreta, presence of cells)
the sampling matrix.
2 Design and test the assay
The robustness of eDNA detection relies heavily on the design quality
of primers and probes (Langlois etal. 2021). Species-specic assays are
recommended to be designed using probe-based chemistries for species-
specic detection.
Assays must be designed and tested in accordance with the assay’s intended
purpose.
Assay choice or design
The rst step is to ascertain whether tests have already been developed for
the intended purpose or are available for testing. Suitable molecular assays
may have been designed and validated for purposes outside the scope
of eDNA testing. In the specic case of biosecurity applications, national
diagnostic protocols may be available for targeted species with approved
molecular assays that could be suitable for eDNA-based testing.
When designing or testing an eDNA/eRNA assay, DNA degradation and
inhibition must be considered as important factors that aect assay
performance. Environmental DNA/RNA assays are designed considering
environmental factors that aect DNA integrity and the success of detection.
Such factors include temperature, moisture, bacterial activity and UV
radiation, which aect DNA degradation rates. Natural inhibitors such as
algae, humic substances and suspended sediment particles can also aect
detection success (Stoeckle etal. 2017).
Environmental DNA test validation guidelines 14
Therefore, primers and probes are commonly designed to target short
(80–280base pairs) rather than large (>300base pairs) fragments, because
short fragments remain available for detection for longer (Jo etal. 2017).
Targeted regions should be selected following currently accepted barcode
regions, and considering the availability of reference DNA data for the
species.
In silico testing
If relevant sequence data for the target species (and related non-target
species) exist in online repositories, selected assays should undergo in
silicotesting for specicity using sequence alignment tools (e.g.BLAST).
Common repositories include the National Center for Biotechnology
Information, the Barcode of Life Data System, the European Molecular
Biology Laboratory online repositories, and the CSIRO National Biodiversity
DNA Library.
If such data do not exist, reference material will need to be obtained and
sequenced to facilitate assay design. The level of specicity depends on
the purpose of the assay; however, assays should assess the specicity of
the target species, considering potential intraspecic variation in the study
area (i.e.inclusivity), as well as non-amplication of co-occurring and closely
related species (i.e.exclusivity). The biology and ecology of the target species
should be considered – genetically diverse species will require greater
reference resources to achieve a similar level of assay robustness to that of
less diverse species.
Species-specic primers should be designed to display 0 base pair
mismatches with the target species. Probes are recommended to be
designed to display >2mismatches with closely related species (Klymus
etal. 2020a). If target species are part of a species complex or if the target
has minimal genetic dierences from closely related species, the use of
amplication-refractory mutation system primers and locked nucleic acid
probes is recommended to increase assay sensitivity and specicity of
annealing temperatures (Stewart etal. 2016). Testing for specicity should
comprise closely related non-target species, as well as commonly co-habiting
species that users could encounter in the environment. Suitable species and
samples for assay evaluation should be selected considering:
taxonomy – include closest relatives (e.g.all members of the same genus
or family, and members of all suspected subspecies/genetically distinct
populations of the target species)
Environmental DNA test validation guidelines 15
geographic scope of intended use – include representatives of all taxa
within the broader group (e.g.mammals) that might be encountered in the
study area.
Inhibition testing
The robustness of eDNA assays must also be tested against inhibitory
factors that interfere with assay performance. Assessment of robustness
against inhibition should begin during the assay development and
optimisation stages. The factors most likely to aect assay robustness
include pH, temperature and organic matrix factors (Goldberg etal. 2016).
If you have access to suitable environmental reference samples (seeStep3),
testing for inhibitory factors should be performed to conrm which factors
are present and in what concentrations following extraction, and whether
selected extraction methods are sucient to yield high-quality DNA/
RNA. Inhibition tests can then be undertaken using serial dilutions of
conrmed inhibitory matrices spiked with known concentrations of synthetic
oligonucleotides that reproduce the sequence of the qPCR assay target
sequence. Synthetic oligonucleotides can include nucleotide inversions
to allow for additional plate preparation controls to determine potential
cross-contamination of synthetic oligonucleotides (Trujillo-González etal.
2021). Assays would be considered robust if 10fold serial dilutions of the
synthetic oligonucleotides amplify within 3.3cycles ± standard deviation
(95% condence intervals) of each other within serial dilutions of inhibitory
matrices. Instances where amplication falls outside this acceptable
range would indicate lack of robustness in the assessed inhibitory matrix
concentration.
3 Validate and optimise the assay using
reference samples
Optimisation aims to evaluate and adjust the most important physical,
chemical and biological parameters of an assay to ensure that the
performance characteristics of the assay are best suited to the intended
application (OIE 2021). For analysing environmental samples, it is important
to select reference samples that are representative of the target species
and analyte. These reference samples may be uids, tissues, excreta and
environmental samples that contain the analyte of interest and are usually
harvested from the target species as well as their environments. Assays
should be optimised using both sets of reference samples (i.e.samples
derived from the target species and its environment).
Environmental DNA test validation guidelines 16
Environmental reference samples are considered to be samples known to
contain the analyte of interest in varying concentrations (modied from OIE
2021, Chapter 2.2.6: Selection and use of reference samples and panels).
They can ideally be collected from ex situ settings used to maintain the
target species; however, samples collected from natural environments
or urban locations with conrmed records of the target species may also
serve as suitable conrmed environmental samples. Sequences could be
obtained from the study area to account for potential genetic variation;
however, this may only be possible in locations where the target species is
already present in the environment. It is not a viable option for biosecurity
applications where target species can be considered exotic or their presence
is unknown. The suitability of environmental samples for assay optimisation
must be assessed based on the size of the habitat, the abundance of the
target species (Goldberg etal. 2016) and the availability of samples to be
appropriately standardised.
The availability of conrmed environmental samples may be limited,
depending on the target species. Reference environmental samples spiked
using extracted DNA/RNA may be prepared in the laboratory from an
original starting material (e.g.serial dilution of a highly concentrated tissue-
derived extract) by spiking a suitable environmental matrix. The matrix into
which the analyte is placed or diluted should be identical to, or resemble as
closely as possible, the samples that ultimately will be tested in the assay.
Wherever possible, large quantities of these reference samples should be
collected or prepared and preserved for long-term use.
4 Check analytical specicity
Analytical specicity is the ability of the assay to distinguish the target
analyte (i.e.the nucleic acid sequence of the target species) from non-target
analytes (OIE 2021). The assessment is qualitative, and the choice and
sources of sample types, organisms and sequences for evaluation of
analytical specicity should reect test purpose and assay type.
Specicity tests should assess cross-reactivity of the assay against
co-occurring species and species that are closely related to the target
organism.
Environmental DNA test validation guidelines 17
5 Check analytical sensitivity
The limit of detection (LOD) is a measure of the analytical sensitivity of an
assay. The LOD is the estimated amount of analyte in a specied matrix
that would produce a positive result at least a specied percentage of the
time. The LOD is based on detection/non-detection criteria and describes
an assay’s ability to detect the target sequence at low levels (Klymus
etal. 2020b).
The LOD should be assessed using 10-fold dilutions of the target analyte in
a suitable matrix. Dilutions can be completed using dierent matrices and
reference samples to compare the LOD of the assay between tissue-derived
and environmental DNA; however, the LOD of the eDNA assay should be
ultimately estimated using suitable environmental reference samples.
The LOD of an eDNA assay should be assessed as the last dilution showing
100% positive amplication across all technical replicates. A more accurate
estimate may be obtained by a second-stage experiment using narrower
intervals in the dilution scheme, focusing on the region between 100%
and 0%.
6 Check repeatability
Repeatability is the level of agreement between assay results, which can be
quantied within and between relevant hierarchical levels, such as machine
runs and operators, when applying the same test method within the same
laboratory (OIE 2021). Repeatability can be used to dene the expected
precision of an assay in detecting a range of analyte concentrations under
normal operating conditions.
Repeatability is estimated by evaluating variation in results from a minimum
of 3conrmed eDNA-positive independent samples (samples collected
during dierent sampling events or at dierent locations, using cleaned or
sterile equipment) within the operating range of the assay – samples should
range from well-detectable concentrations of target DNA to concentrations
close to the LOD. Initial tests for repeatability should assess assay sensitivity
with DNA reference samples using 6technical replicates in at least
5separate runs completed on multiple days. The variation in results should
be explored within technical replicates of each sample within and between
runs, expressed as standard deviations or coecients of variation (standard
deviation ÷ mean of replicates).
Environmental DNA test validation guidelines 18
For control samples to provide valid inferences about assay precision, they
should be treated in exactly the same way as test samples in each run of
the assay, including the use of blank controls, extraction-negative controls,
positive controls and standard dilutions to assess analyte concentration. It
is not acceptable to prepare a nal working dilution of a sample in a single
tube from which diluted aliquots are pipetted into reaction vessels, or to
create replicates from one extraction of nucleic acid rather than preparing
new dilution standards for each run. Such ‘samples’ do not constitute valid
replicates for repeatability studies (OIE 2021).
7 Check reproducibility
Reproducibility is the ability of a test method to provide consistent results, as
determined by estimates of precision, when applied to aliquots of the same
samples (OIE 2021). Assay reproducibility is required for assay recognition
and implementation where assays have been designed for the purposes
of biosecurity and diagnostics – these assays should be tested in dierent
laboratories using the identical assay (protocol, reagents and controls). For
routine surveillance outside the scope of biosecurity, assays can be tested
for reproducibility by individual laboratories, although inter-laboratory
testing is highly recommended.
To assess the reproducibility of an assay, each of at least 3laboratories
should test the same panel of samples (blinded) containing a suggested
minimum of 20samples, with identical aliquots going to each laboratory.
Measurements of precision can be estimated for both the reproducibility and
repeatability data.
The reproducibility process should be designed to suit the intended
purpose of the assay. For example, if the results are intended to be used
in biosecurity compliance applications, more laboratories should be used
and/or more samples should be tested to increase condence in the
results. If tests are designed to be used by a single laboratory, establishing
repeatability across operators, relevant equipment, batches of consumables
and time may be sucient.
Environmental DNA test validation guidelines 19
8 Determine thresholds (cut-os)
Selection of the assay cut-o values should reect the intended purpose
of the assay and its application. Options and descriptive methods for the
best way to determine the cut-o values of eDNA qPCR assays are available
(Caraguel etal. 2011). The main diculty in establishing assay cut-o
values based on environmental reference samples is the lack of diverse
environmental matrices that are representative of the target species and its
habitat. Moreover, eDNA extracts may contain ampliable traces of target
DNA that fall outside the dened cut-o values of the assay. Despite the
utility of cut-o values in molecular assays, any amplication considered
positive, regardless of whether it falls within or outside assay cut-o values,
must be conrmed by sequencing before conrming results.
Summary of key steps in species-specic
qPCR assay development and validation
Step 1 Attheoutsetoftheproject,denetheintended
purposeoftheassay
Step 2 Designandtesttheassay:
Identify whether an assay exists for the intended purpose.
Identify whether an assay exists for similar purposes or
species that could be adapted for the intended purpose.
Identify environmental factors that may aect DNA integrity
and detection success (e.g.temperature, moisture, bacterial
activity, UV radiation).
Identify natural/human-made inhibitors that may aect
detection success (e.g.algae, humic substances, suspended
sediment, chemical contaminants).
Design probes and primers to target short DNA fragments
(80280base pairs) following currently accepted barcode
regions, if no suitable assay exists.
Test the specicity of the designed or identied assays in silico
using sequence tools against online repositories.
Test the robustness of the designed or identied assays in
inhibition testing.
Environmental DNA test validation guidelines 20
Step 3 Validateandoptimisetheassay:
Collect reference samples containing both the analyte and the
environmental matrix.
- Collect a sample that contains the target species and analyte
(e.g.sera, uids, tissues, excreta).
- If no known sample exists, make a reference sample by
creating a matrix that resembles the natural environmental
setting of the target species, and spiking it with the analyte.
Test assay robustness using known concentrations of reference
eDNA samples spiked into representative environmental
matrices.
Test the assay sensitivity using serial dilutions of reference
samples to determine the level of analyte that can be successfully
detected.
Step 4 Checkanalyticalspecicity:
Assess cross-reactivity of the assay against co-occurring species
and species that are closely related to the target organism.
Step 5 Checkanalyticalsensitivity:
Assess the LOD using 10-fold dilutions of the target analyte in a
suitable matrix.
Step 6 Checkrepeatability:
Evaluate variation in results of independent replicates from a
minimum of 3conrmed eDNA-positive samples within the
operating range of the assay.
Step 7 Checkreproducibility:
Arrange for at least 3laboratories to test the same panel of
samples (blinded) containing at least 20samples, with identical
aliquots going to each laboratory.
Step 8 Determinethresholds(cut-os):
Use your preferred method to determine thresholds, with
the understanding that amplication must be conrmed by
sequencing.
Environmental DNA test validation guidelines 21
Resources
eDNA Validation Scale: https://edna-validation.com/
Thalinger B, Deiner K, Harper LR, Rees HC, Blackman RC, Sint D, Traugott
M, Goldberg CS & Bruce K (2021). A validation scale to determine the
readiness of environmental DNA assays for routine species monitoring.
Environmental DNA 3(4):823–836, doi:10.1002/e dn3.189.
Furlan EM &Gleeson D (2016). Improving reliability in environmental
DNA detection surveys through enhanced quality control. Marine and
Freshwater Research68(2):388–395, doi:10.1071/MF15349.
Furlan EM, Gleeson D, Hardy CM & Duncan RP (2016). A framework for
estimating the sensitivity of eDNA surveys. Molecular Ecology Resources
16(3):641654, doi:10 .1111/ 17 55 - 0 9 9 8.1 24 8 3 .
Wilkes Walburn J, Rourke ML, Furlan E, DiBattista JD, Broadhurst MK,
Fowler AM, Hughes JM & Fielder S (2022). Robust environmental DNA
assay development and validation: A case study with two vulnerable
Australian sh. Aquatic Conservation 32(7):1225–1231, doi:10.1002/aqc.3809.
Environmental DNA test validation guidelines 22
Metabarcoding assay
development and validation
Metabarcoding is the parallel sequencing of complex bulk
samples through the analysis of short, conserved gene regions.
Metabarcoding assays have the capacity to inform users about
the presence of multiple species present in multiple samples
simultaneously, providing valuable information for community-
level species assemblages.
Environmental DNA assays designed for metabarcoding and high-throughput
sequencing (HTS) can be validated and designed within experimental
set-ups. Experimental set-ups to formally assess the performance of
metabarcoding assays can comprise mesocosm studies representative of
the target environment and species. Given the complexity of metabarcoding
assays and their corresponding data analysis requirements, developing
metabarcoding assays requires validation not only of the molecular assay
itself but also of the analytical pipeline used to curate and analyse the
resulting high-throughput data.
Steps for metabarcoding assay development and validation are
1 Dene the intended purpose of the assay (dene the targeted taxa and
outline reference database)
2 Design and test the assay (develop and test the associated bioinformatic
pipeline for data analysis)
3 Validate and optimise the assay using reference samples
4 Check reproducibility.
Environmental DNA test validation guidelines 23
1 Dene the intended purpose of the assay
The purpose of the assay must be dened at the outset, to act as a
benchmark for testing and validation (see Assay purpose and selection).
The purpose should be dened in terms of:
the high-level purpose (e.g.to detect the range of a target species in a
certain area, to contribute to managing invasive species in an area)
the target organisms
what aspect of the target organisms will be measured (e.g.presence of live
organisms, presence of excreta, presence of cells)
the sampling matrix.
2 Design and test the assay
The robustness of metabarcoding assays relies heavily on the design quality
of primers and the availability of reference DNA sequence databases at
suitable resolution to address the question. Assays must be designed and
tested in accordance with the assay’s intended purpose.
Assay choice or design
The rst step is to ascertain whether assays have already been developed
for the intended purpose or are available for testing. The length of the
targeted amplicons and gene regions (i.e.barcode) must be considered when
designing PCR primers for metabarcoding. The barcode cannot be too short
because it must be taxonomically well resolved and should span sucient
genetic variation to distinguish closely related species. However, it also
cannot be too long, because it otherwise does not t technical features of
current sequencing technologies.
Currently, most of the barcodes used in metabarcoding studies range
between 200 and 500base pairs (Pawlowski etal. 2020). Shorter barcodes
(less than 120base pairs) are sometimes used, especially for microbial
species detection, but such short gene fragments persist longer in the
environment and therefore may provide information that is less well
resolved in time and space, and has lower taxonomic resolution (Pawlowski
etal. 2020). Fusion-tagged primers (i.e.primers designed to include unique
tag identiers and sequencing adapters) can simplify library preparation
steps, and improve control of tag-jumping events and chimeric sequences.
Environmental DNA test validation guidelines 24
Sequencing errors during metabarcoding workows are common during PCR
amplication (Berney etal. 2004, Aird etal. 2011) and sequencing (Yoshitake
etal. 2021). These technical errors include nucleotide substitutions and
insertions introduced by the polymerase enzyme (Eckert & Kunkel 1991,
McInerney etal. 2014, Lee etal. 2016), nucleotide substitutions induced by
the DNA damage caused by temperature cycling during PCR (Potapov &
Ong 2017), and formation of chimeras (Fonseca etal. 2012). Chimeric PCR
products are generated when small DNA fragments that did not nish the
elongation during one step are carried over in subsequent amplication
steps. The nal amplicon will be a chimeric sequence that does not exist in
any living organism and is composed of 2 or more dierent DNA fragments
that originate from 2 or more dierent organisms.
Moreover, it is important to be aware that metabarcoding primers will not
amplify all DNA equally in a sample. They are likely to favour sequences
already predominant in the environment, leading to a biased abundance
ratio between DNA from dierent species (Elbrecht & Leese 2015, Piñol etal.
2015). One of the major factors driving PCR bias is the number of primer
mismatches for each species in the community – species with fewer, or no,
mismatches will amplify more eciently during PCR (Clarke etal. 2014). Gene
copy number (e.g.the number of mitochondria per cell), dierences between
species in ease of DNA extraction, and the type and position of primer
mismatches also inuence PCR eciency and bias.
PCR biases can be minimised by using quality measures and controls that
include suitable assay optimisation, using fusion-tagged primers to minimise
PCR steps and identify chimeric sequences, using high-delity polymerase
within reactions to improve replication sensitivity, and using multiple PCR
replicates per sample. Use of unique molecular identiers when designing
fusion-tagged primers or when using two-step PCR protocols (MacConaill
etal. 2018) can also increase analytical resolution, and allow greater
condence in identifying sequencing and PCR biases (Yoshitake etal. 2021).
Assay testing and mesocosm studies
Assays must be tested using appropriate environmental samples from areas
with conrmed presence of targeted species, or from appropriate mesocosm
experimental set-ups (Zaiko etal. 2021) with known species abundances.
Alternatively, articial samples or mock communities can be designed
following protocols in Zaiko etal. (2021).
Environmental DNA test validation guidelines 25
Library preparation
Amplicons produced using metabarcoding assays must be cleaned and
pooled to create a ‘library’ for HTS technologies (i.e.an HTS library). Several
methods are available for HTS library preparation and the steps needed
to prepare amplicons for sequencing. Metabarcoding assays must clearly
outline what library preparation strategy is to be used.
The following steps must be clearly outlined:
PCR amplication regime and replication. Describe how many PCR
technical replicates per sample and per control will be used during the
selected amplication steps. Indicate the reaction mix and temperature
regime to be completed during amplication.
Indexing strategy. Describe the method to be used for the addition of
sample-specic identiers (e.g.1-step PCR with fusion primers, 2-step
PCR using untagged primers with sequence overhangs, 2-step PCR using
primers with tags and sequence overhangs, tagged PCR and library build
on amplicon pool; see Bohmann etal. 2021).
Amplicon pooling for library preparation. Describe how amplicons are to
be pooled for library preparation, indicating required concentrations and
how to select suitable PCR technical replicates for pooling.
Troubleshooting. HTS and the pooling of PCR replicates is highly complex,
and entails costs associated with preventing, detecting and eliminating
errors and biases. Each metabarcoding approach has advantages and
disadvantages that end users must understand to better troubleshoot and
analyse HTS data. This section should provide assay-suitable information
on what complications are to be expected using the selected method,
addressing issues relating to cross-contamination risk, PCR amplication
eciency, chimera formation, tag jumping and index misassignment (see
Bohmann etal. 2021).
Sequencing
HTS libraries can be sequenced using several methods, including:
sequencing by Synthesis (Illumina)
single-molecule real-time sequencing (Pacic Biosciences)
Ion Torrent sequencing (ThermoFisher Scientic)
454 pyrosequencing technology
oligonucleotide ligation and detection (SOLiD) sequencing (Life
Technologies)
nanopore sequencing (Oxford Nanopore Technologies)
small genome sequencing (GenapSys).
Environmental DNA test validation guidelines 26
These dierent methods achieve dierent read lengths and reads per
run, and have advantages and disadvantages associated with sequencing
accuracy, time per run, costs and reagents. Assays must describe which
method is selected in accordance with the purpose of the assay. Sequencing
should be undertaken as per the manufacturer’s instructions. Deviations
from the standard method must be described clearly, supported by
published research.
Data analysis
Data analysis of sequenced outputs requires standardised bioinformatic
scripts that include data quality control, sample demultiplexing and
taxonomic identication against a reliable reference database (Figure1; Zaiko
etal. 2021). Scripts should be written with enough guidance for end users to
reliably run each script and understand what each step achieves in curating
the data.
There are important aspects that bioinformatic scripts should consider and
include for nal analysis and communicating results:
Quality control measures. Outline how data will be curated by the script,
providing an indication of how many reads are being ltered by each step
from each sample. Quality control measures should consider sequence
quality of reads, pairing of paired end reads (if applicable), tag and primer
congruence, and ltering of chimeric or singleton sequences. It is essential
that the metabarcoding assay includes an indication of the accepted
minimal quality requirements, considering the application and context of
the proposed assay.
Sequence taxonomic identication. The taxonomic assignment of curated
sequences must be completed against suitable reference databases,
either available in online repositories or prepared by users before the
analysis. Reference databases can be generated with a series of scripts
(see Arranz etal. 2020), but should always aim to include sequences
that correspond to the targeted genomic region (e.g.16S gene region,
cytochrome oxidaseI), correspond to the appropriate targeted taxonomic
group and taxonomic level, and include sequences with suitable metadata
associated with either museum accessions or peer-reviewed publications.
Environmental DNA test validation guidelines 27
3 Validate and optimise the assay using
reference samples
Optimisation aims to evaluate and adjust the most important physical,
chemical and biological computational parameters of an assay to ensure that
the performance characteristics of the assay are best suited to the intended
application (OIE 2021). To optimise metabarcoding assays, it is important to
select reference samples that are representative of the target environment.
These may be environmental samples that contain a variety of analytes
of interest and are usually harvested from environments with known
occurrence of targeted organisms. Environmental samples may be spiked
using extracted DNA from target organisms in the environment in serial
dilutions of highly concentrated tissue-derived extracts (Coghlan etal. 2021).
Mock mesocosm experiments can also be used to collect environmental
samples containing DNA from known species assemblages and abundances
under controlled conditions (Kelly etal. 2014, Evans etal. 2015).
Quality control
Sequence taxonomic ID
Web-based output
Raw data output
Phred score 30
Primer congruence 2 bp
Tag congruence 100%
Reference database comparison
Report preparation
User analysis
Final decision (presence/absence)
Figure 1 
Example of a bioinformatic pipeline for analysing metabarcoding assayresults
Environmental DNA test validation guidelines 28
4 Check reproducibility
Metabarcoding assays can be assessed for reproducibility by processing
homogenised samples using standardised DNA extraction protocols, primers
and bioinformatic analyses (Zaiko etal. 2021). In this context, metabarcoding
assays should consistently provide comparable results between independent
laboratories, although these may show variation in raw results associated
with the use of dierent amplication regimes, instruments and user-
associated error (Zaiko etal. 2021). Verifying assays through an independent
laboratory is preferred.
Summary of key steps in metabarcoding
assay development and validation
Step 1 Attheoutsetoftheproject,denetheintended
purposeoftheassay
Step 2 Designandtesttheassay:
Identify whether an assay exists for the intended purpose.
Identify whether an assay exists for similar purposes or
species that could be adapted for the intended purpose.
Design primers to target longer DNA fragments (200–500base
pairs) following currently accepted barcode regions.
Address potential PCR biases (e.g.nucleotide substitutions and
insertions) during analyses.
Test the assays using environmental samples from areas with
conrmed presence of targeted species, a mock community or
mesocosm experiments.
Outline a library preparation strategy and create a library
forHTS.
Sequence the HTS library.
Filter and curate sequenced data before taxonomic
assignment, and use a curated reference database suitable
for the targeted gene region using curated sequences from
available molecular repositories.
Environmental DNA test validation guidelines 29
Step 3 Validateandoptimisetheassay:
Evaluate and adjust the physical, chemical and biological
parameters of the assay to suit the intended application, using
reference samples representative of the target environment.
Step 4 Checkreproducibility:
Process homogenised samples using standardised DNA
extraction protocols, primers and bioinformatic analyses.
Resources
Arranz V, Pearman WS, Aguirre JD & Liggins L (2020). MARES, a replicable
pipeline and curated reference database for marine eukaryote
metabarcoding. Scientic Data 7:209, doi:10.1038/s41597-020-0549-9.
Deiner K, Bik HM, Mächler E, Seymor M, Lacoursière-Roussel A,
AltermattF, Creer S, Bista I, Lodge DM, de Vere N, Pfrender ME &
Bernatchez L (2015). Environmental DNA metabarcoding: transforming
how we survey animal and plant communities. Molecular Ecology
26(21):5872–5895, doi:10.1111/mec.14 3 5 0.
Zaiko A, Greeneld P, Abbott C, von Ammon U, Bilewitch J, BunceM,
CristescuME, CharitonA, DowleE, Geller J, Ardura GutierrezA,
HajibabaeiM, Haggard E, Inglis GJ, Lavery SD, SamuilovieneA,
SimpsonT, Stat M, Stephenson S & Pochon X (2021). Towards
reproducible metabarcoding data: lessons from an international
cross-laboratory experiment. Molecular Ecology Resources 22(2):519–538,
doi:10 .1111/ 17 55 - 0 9 9 8. 13 48 5 .
Environmental DNA test validation guidelines 30
References
Aird D, Ross MG, Chen WS, Danielsson M, Fennell T, Russ C, Jae DB, Nusbaum C
& Gnirke A (2011). Analyzing and minimizing PCR amplication bias in Illumina
sequencing libraries.Genome Biology 12 (2):1–14.
Arranz V, Pearman WS, Aguirre JD & Liggins L (2020). MARES, a replicable
pipeline and curated reference database for marine eukaryote
metabarcoding.ScienticData7(1):1–8.
Berney C, Fahrni J & Pawlowski J (2004). How many novel eukaryotic ‘kingdoms’?
Pitfalls and limitations of environmental DNA surveys. BMC Biology 2:13.
Bohmann K, Elbrecht V, Carøe C, Bista I, Leese F, Bunce M, Yu DW, Seymour
M, Dumbrell AJ & CreerS (2021). Strategies for sample labelling and library
preparation in DNA metabarcoding studies. Molecular Ecology Resources
22(4):1231–124 6, doi:1 0.1111 /17 5 5 - 09 9 8 .13 512 .
Bruce K, Blackman R, Bourlat SJ, Hellström AM, Bakker J, Bista I, Bohmann K,
Bouchez A, Brys R, ClarkK, Elbrecht V, Fazi S, Fonseca V, Häning B, Leese F,
Mächler E, Mahon AR, Meissner K, Panksep K, Pawlowski J, Schmidt Yáñez P,
Seymour M, Thalinger B, Valentini A, Woodcock P, Traugott M, Vasselon V &
Deiner K (2021). A practical guide to DNA-based methods for biodiversity assessment,
Advanced Books, doi:10.3897/ab.e6 863 4.
Caraguel CGB, Str yhn H, Gagné N, Dohoo IR & Hammell KL (2011). Selection of a
cuto value for real-time polymerase chain reaction results to t a diagnostic
purpose: analytical and epidemiologic approaches.Journal of Veterinary
Diagnostic Investigation23(1): 2–15, doi:10.1177/104063871102300102.
Clarke LJ, Soubrier J, Weyrich LS & Cooper A (2014). Environmental metabarcodes
for insects:in silicoPCR reveals potential for taxonomic bias. Molecular Ecology
Resources 14(6):1160–1170, doi:10 .1111/1 75 5 - 0 99 8 .12 26 5 .
Coghlan SA, Currier CA, Freeland J, Morris TJ & Wilson CC (2021). Community
eDNA metabarcoding as a detection tool for documenting freshwater
mussel (Unionidae) species assemblages. Environmental DNA 3(6):117 2–1191,
doi:10.1002/edn3.239.
Eckert KA & Kunkel TA (1991). DNA polymerase delity and the polymerase chain
reaction.Genome Research 1.1: 17–24, doi:10.110 1/gr.1.1.17.
Environmental DNA test validation guidelines 31
Elbrecht V & Leese F (2015). Can DNA-based ecosystem assessments quantify
species abundance? Testing primer bias and biomass—sequence relationships
with an innovative metabarcoding protocol. PloS ONE 10(7):e0130324,
doi:10.1371/journal.pone.0130324.
Evans NT, Olds BP, Renshaw MA, Turner CR, Li Y, Jerde CL, Mahon AR, Pfrender ME,
Lamberti GA & Lodge DM (2015). Quantication of mesocosm sh and amphibian
species diversity via environmental DNA metabarcoding. Molecular Ecology
Resources16(1):29–41, doi:10. 1111/1 75 5 - 0 99 8 .12 43 3 .
Fonseca VG, Nichols B, Lallias D, Quince C, Carvalho GR, Power DM & Creer S (2012).
Sample richness and genetic diversity as drivers of chimera formation in nSSU
metagenetic analyses. Nucleic Acids Research40(9):e66, doi:10.1093/nar/gks002.
Furlan EM &Gleeson D (2016). Improving reliability in environmental DNA
detection surveys through enhanced quality control. Marine and Freshwater
Research68(2):388–395, doi:10.1071/MF15349.
Gerber LR, Beger M, McCarthy MA & Possingham HP (2005). A theor y for
optimal monitoring of marine reserves.Ecology Letters8(8): 829–837,
doi:10 .1111/ j .14 61 - 02 4 8 .2 0 0 5. 0 07 8 4 . x.
Goldberg CS, Turner CR, Deiner K, Klymus KE, Thomsen PF, Murphy MA, Spear SF,
McKee A, Oyler-McCance SJ, Cornman RS, Laramie.MB, Mahon AR, Lance RF,
Pilliod DS, Strickler KM, Waits LP, Fremier AK, Takahara T, Herder JE & Taberlet P
(2016). Critical considerations for the application of environmental DNA methods
to detect aquatic species.Methods in Ecology and Evolution7(11):1299–1307,
doi:10 .1111/ 2 0 41-210X .1259 5.
Jo T, Murakami H, Masuda R, Sakata MK, Yamamoto S & Minamoto T (2017).
Rapid degradation of longer DNA fragments enables the improved estimation
of distribution and biomass using environmental DNA.Molecular Ecology
Resources17(6):e25–e33, doi:10. 1111/1 75 5 - 0 99 8 .12 68 5 .
Kelly RP, Port JA, Yamahara KM & Crowder LB (2014). Using environmental
DNA to census marine shes in a large mesocosm. PloS ONE 9(1):e86175,
doi:10.1371/journal.pone.0086175.
Klymus KE, Ruiz Ramos DV, Thompson NL & Richter CA (2020a). Development
and testing of species-specic quantitative PCR assays for environmental DNA
applications. Journal of Visualized Experiments 165:e61825, doi:10.3791/61825 .
Klymus KE, Merkes CM, Allison MJ, Goldberg CS, Helbing CC, Hunter ME, Jackson CA,
Lance RF, Mangan AM, Monroe EM, Piaggio AJ, Stokdyk JP, Wilson CC & Richter CA
(2020b). Reporting the limits of detection and quantication for environmental
DNA assays. Environmental DNA 2(3):271–282, doi:10.1002/edn3. 29.
Environmental DNA test validation guidelines 32
Langlois VS, Allison MJ, Bergman LC, To TA & Helbing CC (2021). The need for robust
qPCR-based eDNA detection assays in environmental monitoring and species
inventories.Environmental DNA 3(3):519–527, doi:10.1002 /edn 3.16 4.
Lee DF, Lu J, Chang S, Loparo JJ & Xie XS (2016). Mapping DNA polymerase errors
by single-molecule sequencing.Nucleic Acids Research44(13): e118–e118,
doi:10.1093/nar/gk w436.
MacConaill LE, Burns RT, Nag A, Coleman HA, Slevin MK, Giorda K, Light M, Lai K,
Jarosz M, McNeill MS, Ducar MD, Meyerson M & Thorner AR (2018). Unique, dual-
indexed sequencing adapters with UMIs eectively eliminate index cross-talk and
signicantly improve sensitivity of massively parallel sequencing. BMC Genomics
19:30, doi:10.1186/s12864-017-4428-5.
McInerney P, Adams P & Hadi MZ (2014). Error rate comparison during polymerase
chain reaction by DNA polymerase.Molecular Biology International,2014,
doi:10.1155/2014/287430.
Minamoto T, Miya M, Sado T, Seino S, Doi H, Kondoh M, Nakamura K, TakaharaT,
Yamamoto S, Yamanaka H, Araki H, Iwasaki W, Kasai A, Masuda R &
UchiiK(2021). An illustrated manual for environmental DNA research: water
sampling guidelines and experimental protocols. Environmental DNA 3(1):8–13,
doi:10.10 02/e dn3 .121.
OIE (World Organisation for Animal Health) (2021). Manual of diagnostic tests and
vaccines for terrestrial animals 2021, OIE, Paris, accessed 15June 2021, oie.int /en/
what-we-do/standards/codes-and-manuals/terrestrial-manual-online-access.
Pawlowski J, Apothéloz-Perret-Gentil L, Mächler E & Altermatt F (2020).
Environmental DNA applications in biomonitoring and bioassessment of
aquatic ecosystems: guidelines. Federal Oce for the Environment, Bern,
www.researchgate.net/publication/346511410_Environmental_DNA_
applications_for_biomonitoring_and_bioassessment_in_aquatic_ecosystems.
Piñol J, Mir G, Gomez-Polo P & Agustí N (2015). Universal and blocking
primer mismatches limit the use of high-throughput DNA sequencing
for the quantitative metabarcoding of arthropods. Molecular Ecology
Resources15(4):819–830, doi:1 0.1111 /17 5 5 - 09 9 8 .12 35 5 .
Potapov V & Ong JL (2017). Examining sources of error in PCRbysingle-molecule
sequencing.PloS ONE 12(1):e0169774 , doi:10.1371/journal.pone.0169774.
Environmental DNA test validation guidelines 33
Stewart D, Zahiri R, Djoumad A, Freschi L, Lamarche J, Holden D, Cervantes S, Ojeda
DI, Potvin A, Nisole A, Béliveau C, Capron A, Kimoto T, DayB, Yueh H, Du C,
Levesque RC, Hamelin RC & Cusson M (2016). A multi-species TaqMan PCR assay
for the identication of Asian gypsy moths (Lymantria spp.) and other invasive
lymantriines of biosecurity concern to North America. PloS ONE 11(8):e0160878,
doi:10.1371/journal.pone.0160878.
Stoeckle BC, Beggel S, Cerwenka AF, Motivans E, Kuehn R & Geist J (2017).
Asystematic approach to evaluate the inuence of environmental conditions
on eDNA detection success in aquatic ecosystems. PloS ONE 12(12):e 0189119,
doi:10.1371/journal.pone.0189119.
Thalinger B, Deiner K, Harper LR, Rees HC, Blackman RC, Sint D, Traugott M,
Goldberg CS & Bruce K (2021). A validation scale to determine the readiness
of environmental DNA assays for routine species monitoring.Environmental
DNA3(4):823–836, doi:10.10 0 2 /ed n3.189 .
Trujillo-Gonzalez A, Hinlo R, Gofwin S, Barmuta L A, Watson A, Turner P, Koch A &
Gleeson D (2021). Environmental DNA detection of the giant freshwater craysh
(Astacopsis gouldi). Environmental DNA 3(5):950–958, doi:10.1002/edn3.204.
Yoshitake K, Fujiwara A, Matsuura A, Sekino M, Yasuike M, Nakamura Y, Nakamichi
R, Kodama M, Takahama Y, Takasuka A, Asakawa S, NishikioriK, Kobayashi T
& Watabe S (2021). Estimation of tuna population by the improved analytical
pipeline of unique molecular identier-assisted HaCeD-Seq (haplotype count
from eDNA). Scientic Reports 11:7031, doi:10.1038/s 41598- 021-86190 -6.
Zaiko A, Greeneld P, Abbott C, von Ammon U, Bilewitch J, Bunce M, Cristescu
ME, Chariton A, Dowle E, Geller J, Ardura Gutierrez A, Hajibabaei M, HaggardE,
InglisGJ, Lavery SD, Samuiloviene A, Simpson T, StatM, Stephenson S & PochonX
(2021). Towards reproducible metabarcoding data: lessons from an international
cross-laboratory experiment. Molecular Ecology Resources 22(2):519–538,
doi:10 .1111/ 17 55 - 0 9 9 8.1 34 8 5 .
... The importance of establishing best-practice guidelines and quality assurance for eDNA-based biosecurity applications is crucial for implementing its nation-wide uptake for routine biosecurity practices. This has been increasingly emphasized by researchers including recognizing First Nations peoples' ownership and stewardship 18,60,[62][63][64] . However, it is important to acknowledge that Māori communities have reservations about molecular tools, being concerned about losing data sovereignty as the DNA sequenced of taonga (treasured) species is often sent overseas, effectively removing the Māori property rights 21 , but also concerns about the genomic sequencing of taonga (treasured) species which has implications for key values and concepts such as whakapapa (the relationship between everything and everybody in the natural world). ...
Article
Full-text available
Aotearoa New Zealand’s Northern region is a major gateway for the incursion and establishment of non-indigenous species (NIS) populations due to high numbers of recreational and commercial vessels. This region also holds a unique marine ecosystem, home to many taonga (treasured) species of cultural and economic importance. Regular surveillance, eradication plans and public information sharing are undertaken by local communities and governmental organizations to protect these ecosystems from the impact of NIS. Recently, considerable investments went into environmental DNA (eDNA) research, a promising approach for the early detection of NIS for complementing existing biosecurity systems. We applied eDNA metabarcoding for elucidating bioregional patterns of NIS distributions across a gradient from harbors (NIS hotspots) to open seas (spreading areas). Samples were collected during a research cruise sailing across three Aotearoa New Zealand harbors, Waitematā, Whangārei and Pēwhairangi (Bay of Islands), and their adjacent coastal waters. The small-ribosomal subunit (18S rRNA) and mitochondrial cytochrome c oxidase I (COI) genes were screened using the online Pest Alert Tool for automated detection of putative NIS sequences. Using a probabilistic modelling approach, location-dependent occupancies of NIS were investigated and related to the current information on species distribution from biosecurity surveillance programs. This study was collaboratively designed with Māori partners to initiate a model of co-governance within the existing science system.
... The same metabarcoding approach has been used to survey benthic biodiversity near Davis station (Clarke et al. 2021), highlighting the potential to co-design eDNA-based biosecurity surveillance and native biodiversity monitoring in a joint framework. Future work will be needed to confirm whether all high-priority marine species would be detected with the currently available metazoan PCR primers with sufficient sensitivity and, if not, primers targeting specific groups (such as ascidians and mussels) should be designed and validated following best-practice guidelines (De Brauwer et al. 2022b;Thalinger et al. 2021). Similarly, any eDNA surveillance program should be designed following best practice and taking key principles into account as per De Brauwer et al. (2022a). ...
Book
Full-text available
This publication is an output from EU COST Action DNAqua-Net (CA 15219 - Developing new genetic tools for bioassessment of aquatic ecosystems in Europe) and would not have been possible without the opportunities for international collaboration provided by the network, supported by COST (European Cooperation in Science and Technology). Therefore, our highest gratitude is due to Florian Leese and Agnès Bouchez who designed and led DNAqua-Net, and to programme managers Alex Weigand, Sarah Kückmann and Charlotte Frie who coordinated it. In addition to the authors, hundreds of researchers and practitioners from across Europe and further afield have contributed to the body of knowledge synthesised herein. DNAqua-Net workshops have served as the primary mechanism for consolidating knowledge and were particularly valuable for bringing together research scientists with regulators and end-users, which helped to emphasise the practical considerations in the implementation of DNA-based monitoring programmes. Workshops that ultimately fed into this publication were hosted in Germany (Florian Leese; University of Duisburg-Essen, 2017), Bosnia and Herzegovenia (Belma Kalamujić; University of Sarajevo, 2017), Hungary (Zoltán Csabai; University of Pécs, 2018), Austria (Michael Traugott; University of Innsbruck, 2018), Portugal (Pedro Beja; CIBIO, 2018), Italy (Stefano Fazi; Water Research Institute IRSA-CNR, 2019) and Cyprus (Marlen Vasquez; Cyprus University of Technology, 2019). The workshops highlighted what a collaborative community has emerged among researchers in this field, enabled to a large degree by programmes like DNAqua-Net as well as by a strong collective sense that our research has important real-world applications and is building a foundation for the years to come when we need every tool in the box to promote the protection and recovery of the natural world. We are particularly grateful to all those non-expert users of environmental DNA who fed back to us their experiences and challenges in engaging with these new methods and provided wider context as to the practical, logistical and financial constraints of routine monitoring (Iwan Jones, Simon Vitecek, Willie Duncan, Kerry Walsh and Martyn Kelly, to name just a few). These insights have helped to guide and shape research priorities, and we hope that this guide will prove a useful resource for these users as they begin to integrate these new technologies into the suite of tools at their disposal.
Article
Full-text available
Metabarcoding of DNA extracted from environmental or bulk specimen samples is increasingly used to profile biota in basic and applied biodiversity research because of its targeted nature that allows sequencing of genetic markers from many samples in parallel. To achieve this PCR amplification is carried out with primers designed to target a taxonomically informative marker within a taxonomic group, and sample-specific nucleotide identifiers are added to the amplicons prior to sequencing. This enables assignment of the sequences back to the samples they originated from. Nucleotide identifiers can be added during the metabarcoding PCR and/or during ‘library preparation’, i.e., when amplicons are prepared for sequencing. Different strategies to achieve this labelling exist. All have advantages, challenges and limitations, some of which can lead to misleading results, and in the worst case compromise the fidelity of the metabarcoding data. Given the range of questions addressed using metabarcoding, ensuring that data generation is robust and fit for the chosen purpose is critically important for practitioners seeking to employ metabarcoding for biodiversity assessments. Here, we present an overview of the three main workflows for sample-specific labelling and library preparation in metabarcoding studies on Illumina sequencing platforms; one-step PCR, two-step PCR, and tagged PCR. Further, we distill the key considerations for researchers seeking to select an appropriate metabarcoding strategy for their specific study. Ultimately, by gaining insights into the consequences of different metabarcoding workflows, we hope to further consolidate the power of metabarcoding as a tool to assess biodiversity across a range of applications.
Article
Full-text available
Many studies have investigated the ability to identify species from environmental DNA (eDNA). However, even when individual species are identified, the accurate estimation of their abundances by traditional eDNA analyses has been still difficult. We previously developed a novel analytical method called HaCeD-Seq (H aplotype C ount from eD NA), which focuses on the mitochondrial D-loop sequence. The D-loop is a rapidly evolving sequence and has been used to estimate the abundance of eel species in breeding water. In the current study, we have further improved this method by applying unique molecular identifier (UMI) tags, which eliminate the PCR and sequencing errors and extend the detection range by an order of magnitude. Based on this improved HaCeD-Seq pipeline, we computed the abundance of Pacific bluefin tuna ( Thunnus orientalis ) in aquarium tanks at the Tokyo Sea Life Park (Kasai, Tokyo, Japan). This tuna species is commercially important but is at high risk of resource depletion. With the developed UMI tag method, 90 out of 96 haplotypes (94%) were successfully detected from Pacific bluefin tuna eDNA. By contrast, only 29 out of 96 haplotypes (30%) were detected when UMI tags were not used. Our findings indicate the potential for conducting non-invasive fish stock surveys by sampling eDNA.
Article
Full-text available
Considerable promise and excitement exist in the application of environmental DNA (eDNA) methods to environmental monitoring and species inventories as eDNA can provide cost‐effective and accurate biodiversity information. However, considerable variation in data quality, rigor, and reliability has eroded confidence in eDNA application and is limiting regulatory and policy uptake. Substantial effort has gone into promoting transparency in reporting and deriving standardized frameworks and methods for eDNA field workflow components, but surprisingly little scrutiny has been given to the design and performance elements of targeted eDNA detection assays which, by far, have been most used in the scientific literature. There are several methods used for eDNA detection. The most accessible, cost‐effective, and conducive to standards development is targeted real‐time or quantitative real‐time polymerase chain reaction (abbreviated as qPCR) eDNA analysis. The present perspective is meant to assist in the development and evaluation of qPCR‐based eDNA assays. It evaluates six steps in the qPCR‐based eDNA assay development and validation workflow identifying and addressing concerns pertaining to poor qPCR assay design and implementation; identifies the need for more fulsome mitochondrial genome sequence information for a broader range of species; and brings solutions toward best practices in forthcoming large‐scale and worldwide eDNA applications, such as at‐risk or invasive species assessments and site remediation monitoring.
Article
Full-text available
Environmental DNA (eDNA)‐based assessments of macro‐organisms have now become an essential approach for biomonitoring. eDNA survey methods have a number of advantages over conventional survey methods. However, the value of the data that will accumulate would be greatly enhanced by standardizing the analysis methods, which would allow us to compare data from multiple monitoring sites at different points in time. The eDNA Society (http://ednasociety.org/en/about), whose founding members consist of Japanese researchers conducting eDNA studies on macro‐organisms, was established in 2018, with the aim of expanding eDNA technology and science. Here, we introduce our key publication, “Environmental DNA Sampling and Experiment Manual” (http://ednasociety.org/en/manual), which was published under the initiative of the eDNA Society. Detailed methods for the surveys and experiments are described in the manual, including the selection of sampling sites, sampling methods, filtration methods, DNA extraction, species‐specific detection by real‐time polymerase chain reaction, and fish eDNA metabarcoding. The manual assists users in conducting standardized surveys and quality experiments, and provides a basis for collecting comparable data. Given that the efficacy of methods can be context dependent and variable, and that procedures may sometimes conflict with standardization, it is difficult to ensure that all processes are equally effective. However, even in such cases, it is important to maintain sufficiently high data quality by setting the minimum standards to be followed. Implementation of such standardized methodologies will enable the systematic and frequent collection of flawless, comparable eDNA data from around the world; this will provide important fundamental information for biodiversity conservation, as well as the sustainable use of fisheries resources.
Article
Full-text available
The use of DNA metabarcoding to characterise the biodiversity of environmental and community samples has exploded in recent years. However, taxonomic inferences from these studies are contingent on the quality and completeness of the sequence reference database used to characterise sample species-composition. In response, studies often develop custom reference databases to improve species assignment. The disadvantage of this approach is that it limits the potential for database re-use, and the transferability of inferences across studies. Here, we present the MARine Eukaryote Species (MARES) reference database for use in marine metabarcoding studies, created using a transparent and reproducible pipeline. MARES includes all COI sequences available in GenBank and BOLD for marine taxa, unified into a single taxonomy. Our pipeline facilitates the curation of sequences, synonymization of taxonomic identifiers used by different repositories, and formatting these data for use in taxonomic assignment tools. Overall, MARES provides a benchmark COI reference database for marine eukaryotes, and a standardised pipeline for (re)producing reference databases enabling integration and fair comparison of marine DNA metabarcoding results.
Article
Full-text available
Background Environmental DNA (eDNA) analysis is increasingly being used to detect the presence and relative abundance of rare species, especially invasive or imperiled aquatic species. The rapid progress in the eDNA field has resulted in numerous studies impacting conservation and management actions. However, standardization of eDNA methods and reporting across the field is yet to be fully established, with one area being the calculation and interpretation of assay limit of detection (LOD) and limit of quantification (LOQ). Aims Here, we propose establishing consistent methods for determining and reporting of LOD and LOQ for single‐species quantitative PCR (qPCR) eDNA studies. Materials & Methods/ Results We utilize datasets from multiple cooperating laboratories to demonstrate both a discrete threshold approach and a curve‐fitting modeling approach for determining LODs and LOQs for eDNA qPCR assays. We also provide details of an R script developed and applied for the modeling method. Discussion/Conclusions Ultimately, standardization of how LOD and LOQ are determined, interpreted, and reported for eDNA assays will allow for more informed interpretation of assay results, more meaningful interlaboratory comparisons of experiments, and enhanced capacity for assessing the relative technical quality and performance of different eDNA qPCR assays.
Article
Full-text available
Background: Sample index cross-talk can result in false positive calls when massively parallel sequencing (MPS) is used for sensitive applications such as low-frequency somatic variant discovery, ancient DNA investigations, microbial detection in human samples, or circulating cell-free tumor DNA (ctDNA) variant detection. Therefore, the limit-of-detection of an MPS assay is directly related to the degree of index cross-talk. Results: Cross-talk rates up to 0.29% were observed when using standard, combinatorial adapters, resulting in 110,180 (0.1% cross-talk rate) or 1,121,074 (0.29% cross-talk rate) misassigned reads per lane in non-patterned and patterned Illumina flow cells, respectively. Here, we demonstrate that using unique, dual-matched indexed adapters dramatically reduces index cross-talk to ≤1 misassigned reads per flow cell lane. While the current study was performed using dual-matched indices, using unique, dual-unrelated indices would also be an effective alternative. Conclusions: For sensitive downstream analyses, the use of combinatorial indices for multiplexed hybrid capture and sequencing is inappropriate, as it results in an unacceptable number of misassigned reads. Cross-talk can be virtually eliminated using dual-matched indexed adapters. These results suggest that use of such adapters is critical to reduce false positive rates in assays that aim to identify low allele frequency events, and strongly indicate that dual-matched adapters be implemented for all sensitive MPS applications.
Article
Advances in high-throughput sequencing (HTS) are revolutionizing monitoring in marine environments by enabling rapid, accurate and holistic detection of species within complex biological samples. Research institutions worldwide increasingly employ HTS methods for biodiversity assessments. However, variance in laboratory procedures, analytical workflows and bioinformatic pipelines impede the transferability and comparability of results across research groups. An international experiment was conducted to assess the consistency of metabarcoding results derived from identical samples and primer sets using varying laboratory procedures. Homogenized biofouling samples collected from four coastal locations (Australia, Canada, New Zealand and the USA) were distributed to 12 independent laboratories. Participants were asked to follow one of two HTS library preparation workflows. While DNA extraction, primers and bioinformatic analyses were purposefully standardized to allow comparison, many other technical variables were allowed to vary among laboratories (e.g., amplification protocols, type of instrument used, etc.). Despite substantial variation observed in raw results, the primary signal in the data was consistent, with the samples grouping strongly by geographic origin for all datasets. Simple post-hoc data clean-up by removing low quality samples gave the best improvement in sample classification for nuclear 18S rRNA gene data, with an overall 92.81% correct group attribution. For mitochondrial COI gene data, the best classification result (95.58%) was achieved after correction for contamination errors. The identified critical methodological factors that introduced the greatest variability (preservation buffer, sample defrosting, template concentration, DNA polymerase, PCR enhancer) should be of great assistance in standardizing future comparative biodiversity studies using metabarcoding.