Article

TOPO TA is A-OK: a test of phylogenetic bias in fungal environmental clone library construction.

University of Alaska, Institute of Arctic Biology, 311 Irving I Building, Fairbanks, AK 99775, USA.
Environmental Microbiology (impact factor: 5.84). 06/2007; 9(5):1329-34. DOI:10.1111/j.1462-2920.2007.01253.x pp.1329-34
Source: PubMed

ABSTRACT TA cloning methods are widely used in analyses of environmental microbial diversity, yet the potential of TA methods to yield phylogenetically biased results has received little attention. To test for a TA bias, we constructed clone libraries of fungal amplicons spanning the ribosomal internally transcribed spacer (ITS) and partial large subunit (LSU) from 92 boreal forest soil DNA extracts using two contrasting methods: the Invitrogen TOPO-TA system and the Lucigen PCR-SMART system. The Lucigen system utilizes blunt-ended rather than TA cloning and transcription terminators to reduce biases due to toxicity of expressed inserts. We analysed 588 clone sequences from the two libraries. Species diversity estimators applied to operational taxonomical units (OTUs) were slightly higher for Invitrogen than Lucigen, but confidence intervals for accumulation curves overlapped. Abundances of OTUs were correlated between the libraries (r(2) = 0.5, P < 0.0001), but certain OTUs had contrasting abundances in the two libraries and a likelihood ratio test rejected homogeneity of the OTU counts. We constructed parsimony and Bayesian trees from aligned LSU regions, and the 'phylogenetic test' revealed that lineage representation was not significantly different between the two libraries. We conclude that characterization of this fungal community was fairly robust to cloning method and no biases due to TA cloning were found.

0 0
 · 
0 Bookmarks
 · 
35 Views
  • Article: Tasting soil fungal diversity with earth tongues: phylogenetic test of SATé alignments for environmental ITS data.
    [show abstract] [hide abstract]
    ABSTRACT: An abundance of novel fungal lineages have been indicated by DNA sequencing of the nuclear ribosomal ITS region from environmental samples such as soil and wood. Although phylogenetic analysis of these novel lineages is a key component of unveiling the structure and diversity of complex communities, such analyses are rare for environmental ITS data due to the difficulties of aligning this locus across significantly divergent taxa. One potential approach to this issue is simultaneous alignment and tree estimation. We targeted divergent ITS sequences of the earth tongue fungi (Geoglossomycetes), a basal class in the Ascomycota, to assess the performance of SATé, recent software that combines progressive alignment and tree building. We found that SATé performed well in generating high-quality alignments and in accurately estimating the phylogeny of earth tongue fungi. Drawing from a data set of 300 sequences of earth tongues and progressively more distant fungal lineages, 30 insufficiently identified ITS sequences from the public sequence databases were assigned to the Geoglossomycetes. The association between earth tongues and plants has been hypothesized for a long time, but hard evidence is yet to be collected. The ITS phylogeny showed that four ectomycorrhizal isolates shared a clade with Geoglossum but not with Trichoglossum earth tongues, pointing to the significant potential inherent to ecological data mining of environmental samples. Environmental sampling holds the key to many focal questions in mycology, and simultaneous alignment and tree estimation, as performed by SATé, can be a highly efficient companion in that pursuit.
    PLoS ONE 01/2011; 6(4):e19039. · 4.09 Impact Factor
  • Source
    Article: Factors that affect large subunit ribosomal DNA amplicon sequencing studies of fungal communities: classification method, primer choice, and error.
    [show abstract] [hide abstract]
    ABSTRACT: Nuclear large subunit ribosomal DNA is widely used in fungal phylogenetics and to an increasing extent also amplicon-based environmental sequencing. The relatively short reads produced by next-generation sequencing, however, makes primer choice and sequence error important variables for obtaining accurate taxonomic classifications. In this simulation study we tested the performance of three classification methods: 1) a similarity-based method (BLAST + Metagenomic Analyzer, MEGAN); 2) a composition-based method (Ribosomal Database Project naïve bayesian classifier, NBC); and, 3) a phylogeny-based method (Statistical Assignment Package, SAP). We also tested the effects of sequence length, primer choice, and sequence error on classification accuracy and perceived community composition. Using a leave-one-out cross validation approach, results for classifications to the genus rank were as follows: BLAST + MEGAN had the lowest error rate and was particularly robust to sequence error; SAP accuracy was highest when long LSU query sequences were classified; and, NBC runs significantly faster than the other tested methods. All methods performed poorly with the shortest 50-100 bp sequences. Increasing simulated sequence error reduced classification accuracy. Community shifts were detected due to sequence error and primer selection even though there was no change in the underlying community composition. Short read datasets from individual primers, as well as pooled datasets, appear to only approximate the true community composition. We hope this work informs investigators of some of the factors that affect the quality and interpretation of their environmental gene surveys.
    PLoS ONE 01/2012; 7(4):e35749. · 4.09 Impact Factor

Full-text (2 Sources)

View
2 Downloads
Available from
12 Apr 2013

Keywords

92 boreal forest soil DNA
 
certain OTUs
 
clone libraries
 
confidence intervals
 
environmental microbial diversity
 
fungal amplicons spanning
 
Invitrogen TOPO-TA system
 
likelihood ratio test
 
Lucigen PCR-SMART system
 
Lucigen system utilizes blunt-ended
 
operational taxonomical units
 
OTUs
 
partial large subunit
 
ribosomal internally transcribed spacer
 
Species diversity estimators
 
TA cloning methods
 
TA methods
 
transcription terminators
 
two libraries
 
yield phylogenetically