Brant C. Faircloth’s research while affiliated with Louisiana State University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (256)


Integrative taxonomy using traits and genomic data for Species Delimitation with Deep learning
  • Preprint
  • File available

March 2025

·

171 Reads

·

·

Brant C Faircloth

·

[...]

·

Recognizing species boundaries in complex speciation scenarios, including those that involve gene flow and demographic fluctuations, combined with the plethora of existing species concepts, is a challenge that has recently been brought to attention. Promising recent approaches consider an integrative taxonomy with multiple sources of evidence (e.g., genetic, morphology, geographic distributions), which can be related to diverse properties associated with the dynamics of the speciation continuum. Also, the use of statistical inferential methods for model comparison, such as approximate Bayesian computation, approximate likelihoods, and machine learning, has allowed a better assessment of species boundaries in such cases. However, most approaches involve analyzing genetic and phenotypic/geographical information separately, followed by visual/qualitative comparison. Methods that integrate genetic information with other sources of evidence have been limited to simple evolutionary models and are not able to analyze more than a few hundred loci across a maximum of a few tens of samples. Here, we present a deep learning approach (DeepID) that combines convolutional neural networks and multilayer perceptrons to integrate both genomic data (thousands of loci or single nucleotide polymorphisms, SNPs) and trait information into a unified framework. By using simulated and empirical datasets, we evaluate the power and accuracy of our approach for discriminating among competing allopatric speciation scenarios when varying the number of SNPs and traits, and the impact of missing data. We found that the accuracy of our method was lower for datasets that included only trait data, but when we combined both genomic and trait data types, the accuracy was similar to when we considered genomic data alone. However, when we violated the simple allopatric speciation model by including migration, the approach based on traits was less affected than analyzing datasets including the genomic information. Moreover, using both sources of data can incorporate complementary information associated with different stages of the speciation process. Our approach was able to recover the expected delimitation scenarios in empirical datasets of one plant (Euphorbia balsamifera) and one fish (Lepomis megalotis) species complex. We argue that our method is a flexible and promising approach, allowing for complex scenario comparison and the use of multiple types of data.

Download


Figure 7. Crown ages for 12 major avian clades with relative divergence time (to Neognathae) on the top panel and absolute divergence time on the bottom panel. Divergence times were extracted from the time-calibrated RAxML-NG tree and the six other time-calibrated trees from various approaches in our study as well as time-calibrated trees from previous studies. Branch lengths of the supertrees and divide-and-conquer trees were optimized using the same filtered dataset (see methods).
Figure 2. A genus-level RAxML-NG tree with branch lengths converted to divergence time using TreePL. Major bird clades are color-coded, while three lineages (Gruiformes, Charadriiformes and Opisthocomiformes; see Reddy et al. 2017) that were not placed within a strongly corroborated superordinal clade, remain uncolored (silhouettes in gray). Colored bars in the outer ring indicate genera that are monophyletic (blue; n = 372) and non-monophyletic (red; n = 38) in this phylogeny. Monotypic genera (n = 334 with a single species currently recognized in IOC World Bird List v13.1) and genera represented by a single sample in our dataset (n = 337) are gray. The concentric gray circles indicate 20 Ma time intervals. See Supplementary Figure S7 for a version of this tree with tip labels.
Figure 4. Neighbor-joining tree of trees based on normalized Robinson-Foulds distances (rooted at the midpoint). Here we only present the representative trees for each approach. Full results can be found in Supplementary Tables S7-9. For the two hybrid approaches, we added in the time for running the backbone tree. For the new fasttrees, we summarized the compute time for running four fasttrees analyses for each dataset (using different MP starting trees under GTR+G [parsA & parsB] or GTR+R4 [parsC & parsD] model) and presented the total time used here. For example, for a new fasttree analysis based on the full dataset, four runs each cost 999, 881, 961, and 943 CPU hours; we presented the fasttree parsD here, but the total CPU hours spent was a sum of the four runs (i.e., 3785).
Figure 6. A comparison between the number of unresolved expected clades (genera, high-level clades, and all expected clades combined) and parsimony informative sites (above), as well as average proportion of gaps and ambiguities (“-”, “?” or “N”) across all locus alignments for the dataset (below). For each filtered dataset, four fasttrees with different parsimony starting trees were evaluated. A slight jitter was used when two shapes were completely overlapping each other so that both shapes would be visible.
Efficient Inference of Macrophylogenies: Insights from the Avian Tree of Life

February 2025

·

259 Reads

The exponential growth of molecular sequence data over the past decade has enabled the construction of numerous clade-specific phylogenies encompassing hundreds or thousands of taxa. These independent studies often include overlapping data, presenting a unique opportunity to build macrophylogenies (phylogenies sampling > 1,000 taxa) for entire classes across the Tree of Life. However, the inference of large trees remains constrained by logistical, computational, and methodological challenges. The Avian Tree of Life provides an ideal model for evaluating strategies to robustly infer macrophylogenies from intersecting datasets derived from smaller studies. In this study, we leveraged a comprehensive resource of sequence capture datasets to evaluate the phylogenetic accuracy and computational costs of four methodological approaches: (1) supermatrix approaches using concatenation, including the "fast" maximum likelihood (ML) methods, (2) filtering datasets to reduce heterogeneity, (3) supertree estimation based on published phylogenomic trees, and (4) a "divide-and-conquer" strategy, wherein smaller ML trees were estimated and subsequently combined using a supertree approach. Additionally, we examined the impact of these methods on divergence time estimation using a dataset that includes newly vetted fossil calibrations for the Avian Tree of Life. Our findings highlight that recently developed fast tree search approaches offer a reasonable compromise between computational efficiency and phylogenetic accuracy, facilitating inference of macrophylogenies.


The coevolution of fungus-ant agriculture

October 2024

·

653 Reads

·

9 Citations

Science

Fungus-farming ants cultivate multiple lineages of fungi for food, but, because fungal cultivar relationships are largely unresolved, the history of fungus-ant coevolution remains poorly known. We designed probes targeting >2000 gene regions to generate a dated evolutionary tree for 475 fungi and combined it with a similarly generated tree for 276 ants. We found that fungus-ant agriculture originated ~66 million years ago when the end-of-Cretaceous asteroid impact temporarily interrupted photosynthesis, causing global mass extinctions but favoring the proliferation of fungi. Subsequently, ~27 million years ago, one ancestral fungal cultivar population became domesticated, i.e., obligately mutualistic, when seasonally dry habitats expanded in South America, likely isolating the cultivar population from its free-living, wet forest–dwelling conspecifics. By revealing these and other major transitions in fungus-ant coevolution, our results clarify the historical processes that shaped a model system for nonhuman agriculture.



Complexity of avian evolution revealed by family-level genomes

April 2024

·

1,464 Reads

·

94 Citations

Nature

Despite tremendous efforts in the past decades, relationships among main avian lineages remain heavily debated without a clear resolution. Discrepancies have been attributed to diversity of species sampled, phylogenetic method and the choice of genomic regions1–3. Here we address these issues by analysing the genomes of 363 bird species⁴ (218 taxonomic families, 92% of total). Using intergenic regions and coalescent methods, we present a well-supported tree but also a marked degree of discordance. The tree confirms that Neoaves experienced rapid radiation at or near the Cretaceous–Palaeogene boundary. Sufficient loci rather than extensive taxon sampling were more effective in resolving difficult nodes. Remaining recalcitrant nodes involve species that are a challenge to model due to either extreme DNA composition, variable substitution rates, incomplete lineage sorting or complex evolutionary events such as ancient hybridization. Assessment of the effects of different genomic partitions showed high heterogeneity across the genome. We discovered sharp increases in effective population size, substitution rates and relative brain size following the Cretaceous–Palaeogene extinction event, supporting the hypothesis that emerging ecological opportunities catalysed the diversification of modern birds. The resulting phylogenetic estimate offers fresh insights into the rapid radiation of modern birds and provides a taxon-rich backbone tree for future comparative studies.


iNextEra: An efficient and cost-effective library preparation protocol for sequencing human-infecting Cryptosporidium spp

March 2024

·

44 Reads

Genome-scale sequencing is important in the study of infectious diseases, but preparing DNA samples for such sequencing can be constrained by the need for specialized equipment, expensive reagents, and higher input DNA than is readily available. Cryptosporidium oocysts are shed into feces, making it difficult and expensive to purify and yielding limited amounts of DNA. To address these constraints, we developed an iNextEra library preparation protocol for Illumina sequencing that works across a broad range of input DNA amounts, requires little bench time and no specialized equipment, succeeds with limited PCR cycles, yet is low-cost per sample. Our method uses 1/20th of the standard amount of the Illumina DNA Prep tagmentation beads, a simplified protocol, custom indexing primers, and common PCR reagents. A single researcher can use these methods to produce ≥ 96 libraries per day at a cost of <5 USD per sample and may pool up to 480 unique dual-indexed or up to 230,400 (480x480) unique combinatorically indexed libraries. We tested this protocol by making libraries from mock communities with known dilutions of Cryptosporidium DNA, and real-world clinical samples of Cryptosporidium infected patients. Overall, our protocol performed well with a wide range of input DNA (<1 ng to >60 ng) and 8-14 PCR cycles, thus reducing bias introduced by amplification. The libraries are compatible with all Illumina sequencers, including NovaSeq X, for low-cost sequencing per sample. This protocol thus opens the potential for many laboratories to process the samples of these important parasites for Illumina sequencing efficiently.


Testing the mettle of METAL: A comparison of phylogenomic methods using a challenging but well-resolved phylogeny

March 2024

·

194 Reads

·

2 Citations

The evolutionary histories of different genomic regions typically differ from each other and from the underlying species phylogeny. This makes species tree estimation challenging. Here, we examine the performance of phylogenomic methods using a well-resolved phylogeny that nevertheless contains many difficult nodes, the species tree of living birds. We compared trees generated by maximum likelihood (ML) analysis of concatenated data, gene tree summary methods, and SVDquartets. We also conduct the first empirical test of a ''new'' method called METAL ( M etric algorithm for E stimation of T rees based on A ggregation of L oci), which is based on evolutionary distances calculated using concatenated data. We conducted this test using a novel dataset comprising more than 4000 ultraconserved element (UCE) loci from almost all bird families and two existing UCE and intron datasets sampled from almost all avian orders. We identified ''reliable clades'' very likely to be present in the true avian species tree and used them to assess method performance. ML analyses of concatenated data recovered almost all reliable clades with less data and greater robustness to missing data than other methods. METAL recovered many reliable clades, but only performed well with the largest datasets. Gene tree summary methods (weighted ASTRAL and weighted ASTRID) performed well; they required less data than METAL but more data than ML concatenation. SVDquartets exhibited the worst performance of the methods tested. In addition to the methodological insights, this study provides a novel estimate of avian phylogeny with almost 99% of the currently recognized avian families. Only one of the 181 reliable clades we examined was consistently resolved differently by ML concatenation versus other methods, suggesting that it may be possible to achieve consensus on the deep phylogeny of extant birds.



Dispersal sweepstakes: Biotic interchange propelled air‐breathing fishes across the globe

December 2023

·

126 Reads

·

3 Citations

Journal of Biogeography

Aim Biotic interchanges between Africa, India, and Eurasia are central to explaining the present‐day distribution and diversity of freshwater organisms across these landmasses. Synbranchiformes is a diverse and species‐rich clade of freshwater acanthomorph fishes found on all southern continents except Antarctica, and include eel‐ and perch‐like, air‐breathing and non‐air‐breathing fishes. Lacking a comprehensive and resolved phylogeny of the entire clade, contemporary interpretations of synbranchiform biogeography invoke scenarios as disparate as Gondwanan vicariance and pan‐global rafting to explain their modern‐day distribution. Here, we study their biogeographic history of continental dispersal events and test whether these are associated with increases in lineage diversification. Location Asia, India, Africa freshwater habitats. Taxon Synbranchiformes (gouramis, snakeheads, swamp eels, and relatives). Methods We used nearly 1000 ultra‐conserved elements (UCEs) and Sanger‐sequenced genes to infer a phylogeny with representatives of all major synbranchiform lineages and nearly two‐thirds of its known species diversity. Incorporating fossil calibrations, we inferred a time‐calibrated phylogeny to which we apply Bayesian methods of ancestral area reconstruction and test for diversification rate shifts. Results Analyses of UCE data provide a resolved phylogeny for major synbranchiform lineages. Divergence times support a most recent common ancestor of the entire clade approximately 79.2 million years ago. We infer significant increases in lineage diversification in both the spiny eels (Mastacembelidae) and the genus Betta (Osphronemidae). Main Conclusions Our results reject the hypothesis of Gondwanan vicariance explaining synbranchiform biogeography. Instead, our analyses reconstruct a southeast Asian origin of the entire clade and independent dispersal events to other continents by snakeheads, anabantids, and spiny eels, with no signal of elevated lineage diversification occurring after these invasions. Higher lineage diversification rates in spiny eels pre‐date their arrival to Africa, while the high diversification rates observed in Betta were initiated prior to the flooding of insular Sundaland in southeast Asia.


Citations (60)


... However, the impact on macrofauna, such as fish, is less clear. Molecular clock data for Goby fishes (Acanthomorpha: Percomorpha: Gobiiformes) suggest that they may have had a large radiation around the EOT [11]. The freshwater cyprinids also diversified around this time interval and expanded into Europe [12]. ...

Reference:

First Marine Fossil Otoliths (Teleostei) from East Africa (Tanzania)
Explosion of goby fish diversity at the Eocene-Oligocene transition
  • Citing Article
  • March 2025

Molecular Phylogenetics and Evolution

... Recent genomic studies on various fungal symbionts outside of the beetle system, including arbuscular mycorrhizal fungi and fungal crops of insects, have revealed insights into genome evolution and functional gene diversification [11][12][13][14][15]. However, most such symbioses evolved only once or a limited number of times within their respective phylogenies. ...

The coevolution of fungus-ant agriculture
  • Citing Article
  • October 2024

Science

... The Orussoidea-Apocrita sister group relationship is repeatedly retrieved as strongly supported, also in recent comprehensive phylogenomic treatments of hymenopteran relationships (e.g. Blaimer et al., 2023;Peters et al., 2017;Wutke et al., 2024). ...

Phylogenomics and biogeography of sawflies and woodwasps (Hymenoptera, Symphyta)

Molecular Phylogenetics and Evolution

... Meanwhile, long-read sequencing approaches, recently coined "method of the year" by Nature Methods [18], are the method of choice and allow for much better genome assemblies up to chromosome-scale scaffolds [19]. Comparative genomic studies are performed across all the branches of the tree of life [20][21][22]. Advances in sequencing technology and assembly techniques promoted large consortium efforts to produce datasets that allow for broad scale comparative genomics approaches and led to the final goal to sequence representative genomes from all eukaryotic species (www. ...

Complexity of avian evolution revealed by family-level genomes

Nature

... These distributions are suggestive of introgression and/or incomplete lineage sorting between two species, and merit further investigation (Helbig et al. 2002, Donegan 2018. A single dark-eyed individual in the range of the pale-eyed group in the Yucatán (Fig. 6A) suggests that, in this population, dark irides may be recessive to pale iris colour, as in domestic chickens (Smyth 1990, Corbett et al. 2024. Paynter (1955) correctly questioned whether pallidus is valid, since specimens from its purported range are extremely variable and inseparable from affinis in eastern Chiapas. ...

The mechanistic, genetic and evolutionary causes of bird eye colour variation
  • Citing Article
  • October 2023

Ibis

... In this study, we explore the potential of intramolecular distances to be treated as evolutionary characters and set out to ask if these characters could either help the reconstruction of phylogenetic trees or provide new ways of estimating branch reliability. The use of structures for phylogeny reconstruction remains limited because it has been difficult to identify independent objective reference datasets that would allow the distances to be tested in a manner similar to other distance metrics 18 . ...

Testing the mettle of METAL: A comparison of phylogenomic methods using a challenging but well-resolved phylogeny

... Because of their limited tolerance to high salinities (as secondary freshwater fishes)-although see Tyler and Feller (1996)-attempts to explain their near global distribution have historically relied on Gondwanan vicariance scenarios (Rosen, 1975(Rosen, , 1978Mar-Silva et al., 2022). More recently, however, a time-scaled phylogeny of all major synbranchiform lineages has challenged the vicariance hypothesis in favor of pan-global rafting (Harrington et al., 2024). The taxonomic history of swamp eels has been no less convoluted, afflicted by numerous nomenclatural changes and limited resolution of phylogenetic relationships. ...

Dispersal sweepstakes: Biotic interchange propelled air‐breathing fishes across the globe
  • Citing Article
  • December 2023

Journal of Biogeography

... This isolation might explain some early Pleistocene diversification events associated with dynamic rivers flowing over alluvial terrains, such as the Japurá, Ucayali and Purus 36,124,126,139,140 (Fig. 4). The Late Pleistocene retraction of floodplains in western Amazonia and the expansion of terra firme habitats over abandoned floodplains 6 probably constrained the current distributional limits of species from these areas and could explain the observed instances of introgression involving taxa from small western Amazonian interfluves [141][142][143] . ...

Implications of headwater contact zones for the riverine barrier hypothesis: a case study of the Blue-capped Manakin (Lepidothrix coronata)

Evolution

... This hitchhiking behavior enhances access to food resources, reduces transportation costs, and protects against predators [3,6,7]. Echeneidae research has largely focused on several key areas, including bionomics [8], hydrodynamic loading on hosts [6], unique adhesion capabilities [9], genomic studies [4,10,11], and phylogenetic relationships [2,3,[12][13][14][15]. Previous studies have classified the family Echeneidae based on morphological and molecular data. ...

Widespread sympatry in a species-rich clade of marine fishes (Carangoidei)