
Machine learning approaches to assess microendemicity and conservation risk in cave-dwelling arachnofauna


Abstract

The biota of cave habitats faces heightened conservation risks, due to geographic isolation and high levels of endemism. Molecular datasets, in tandem with ecological surveys, have the potential to precisely delimit the nature of cave endemism and identify conservation priorities for microendemic species. Here, we sequenced ultraconserved elements of Tegenaria within, and at the entrances of, 25 cave sites to test phylogenetic relationships, combined with an unsupervised machine learning approach for detecting species. Our analyses identified clear and well-supported genetic breaks in the dataset that accorded closely with morphologically diagnosable units. Through these analyses, we also detected some previously unidentified, potential cryptic morphospecies. We then performed conservation assessments for seven troglobitic Israeli species of this genus and determined five of these to be critically endangered.
RESEARCH
Conservation Genetics (2024) 25:1103–1110
https://doi.org/10.1007/s10592-024-01627-5
Introduction
Cave-dwelling taxa are at heightened risk of extinction due to the limited ranges imposed by a single cave system or, in extreme cases, a single cave. These taxa, sometimes referred to as short-range endemics or microendemics, face an outsized threat in the face of disturbance to their habitats and climate change (Harvey et al. 2011; Mammola et al. 2018). With limited individuals to sample, it is a challenge both to delimit endemic cave species, as well as develop management strategies for endangered taxa (Paquin and Hedin 2004). Broadly, cave ecosystems share core abiotic features, such as reduction or complete absence of light, high relative humidity, and buffered temperature ranges compared to their surrounding terrestrial surface climates (Barr and Holsinger 1985). The existence and maintenance of biodiversity in cave habitats is predicated on the ability of biota to adapt to such conditions. Consequently, unique phenotypic changes can be observed in cave-dwelling organisms across the animal tree of life. These changes comprise both reductive features (e.g., atrophy of structures not required for subterranean life), as well as constructive adaptations (e.g., compensatory gains in tactile appendages or olfactory capacity; Re et al. 2018; Riddle et al. 2018). One of the more conspicuous examples of this phenomenon is the partial or complete loss of eyes in cave-dwelling species. The Mexican cavefish (Astyanax mexicanus) is a well-studied exemplar of eye loss in cave-dwelling species. The blind morph of A. mexicanus is said to have evolved as recently as 20,000 years ago, exemplifying phenotypic change over rapid timescales and without the requirement of reproductive isolation (Fumey et al. 2018). Rapid evolution of disparate phenotypes allows for the study of how speciation begins in cave populations versus surface populations. Over
Hugh G. Steiner
hgsteiner@wisc.edu
1 Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI, USA
2 The National Natural History Collections, The Hebrew University of Jerusalem, Edmond J. Safra Campus, Givat Ram, Jerusalem 9190401, Israel
3 Department of Ecology, Evolution & Behavior, Edmond J. Safra Campus, Jerusalem, Israel
4 Department of Biology, Kean University, Union, NJ, USA
5 Department of Systems Biology, Harvard Medical School, Harvard University, Boston, MA, USA
6 Zoology Museum, University of Wisconsin-Madison, Madison, WI, USA
Keywords Machine learning · Phylogenomics · Conservation · Ultraconserved elements (UCEs) · Endemism
Received: 18 September 2023 / Accepted: 26 July 2024 / Published online: 3 August 2024
© The Author(s), under exclusive licence to Springer Nature B.V. 2024
Hugh G.Steiner1· ShlomiAharon2,3· JesúsBallesteros4· GuilhermeGainett5· EfratGavish-Regev3·
Prashant P.Sharma1,6
Article
Advanced sequencing technologies have expedited resolving higher-level arthropod relationships. Yet, dark branches persist, principally among groups occurring in cryptic habitats. Among chelicerates, Solifugae (“camel spiders”) is the last order lacking a higher-level phylogeny and thus, historically characterized as “neglected [arachnid] cousins”. Though renowned for aggression, remarkable running speed, and xeric adaptation, inferring solifuge relationships has been hindered by inaccessibility of diagnostic morphological characters, whereas molecular investigations have been limited to one of 12 recognized families. Our phylogenomic dataset via capture of ultraconserved elements sampling all extant families recovered a well-resolved phylogeny, with two distinct groups of New World taxa nested within a broader Paleotropical radiation. Divergence times using fossil calibrations inferred Solifugae radiated by the Permian, and most families diverged pre-Paleogene-Cretaceous extinction, largely driven by continental breakup. We establish Boreosolifugae new suborder uniting five Laurasian families, and Australosolifugae new suborder uniting seven Gondwanan families using morphological and biogeographic signal.
Preprint
Targeted enrichment of conserved and ultraconserved genomic elements allows universal collection of phylogenomic data from hundreds of species at multiple time scales (< 5 Ma to > 300 Ma). Prior to downstream inference, data from these types of targeted enrichment studies must undergo pre-processing to assemble contigs from sequence data; identify targeted, enriched loci from the off-target background data; align enriched contigs representing conserved loci to one another; and prepare and manipulate these alignments for subsequent phylogenomic inference. PHYLUCE is an efficient and easy-to-install software package that accomplishes these tasks across hundreds of taxa and thousands of enriched loci. Availability and Implementation PHYLUCE is written for Python 2.7. PHYLUCE is supported on OSX and Linux (RedHat/CentOS) operating systems. PHYLUCE source code is distributed under a BSD-style license from https://www.github.com/fairclothUlab/phyluce/ . PHYLUCE is also available as a package ( https://binstar.org/fairclothUlab/phyluce ) for the Anaconda Python distribution that installs all dependencies, and users can request a PHYLUCE instance on iPlant Atmosphere (tag: phyluce). The software manual and a tutorial are available from http://phyluce.readthedocs.org/en/latest/ and test data are available from doi: 10.6084/m9.figshare.1284521. Contact brant@fairclothUlab.org Supplementary information Supplementary Figure 1.
Article
Caves share unique conditions that have led to convergent adaptations of cave-dwelling animals. In addition, local factors act as filters on regional species-pools to shape the assemblage composition of local caves. Surveys of 35 Levantine caves, distributed along a climate gradient from the mesic in the north of Israel to hyper-arid areas in the south of Israel, were conducted to test the effect of cave characteristics, location, climate, bat presence, and guano level on the spider assemblage. We found 62 spider species and assigned four species as troglobites, 28 as troglophiles, and 30 as accidentals. Precipitation, elevation, latitude, minimum temperature, and guano levels significantly affected the composition of cave-dwelling spider assemblages. Caves situated in the Mediterranean region had higher species richness and abundance, as well as more troglobite and troglophile arachnids. These discoveries contribute to the knowledge of the local arachnofauna and are important for the conservation of cave ecosystems. By comparing spider assemblages of Levantine caves to European caves, we identified gaps in the taxonomic research, focusing our efforts on spider families that may have additional cryptic or yet to be described cave-dwelling spider species. Our faunistic surveys are crucial stages for understanding the evolutionary and ecological mechanisms of arachnid speciation in Levantine caves.
Article
Genome-scale data sets are converging on robust, stable phylogenetic hypotheses for many lineages; however, some nodes have shown disagreement across classes of data. We use spiders (Araneae) as a system to identify the causes of incongruence in phylogenetic signal between three classes of data: exons (as in phylotranscriptomics), non-coding regions (included in ultraconserved elements [UCE] analyses), and a combination of both (as in UCE analyses). Gene orthologs, coded as amino acids and nucleotides (with and without third codon positions), were generated by querying published transcriptomes for UCEs, recovering 1,931 UCE loci (codingUCEs). We expected that congeners represented in the codingUCE and UCEs data would form clades in the presence of phylogenetic signal. Non-coding regions derived from UCE sequences were recovered to test the stability of relationships. Phylogenetic relationships resulting from all analyses were largely congruent. All nucleotide data sets from transcriptomes, UCEs, or a combination of both recovered similar topologies in contrast with results from transcriptomes analyzed as amino acids. Most relationships inferred from low occupancy data sets, containing several hundreds of loci, were congruent across Araneae, as opposed to high occupancy data matrices with fewer loci, which showed more variation. Furthermore, we found that low occupancy data sets analyzed as nucleotides (as is typical of UCE data sets) can result in more congruent relationships than high occupancy data sets analyzed as amino acids (as in phylotranscriptomics). Thus, omitting data, through amino acid translation or via retention of only high occupancy loci, may have a deleterious effect in phylogenetic reconstruction.
Article
Recent environmental processes are studied in ʻA’rak Naʻasane Cave at the northern Judean Desert, Israel. The outer zone of the cave is heavily influenced by the outside environment through a large entrance, facilitating entry of air flow, fauna and humans, with minor cave-forming modifications. Conversely, the inner cave sustains humid and warm conditions, favoring modifications by condensation corrosion of convective air flow, associated with deposition of popcorn speleothems at the lower parts of dissolution pockets. The warm humid air of the inner cave may be associated with an underlying thermal water table. Active condensation corrosion is decreasing, possibly because of gradual change in the cave microclimate, associated with falling water table and ventilation. Increasing connection with the surface is indicated by high collapse domes, rare flood invasion, and a large Trident Leaf-nosed bat community which spends the winter within the innermost parts of the cave. Bat guano supports bedrock corrosion and a rich invertebrate fauna, but humans preferred the outer parts of the cave, particularly for refuge during the second Jewish revolt against the Romans. Rare occasions of ancient human entry into the inner cave support this scenario by the small number of artifacts compared with the outer cave. Enigmatic small cairns in the largest inner hall were probably erected during the Intermediate Bronze Age.
Article
Asymmetrical rates of cladogenesis and extinction abound in the Tree of Life, resulting in numerous minute clades that are dwarfed by larger sister groups. Such taxa are commonly regarded as phylogenetic relicts or "living fossils" when they exhibit an ancient first appearance in the fossil record and prolonged external morphological stasis, particularly in comparison to their more diversified sister groups. Due to their special status, various phylogenetic relicts tend to be well-studied and prioritized for conservation. A notable exception to this trend is found within Amblypygi ("whip spiders"), a visually striking order of functionally hexapodous arachnids that are notable for their antenniform first walking leg pair (the eponymous "whips"). Paleoamblypygi, the putative sister group to the remaining Amblypygi, is known from Late Carboniferous and Eocene deposits, but is survived by a single living species, Paracharon caecus Hansen, 1921, that was last collected in 1899. Due to the absence of genomic sequence-grade tissue for this vital taxon, there is no global molecular phylogeny for Amblypygi to date, nor a fossil-calibrated estimation of divergences within the group. Here, we report a previously unknown species of Paleoamblypygi from a cave site in Colombia. Capitalizing upon this discovery, we generated the first molecular phylogeny of Amblypygi, integrating ultraconserved element sequencing with legacy Sanger datasets and including described extant genera. To quantify the impact of sampling Paleoamblypygi on divergence time estimation, we performed in silico experiments with pruning of Paracharon. We demonstrate that the omission of relicts has a significant impact on the accuracy of node dating approaches that outweighs the impact of excluding ingroup fossils, which bears upon the ancestral range reconstruction for the group. 
Our results underscore the imperative for biodiversity discovery efforts in elucidating the phylogenetic relationships of "dark taxa", and especially phylogenetic relicts in tropical and subtropical habitats. The lack of reciprocal monophyly for Charontidae and Charinidae leads us to subsume them into one family, Charontidae, new synonymy.
Article
Caves have long been recognized as a window into the mechanisms of diversification and convergent evolution, due to the unique conditions of isolation and life in the dark. These lead to adaptations and reduce dispersal and gene flow, resulting in high levels of speciation and endemism. The Israeli cave arachnofauna remains poorly known, but likely represents a rich assemblage. In a recent survey, we found troglophilic funnel-web spiders of the genus Tegenaria in 26 caves, present mostly at the cave entrance ecological zone. In addition, we identified at least 14 caves inhabited by troglobitic Tegenaria, which are present mostly in the twilight and dark ecological zones. Ten of the caves, located in the north and center of Israel, are inhabited by both troglophilic and troglobitic Tegenaria. These spiders bear superficial phenotypic similarities, but differ in the levels of eye reduction and pigmentation. To test whether these taxa constitute separate species, as well as understand their relationships to epigean counterparts, we conducted a broad geographic sampling of cave-dwelling Tegenaria in Israel and Palestine, using morphological and molecular evidence. Counterintuitively, our results show that the troglobitic Tegenaria we studied are distantly related to the troglophilic Tegenaria found at each of the cave entrances we sampled. Moreover, seven new troglobitic species can be identified based on genetic differences, eye reduction level, and features of the female and male genitalia. Our COI analysis suggest that the Israeli troglobitic Tegenaria species are more closely related to eastern-Mediterranean congeners than to the local sympatric troglophile Tegenaria species, suggesting a complex biogeographic history.
Article
Phylogenomic methods have proven useful for resolving deep nodes and recalcitrant groups in the spider tree of life. Across arachnids, transcriptomic approaches may generate thousands of loci, and target‐capture methods, using the previously designed arachnid‐specific probe‐set, can target a maximum of about 1,000 loci. Here, we develop a specialized target‐capture probe set for spiders that contains over 2,000 ultraconserved elements (UCEs) and then demonstrate the utility of this probe set through sequencing and phylogenetic analysis. We designed the “spider‐specific” probe set using three spider genomes (Loxosceles, Parasteatoda and Stegodyphus) and ensured that the newly designed probe‐set include UCEs from the previously designed Arachnida probe set. The new “spider‐specific” probes were used to sequence UCE loci in 51 specimens. The remaining samples included five spider genomes and taxa that were enriched using Arachnida probe set. The “spider‐specific” probes were also used to gather loci from a total of 84 representative taxa across Araneae. On mapping these 84 taxa to the Arachnida probe set, we captured at most 710 UCE loci, while the spider specific probe set captured up to 1,547 UCE loci from the same taxon sample. Phylogenetic analyses using Maximum Likelihood and coalescent methods corroborate most nodes resolved by recent transcriptomic analyses, but not all (e.g., UCE data suggests monophyly of “symphytognathoids”). Our preferred analysis based on topology tests, suggests monophyly of the “symphytognathoids” (the miniature orb‐weavers), which in previous studies has only been supported by a combination of morphological and behavioral characters.
Article
Repeated evolution of similar phenotypes is a widespread phenomenon found throughout the living world and it can proceed through the same or different genetic mechanisms. Cave animals with their convergent traits such as eye and pigment loss, as well as elongated appendages, are a striking example of the evolution of similar phenotypes. Yet, few cave species are amenable to genetic crossing and mapping techniques making it challenging to determine the genetic mechanisms causing their similar phenotypes. To address this limitation, we have been developing Asellus aquaticus, a freshwater isopod crustacean, as a genetic model. Many of its cave populations originate from separate colonization events and thus independently evolved their similar cave-related phenotypes which differ from the still existent ancestral-like surface populations. In our prior work, we identified genomic regions responsible for eye and pigment loss in a single cave population from Slovenia. In this study we examined another, independently evolved cave population, also from Slovenia, and asked whether the same or different genomic regions are responsible for eye and pigment loss in the two cave populations. We generated F2 and backcross hybrids with a surface population, genotyped them for the previously identified genomic regions, and performed a complementation test by crossing individuals from the two cave populations. We found out that the same genomic regions are responsible for eye and pigment loss and that at least one of the genes causing pigment loss is the same in both cave populations. Future studies will identify the actual genes and mutations, as well as examine additional cave populations to see if the same genes are commonly associated with eye and pigment loss in this species.