Michael J. Sanderson’s research while affiliated with University of Massachusetts Chan Medical School and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (302)


BALB/c mice show stronger airway contraction than C57BL/6 mice. (A) Representative phase-contrast images of 2 small airways in PCLS derived from BALB/c and C57BL/6 mice under resting conditions (left panel, scale bar = 100 μm) and exposed to 2 μM MCh (right panel) at 37°C. (B–C) The contraction assay show small airway contraction in response to increasing doses of MCh and 5-HT, respectively, at 37°C in PCLS derived from BALB/c (BLUE line) and C57BL/6 (RED line) mice. (D–E) The comparison of dose-response curves to MCh (D) and 5-HT (E) between BALB/c (BLUE line) and C57BL/6 (RED line) mice. Data are presented as the mean ± SD, *p < 0.0001 in (D), #p < 0.0001 in (E). Data are obtained from 14 airways in 8 PCLS of 6 BALB/c mice, and 12 airways in 6 PCLS of 6 C57BL/6 mice, respectively.
BALB/c mice show an increased agonist-induced Ca²⁺ oscillation frequency in ASM cells compared to C57BL/6 mice. (A–B) Representative traces show Ca²⁺ oscillations of ASM cells in PCLS derived from BALB/c (A, BLUE line) and C57BL/6 (B, RED line) mice in response to increasing doses of MCh at 37°C. (C–D) The comparison of dose-response curves to MCh (C) and 5-HT (D) between BALB/c (BLUE line) and C57BL/6 (RED line) mice. Data are presented as the mean ± SD, *p < 0.0001 in (C), #p < 0.0001 in (D). Data are obtained from 18 ASM cells in 9 PCLS of 6 BALB/c mice, and15 ASM cells in 7 PCLS of 6 C57BL/6 mice, respectively.
Reduced SOC current (VSOCC) leads to slower Ca²⁺ oscillations in ASM cells. (A) Mathematical model predicts that decreased VSOCC might lead to decreased Ca²⁺ oscillations in ASM cells. (B) The frequency of MCh (400 nM) induced Ca²⁺ oscillations in ASM cells of PCLS decreased when SOCE is inhibited by 1 μM and 10 μM GSK7975A (G7) respectively. (C) There are significant differences among Ca²⁺ oscillation frequency induced by 400 nM MCh, MCh +1 μM G7 and MCh +10 μM G7. Data are presented as the mean ± SD. For each bar in (C), seven to eight ASM cells obtained from 6 PCLS in 3 BALB/c mice are evaluated.
BALB/c mice show higher levels of [Ca²⁺]i in SOCE-activated ASM cells than C57BL/6 mice. (A) A sketch illustrates the development of SOCE-activated ASM cells: 1, PCLS were treated with 25 μM ryanodine (Rya) and 20 mM caffeine simultaneously to “lock” the ryanodine receptors (RyR) of SR in an open state; 2, Opening of RyR leads to the depletion of SR Ca²⁺; 3, Ca²⁺ depletion in SR induces Ca²⁺ influx through SOC to form SOCE; 4, The steady level of [Ca²⁺]i in SOCE-activated ASM cells results from a dynamic balance of Ca²⁺ influx through SOCE, Ca²⁺ efflux from SR through RyR, Ca²⁺ uptake to the SR by SERCA, and Ca²⁺ efflux through plasma membrane Ca²⁺ ATPase (PCMA) and Na⁺/Ca²⁺ exchanger (NCX). (B) SOCE-activated PCLS is prepared by exposing PCLS to a cocktail of 25 μM ryanodine and 20 mM caffeine at 37°C, ASM cells from BALB/c mice (BLUE line) show a higher steady level of [Ca²⁺]i than those from C57BL/6 mice (RED line). Each point of the line represents the mean ± SD of the intensity at selected time points (Ft) normalized to the initial intensity at the beginning time (F0). (C) Comparison of steady levels (Ft/F0) in SOCE-activated ASM cells derived from BALB/c and C57BL/6 mice. Data are presented as the mean ± SD, p = 0.0126. Data are obtained from 17 ASM cells of 5 BALB/c mice, and 20 ASM cells of 5 C57BL/6 mice, respectively.
BALB/c mice show faster and higher SOC currents in SOCE-activated ASM cells than C57BL/6 mice. (A) The SOC current is evaluated by removing extracellular Ca²⁺ first to empty the [Ca²⁺]i and [Ca²⁺]SR in SOCE-activated ASM cells, and then adding back 1.3 mM Ca²⁺ to assess the transient Ca²⁺ influx via SOCE. Each point of the line represents the mean ± SD of the intensity at selected time points (Ft) normalized to the initial intensity at the beginning time (F0). The SOC current was further quantified and compared by measuring (B) the rate of Ca²⁺ entry and (C) the peak value of Ca²⁺ influx. Data are presented as the mean ± SD, p < 0.0001 in (C). Data are obtained from 16 ASM cells in 8 PCLS of 5 BALB/c mice, and 15 ASM cells in 7 PCLS of 5 C57BL/6 mice.

+1

Inherent differences of small airway contraction and Ca oscillations in airway smooth muscle cells between BALB/c and C57BL/6 mouse strains
  • Article
  • Full-text available

June 2023

·

36 Reads

·

8 Citations

Zijian Zeng

·

Mengxin Cheng

·

Meng Li

·

[...]

·

BALB/c and C57BL/6 mouse strains are widely used as animal model in studies of respiratory diseases, such as asthma. Asthma is characterized by airway hyperresponsiveness, which is eventually resulted from the excessive airway smooth muscle (ASM) contraction mediated by Ca²⁺ oscillations in ASM cells. It is reported that BALB/c mice have inherently higher airway responsiveness, but show no different contractive response of tracheal ring as compared to C57BL/6 mice. However, whether the different airway responsiveness is due to the different extents of small airway contraction, and what’s underlying mechanism remains unknown. Here, we assess agonist-induced small airway contraction and Ca²⁺ oscillations in ASM cells between BALB/c and C57BL/6 mice by using precision-cut lung slices (PCLS). We found that BALB/c mice showed an intrinsically stronger extent of small airway narrowing and faster Ca²⁺ oscillations in ASM cells in response to agonists. These differences were associated with a higher magnitude of Ca²⁺ influx via store-operated Ca²⁺ entry (SOCE), as a result of increased expression of SOCE components (STIM1, Orai1) in the ASM cells of small airway of BALB/c mice. An established mathematical model and experimental results suggested that the increased SOC current could result in increased agonist-induced Ca²⁺ oscillations. Therefore, the inherently higher SOC underlies the increased Ca²⁺ oscillation frequency in ASM cells and stronger small airway contraction in BALB/c mice, thus higher airway responsiveness in BALB/c than C57BL/6 mouse strain.

Download

Figure 1. The SGP5 individual on Tumamoc Hill, Tucson, Arizona. Photo: MF Wojciechowski.
An improved genome assembly of the saguaro cactus ( Carnegiea gigantea (Engelm.) Britton & Rose)

April 2023

·

155 Reads

We present an improved genome assembly of the saguaro cactus ( Carnegiea gigantea (Engelm.) Britton & Rose), obtained by incorporating long-read PacBio data to the existing short reads. The assembly improves in terms of total size, contiguity, and accuracy, allowing to extend the range of sequence analyses beyond the single-gene scale. Consequently, the assembly is 16% larger and has 20% more genes, expanding the resources for a neglected yet very remarkable plant family such as Cactaceae. Species taxonomy Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliopsida; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; Caryophyllales; Cactineae; Cactaceae; Cactoideae; Echinocereeae; Carnegiea gigantea (Engelm.) Britton & Rose) (also known as saguaro cactus) (NCBI txid: 171969).


Origin and Diversification of the Saguaro Cactus (Carnegiea gigantea): A Within-Species Phylogenomic Analysis

March 2022

·

230 Reads

·

6 Citations

Systematic Biology

Reconstructing accurate historical relationships within a species poses numerous challenges, not least in many plant groups in which gene flow is high enough to extend well beyond species boundaries. Nonetheless, the extent of tree-like history within a species is an empirical question on which it is now possible to bring large amounts of genome sequence to bear. We assess phylogenetic structure across the geographic range of the saguaro cactus, an emblematic member of Cactaceae, a clade known for extensive hybridization and porous species boundaries. Using 200 Gb of whole genome resequencing data from 20 individuals sampled from 10 localities, we assembled two data sets comprising 150,000 biallelic single nucleotide polymorphisms (SNPs) from protein coding sequences. From these we inferred within-species trees and evaluated their significance and robustness using five qualitatively different inference methods. Despite the low sequence diversity, large census population sizes, and presence of wide-ranging pollen and seed dispersal agents, phylogenetic trees were well resolved and highly consistent across both data sets and all methods. We inferred that the most likely root, based on marginal likelihood comparisons, is to the east and south of the region of highest genetic diversity, which lies along the coast of the Gulf of California in Sonora, Mexico. Together with striking decreases in marginal likelihood found to the north, this supports hypotheses that saguaro's current range reflects post-glacial expansion from the refugia in the south of its range. We conclude with observations about practical and theoretical issues raised by phylogenomic data sets within species, in which SNP-based methods must be used rather than gene tree methods that are widely used when sequence divergence is higher. These include computational scalability, inference of gene flow, and proper assessment of statistical support in the presence of linkage effects.


FIG. 1.-Annotated plastomes of five Astragalus species. Gene order in A. agnicidus is consistent with a plastid genome having the 50-kb inversion (Doyle et al. 1996). A 7-kb inversion is found in A. calycosus (rbcL $ trnH-GUG; red), a 40-kb inversion is found in A. mollissimus (trnQ-UUG $ trnT-UGU; tan), and 7-kb inversions are found in both A. flexuosus and A. neglectus (trnL-CAA $ trnI-CAU; blue). Inversions identified from MUMmer (Marc¸ais et al. 2018) and progressiveMauve (Darling et al. 2010) alignments. The approximate locations of the large single-copy region (LSC), the region ancestrally duplicated as the inverted repeat but now present as a single copy only ("IR"), and the small single-copy region (SSC) are shown at the left. Plastome maps modified from the output of OGDraw (Greiner et al. 2019).
FIG. 2.-Plastome inversions and overdispersed repeats on maximum likelihood phylogram of 30 Astragalus taxa and Oxytropis bicolor from a concatenated alignment of locally colinear blocks (LCBs) identified using progressiveMauve (Darling et al. 2010). The tree is consistent with previous phylogenies of Astragalus at a higher level (Wojciechowski 2005; Scherson et al. 2008; Azani et al. 2019; Su et al. 2021) with Neo-Astragalus a wellsupported clade nested within Old World and euploid North American taxa. Bootstrap support values from 1,000 ultrafast bootstrap replicates are shown with color-coded circles. Branch lengths are in units of substitutions per site.
FIG. 3.-Phylogenetic context of plastome repeat content by category (A) and repeat density by position in plastome (B) for 30 Astragalus taxa and Oxytropis bicolor. Cladogram of maximum likelihood topology shown on left. Repeats are categorized by their occurrence among taxa based on Markov clustering. Repeat density in 3-kb sliding windows is averaged over 100-bp steps. The position of repeats colored by category is shown below the horizontal axis in repeat density plots. All plastomes rescaled to the same length. Inversion endpoint locations are shown in all taxa, and colored rectangles are present in taxa with inversions. All inversions were reverted before calculating repeat density.
FIG. 5.-Position of repeats and microhomologous sequences ancestrally (upper) and currently (lower) after plastome inversions in Astragalus calycosus (A), A. mollissimus (B), as well as A. flexuosus and A. neglectus (C). In each case, two longer repeats (Acaly1, Acaly2, etc.) that are now in inverted orientation at opposite ends of the inversion are inferred to have ancestrally been direct repeats on the same end of the inversion. Shorter microhomologous sequences (M c , m c , etc.) are inferred to have mediated the inversion and are found at the exact inversion endpoints. Different repeats and microhomologous sequences are implicated in the inversions between the same sets of loci in A. flexuosus and A. neglectus. Size and position of features not to scale.
FIG. 6.-Proposed sequence of events for initiation of plastome inversions resulting in short inverted repeats at both ends from ancestrally direct repeats near one endpoint (inspired by Mar echal and Brisson [2010]). MMBIR, microhomology-mediated break-induced replication; RDR, recombination-dependent replication.
Plastome Structural Evolution and Homoplastic Inversions in Neo-Astragalus (Fabaceae)

September 2021

·

157 Reads

·

20 Citations

Genome Biology and Evolution

The plastid genomes of photosynthetic green plants have largely maintained conserved gene content and order as well as structure over hundreds of millions of years of evolution. Several plant lineages, however, have departed from this conservation and contain many plastome structural rearrangements, which have been associated with an abundance of repeated sequences both overall and near rearrangement endpoints. We sequenced the plastomes of 25 taxa of Astragalus L. (Fabaceae), a large genus in the inverted repeat–lacking clade of legumes, to gain a greater understanding of the connection between repeats and plastome inversions. We found plastome repeat structure has a strong phylogenetic signal among these closely related taxa mostly in the New World clade of Astragalus called Neo-Astragalus. Taxa without inversions also do not differ substantially in their overall repeat structure from four taxa each with one large-scale inversion. For two taxa with inversion endpoints between the same pairs of genes, differences in their exact endpoints indicate the inversions occurred independently. Our proposed mechanism for inversion formation suggests the short inverted repeats now found near the endpoints of the four inversions may be there as a result of these inversions rather than their cause. The longer inverted repeats now near endpoints may have allowed the inversions first mediated by shorter microhomologous sequences to propagate, something that should be considered in explaining how any plastome rearrangement becomes fixed regardless of the mechanism of initial formation.


Accuracy in Near-Perfect Virus Phylogenies

August 2021

·

35 Reads

·

13 Citations

Systematic Biology

Phylogenetic trees from real-world data often include short edges with very few substitutions per site, which can lead to partially resolved trees and poor accuracy. Theory indicates that the number of sites needed to accurately reconstruct a fully resolved tree grows at a rate proportional to the inverse square of the length of the shortest edge. However, when inferred trees are partially resolved due to short edges, “accuracy” should be defined as the rate of discovering false splits (clades on a rooted tree) relative to the actual number found. Thus, accuracy can be high even if short edges are common. Specifically, in a “near-perfect” parameter space in which trees are large, the tree length ξ (the sum of all edge lengths) is small, and rate variation is minimal, the expected false positive rate is less than ξ∕3; the exact value depends on tree shape and sequence length. This expected false positive rate is far below the false negative rate for small ξ and often well below 5% even when some assumptions are relaxed. We show this result analytically for maximum parsimony and explore its extension to maximum likelihood using theory and simulations. For hypothesis testing, we show that measures of split “support” that rely on bootstrap resampling consistently imply weaker support than that implied by the false positive rates in near-perfect trees. The near-perfect parameter space closely fits several empirical studies of human virus diversification during outbreaks and epidemics, including Ebolavirus, Zika virus, and SARS-CoV-2, reflecting low substitution rates relative to high transmission/sampling rates in these viruses.


Accuracy in near-perfect virus phylogenies

May 2021

·

14 Reads

·

1 Citation

A bstract Phylogenetic trees from real-world data often include short edges with very few substitutions per site, which can lead to partially resolved trees and poor accuracy. Theory indicates that the number of sites needed to accurately reconstruct a fully resolved tree grows at a rate proportional to the inverse square of the length of the shortest edge. However, when inferred trees are partially resolved due to short edges, “accuracy” should be defined as the rate of discovering false splits (clades on a rooted tree) relative to the actual number found. Thus, accuracy can be high even if short edges are common. Specifically, in a “near-perfect” parameter space in which trees are large, the tree length ξ (the sum of all edge lengths), is small, and rate variation is minimal, the expected false positive rate is less than ξ /3; the exact value depends on tree shape and sequence length. This expected false positive rate is far below the false negative rate for small ξ and often well below 5% even when some assumptions are relaxed. We show this result analytically for maximum parsimony and explore its extension to maximum likelihood using theory and simulations. For hypothesis testing, we show that measures of split “support” that rely on bootstrap resampling consistently imply weaker support than that implied by the false positive rates in near-perfect trees. The near-perfect parameter space closely fits several empirical studies of human virus diversification during outbreaks and epidemics, including Ebolavirus, Zika virus, and SARS-CoV-2, reflecting low substitution rates relative to high transmission/sampling rates in these viruses.


Intercellular Ca2+ Waves: Mechanisms of Initiation and Propagation

January 2021

·

28 Reads

·

1 Citation

Coordination of cellular activity is vital to multicellular function and intercellular Ca²⁺ waves may serve as a signaling mechanism to coordinate localized cellular responses. Intercellular Ca²⁺ waves occur in a diversity of cell types and are initiated by a variety of stimuli. These Ca²⁺ waves propagate by the diffusion of an internal or external messenger. The major internal second messengers are inositol trisphosphate (IP3) and Ca²⁺, which reach adjacent cells via gap junctions to propagate Ca²⁺ waves by Ca²⁺-induced Ca²⁺ release from the endoplasmic reticulum via IP3 or ryanodine receptors. External messengers, including adenosine triphosphate and glutamate, are released by vesicular discharge or via hemi-channels of gap junctions or pannexins to stimulate G-protein-coupled receptors of nearby cells to initiate intracellular Ca²⁺ increases. Ca²⁺ waves are limited in size when the messenger is generated only by the initiating cell but more extensive when messenger production is regenerative in participating cells.


A new (old) approach to genotype-based phylogenomic inference within species, with an example from the saguaro cactus (Carnegiea gigantea)

June 2020

·

286 Reads

A bstract Genome sequence data are routinely being used to infer phylogenetic history within and between closely related diploid species, but few tree inference methods are specifically tailored to diploid genotype data. Here we re-examine the method of “polymorphism parsimony” (Inger 1967; Farris 1978; Felsenstein 1979), originally introduced to study morphological characters and chromosome inversion polymorphisms, to evaluate its utility for unphased diploid genotype data in large scale phylogenomic data sets. We show that it is equivalent to inferring species trees by minimizing deep coalescences—assuming an infinite sites model. Two potential advantages of this approach are scalability and estimation of a rooted tree. As with some other single nucleotide polymorphism (SNP) based methods, it requires thinning of data sets to statistically independent sites, and we describe a genotype-based test for phylogenetic independence. To evaluate this approach in genome scale data, we construct intraspecific phylogenies for 10 populations of the saguaro cactus using 200 Gbp of resequencing data, and then use these methods to test whether the population with highest genetic diversity corresponds to the root of the genotype trees. Results were highly congruent with the (unrooted) trees obtained using SVDquartets, a scalable alternative method of phylogenomic inference.


Fig. 1. Illustration of two subtree operations, retaining leaf label set Xg = {A, C, D, F }: a) Original species tree, T ; b) homomorphic subtree, T |X g ; c) restriction subtree, T ||X g .
Fig. 4. Reconciliation of a gene tree, g, and species tree, S. Note the set of "missing leaves" on g, L S\g = {A, E, F }. a) The MRCA mapping, M, is shown for all internal nodes of g. Nodes annotated as duplications on g indicated by diamonds; nodes annotated as speciation indicated by dots. b) Imbedding of g in S. For clarity, only the duplication nodes on g are shown as annotated.
Fig. 5. Optimal completion under the duplication-loss costs. Trees are the same as in Fig. 4. Dashed border for some species tree edges indicates parts of S that are missing from g because of absence of leaves A, E, F . Dashed circles are unary nodes on the restriction subtree, S||g. Dashed edges of g represent the optimal completion of g that allows leaves of g to be "present" at leaves of S, while adding as few duplications and losses as possible to the reconciliation. Here, no duplications or losses are added by this completion. Note that for the unary node at left, two edges of g traverse it and two subtrees are added, whereas for the unary node at right having one imbedded edge, only one subtree need be added.
Fig. 6. Optimal completion under the deep coalescence cost. Layout is the same as in Fig. 5. Here the optimal completion must add as few "extra edges" as possible (i.e., over and above one edge of g per edge of S), regardless of how many losses are then required. Note the loss of one edge of g immediately after its split within the unary node of S at lower left. Contrast with Fig. 5.
Fig. 7. Special case in which presence of gene at the root of the species tree is ambiguous for incompleteness due to deletion. a) Incomplete gene tree and species tree with MRCA mapping (M) from root of g shown. b) Imbedding of g in S leaving uncertainty toward the root of S. Dashed lines represent presence of gene if it is forced present at root of S and location of two losses entailed as a consequence.
Terraces in Gene Tree Reconciliation-Based Species Tree Inference

April 2020

·

174 Reads

·

1 Citation

A bstract Terraces in phylogenetic tree space are sets of trees with identical optimality scores for a given data set, arising from missing data. These were first described for multilocus phylogenetic data sets in the context of maximum parsimony inference and maximum likelihood inference under certain model assumptions. Here we show how the mathematical properties that lead to terraces extend to gene tree - species tree problems in which the gene trees are incomplete. Inference of species trees from either sets of gene family trees subject to duplication and loss, or allele trees subject to incomplete lineage sorting, can exhibit terraces in their solution space. First, we show conditions that lead to a new kind of terrace, which stems from subtree operations that appear in reconciliation problems for incomplete trees. Then we characterize when terraces of both types can occur when the optimality criterion for tree search is based on duplication, loss or deep coalescence scores. Finally, we examine the impact of assumptions about the causes of losses: whether they are due to imperfect sampling or true evolutionary deletion.



Citations (73)


... Airways in PCLS from Balb/c mice contract more effectively and have exhibited faster ASM Ca 2+ oscillations in response to contraction-inducing mediators than those from C57BL/6 mice. These changes correspond with higher expression of STIM1 and Orai1 and increased AHR in asthma models (48). Type 2 cytokines IL-4 and IL-13 also induce Orai1 expression in HASM and in vivo in rodent asthma models (Table 1) (49,50). ...

Reference:

Targeting cytoskeletal biomechanics to modulate airway smooth muscle contraction in asthma
Inherent differences of small airway contraction and Ca oscillations in airway smooth muscle cells between BALB/c and C57BL/6 mouse strains

... These findings indicate the possibility of range expansion from several refugia with higher diversity found in presumable pockets of suitable habit from where, once climatic conditions improved, palms colonized new areas with leading populations presenting lower diversity (Eckert et al., 2008;Peter and Slatkin, 2013). Several plant species show latitudinal changes in genetic diversity, which have been linked to postglacial range shifts, such as northward expansion along the BCP and mainland Sonora in columnar cacti (Nason et al., 2002;Clark-Tapia and Molina-Freaner, 2003;Sanderson et al., 2022), whereas southward directionality during colonization was found in Yucca schidigera (De la Rosa-Conroy et al., 2019). Other species present more complex scenarios, such as several expansion events from different refugia for Euphorbia lomelii (Garrick et al., 2009) and Encelia farinosa (Fehlberg and Ranker, 2009) or both southward and northward expansion from a single refugium for Agave aurea (Klimova et al., 2024). ...

Origin and Diversification of the Saguaro Cactus (Carnegiea gigantea): A Within-Species Phylogenomic Analysis
  • Citing Article
  • March 2022

Systematic Biology

... Even after such a long-standing phylogenetic exploration by sampling taxa as densely as possible, with data resources ranging from single plastomic markers (e.g., rbcL [8,19] and matK [4,[11][12][13][20][21][22][23]) to multigene [14,24,25] or whole plastomes [24], our current insights into Papilionoideae phylogenies remain a progress report. Another topical issue is the extensive plastomic rearrangements embodied by this subfamily, such as large-scale inversions [26,27], and disparities in gene order [28][29][30] and gene/intron content [31][32][33]. Notably, one of the most quintessential examples is the broad loss of one inverted repeat (IR), which is shared by a robustly monophyletic clade designated as the "IR-lacking clade" or IRLC [34]. ...

Plastome Structural Evolution and Homoplastic Inversions in Neo-Astragalus (Fabaceae)

Genome Biology and Evolution

... Sequences of non-structural proteins from ORF1a accumulated multiple substitutions without evidence of diversifying selection pressure in these sites. Considering the limitations related to a subsampled global phylogeny analysis, it is not possible to know if the absence of well-supported monophyletic groups is due to the missing branches and nodes representing the most related sequences and their ancestors or some technical limitation inherent to the application of the branch support statistical tests to the SARS-CoV-2 genomic sequences, since bootstrapping approaches require multiple sites supporting a clade to infer strong support value in near-perfect trees (60). In fact, SARS-CoV-2 genomes present a low number of informative sites, which may generate topology with low statistical support and ambiguous clustering of large data sets (61). ...

Accuracy in near-perfect virus phylogenies
  • Citing Preprint
  • May 2021

... Advanced techniques are emerging to record biodiversity data, e.g. artificial intelligence-assisted remote sensing (Horning et al. 2010), automatic species identification (Barré et al. 2017), DNA barcoding (Hebert et al. 2003), compilations of phylogenetic information (Antonelli et al. 2014), non-invasive methods for individual identification, crowdsourcing for species distribution data and sampling approaches for remote and low-technology areas, including traditional knowledge from indigenous communities (Pimm et al. 2015). Species can increasingly be fitted with data loggers that track their movements and activities since these devices shrink in size due to technological advancements (Allen and Singh 2016). ...

SUPERSMART: ecology and evolution in the era of big data
  • Citing Preprint
  • September 2014

... Contemporary datasets often comprise hundreds of species and hundreds of loci, combining organisms from distant taxonomic groups, and well-studied species, whose genomes were sequenced completely, together with non-model organisms, for which only a handful of loci are available. As a result, such diverse datasets often exhibit missing 5 data, i.e. for some species, sequences for some loci are not available, either as a consequence of species-specific gene losses/acquisitions or incompletely sequenced genomes. The availability of genetic sequences is summarised in a species per locus presence-absence matrix with 1's or 0's indicating presence or absence of a sequence, respectively (e.g., Fig. 1A). ...

Terraces in Gene Tree Reconciliation-Based Species Tree Inference

... Notably, polyploidy has left its mark on the evolutionary trajectory of numerous angiosperm species, with a strikingly high frequency of 1 in every 100,000 plants experiencing polyploidization, particularly prevalent in flowering plants (Doyle and Coate, 2019). Polyploidy proves detrimental in animal cells, plant cells typically tolerate it well, often resulting in larger, hardier, and faster-growing fruits and seeds, thereby enhancing desirability (Ferera and Dahanayake, 2014;Yetman et al., 2020). Consequently, ploidy manipulation has emerged as a valuable tool in the genetic enhancement of various plant species, including economically significant crops such as Cotton and Sugarcane (Trojak-Goluch et al., 2021;Li and Iqbal, 2024). ...

The Saguaro Cactus: A Natural History
  • Citing Book
  • February 2020

... The generality of these effects remains to be tested with a broader sample of datasets.Loci subsampling has become a standard approach in phylogenomic dating, yet again, the effects introduced by each of the numerous available approaches have not been fully characterized.While some authors have advocated for the selection of clock-like genes in the hopes that reduced lineage-specific rate variation will improve model fit and diminish bias, others have favoured loci with less topological conflict, as these are expected to harbour higher phylogenetic signal and be less affected by incomplete lineage sorting (ILS). Simulations have confirmed that both of these subsampling schemes result in more accurate divergence times(Carruthers et al., 2020(Carruthers et al., , 2022Smith et al., 2018). Other sensible options include selecting genes based on phylogenetic usefulness (as estimated using genesortR;, which should balance both aforementioned strategies by discovering loci with increased clock-likeness and decreased topological incongruence; and maximizing occupancy, increasing the amount of information contained within a matrix of a given size.Our results show that locus choice is a major factor determining the divergence times obtained and that alternative schemes impart characteristic signatures on inferred chronograms. ...

The Implications of Lineage-Specific Rates for Divergence Time Estimation

Systematic Biology

... However, overexpression HpDODA2 and supplementation with L-DOPA substrate in Nicotiana benthamiana and Arabidopsis thaliana did not result in any altered pigment phenotype [50]. Therefore, the researchers infer that the 4, 5-dopa dioxygenase activity influenced betalain synthesis [51]. HpDODA2 may not have or easily achieved the enzymatic activity for the synthesis of betalains [50]. ...

Evolution of l‐DOPA 4,5‐dioxygenase activity allows for recurrent specialisation to betalain pigmentation in Caryophyllales