
Xing-Xing Shen- PhD
- Professor at Zhejiang University
Xing-Xing Shen
- PhD
- Professor at Zhejiang University
About
122
Publications
38,555
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
4,561
Citations
Introduction
Skills and Expertise
Current institution
Additional affiliations
December 2019 - present
Publications
Publications (122)
Phylogenomic studies have resolved countless branches of the tree of life, but remain strongly contradictory on certain, contentious relationships. Here, we use a maximum likelihood framework to quantify the distribution of phylogenetic signal among genes and sites for 17 contentious branches and 6 well-established control branches in plant, animal...
Budding yeasts (subphylum Saccharomycotina) are found in every biome and are as genetically diverse as plants or animals. To understand budding yeast evolution, we analyzed the genomes of 332 yeast species, including 220 newly sequenced ones, which represent nearly one-third of all known budding yeast diversity. Here, we establish a robust genus-le...
Phylogenetic trees are essential for studying biology, but their reproducibility under identical parameter settings remains unexplored. Here, we find that 3515 (18.11%) IQ-TREE-inferred and 1813 (9.34%) RAxML-NG-inferred maximum likelihood (ML) gene trees are topologically irreproducible when executing two replicates (Run1 and Run2) for each of 19,...
Horizontal gene transfer (HGT) is an important evolutionary force shaping prokaryotic and eukaryotic genomes. HGT-acquired genes have been sporadically reported in insects, a lineage containing >50% of animals. We systematically examined HGT in 218 high-quality genomes of diverse insects and found that they acquired 1,410 genes exhibiting diverse f...
Oxidative phosphorylation, essential for energy metabolism and linked to the regulation of longevity, involves mitochondrial and nuclear genes. The functions of these genes and their evolutionary rate covariation (ERC) have been extensively studied, but little is known about whether other nuclear genes not targeted to mitochondria evolutionarily an...
Mutator phenotypes are short-lived due to the rapid accumulation of deleterious mutations. Yet, recent observations reveal that certain fungi can undergo prolonged accelerated evolution after losing DNA repair genes. Here, we surveyed 1,154 yeast genomes representing nearly all known yeast species of the subphylum Saccharomycotina to examine the re...
The Saccharomycotina fungi have evolved to inhabit a vast diversity of habitats over their 400-million-year evolution. There are, however, only a few known fungal pathogens of plants in this subphylum, primarily belonging to the genera Eremothecium and Geotrichum. We compared the genomes of 12 plant-pathogenic Saccharomycotina strains to 360 plant-...
Multiple sequence alignments and phylogenetic trees are rich in biological information and are fundamental to research in biology. PhyKIT is a tool for processing and analyzing the information content of multiple sequence alignments and phylogenetic trees. Here, we describe how to use PhyKIT for diverse analyses, including (i) constructing a phylog...
Many distantly related organisms have convergently evolved traits and lifestyles that enable them to live in similar ecological environments. However, the extent of phenotypic convergence evolving through the same or distinct genetic trajectories remains an open question. Here, we leverage a comprehensive dataset of genomic and phenotypic data from...
Yeasts in the subphylum Saccharomycotina are found across the globe in disparate ecosystems. A major aim of yeast research is to understand the diversity and evolution of ecological traits, such as carbon metabolic breadth, insect association, and cactophily. This includes studying aspects of ecological traits like genetic architecture or associati...
Ancient divergences within Opisthokonta—a major lineage that includes organisms in the kingdoms Animalia, Fungi, and their unicellular relatives—remain contentious. To assess progress toward a genome-scale Opisthokonta phylogeny, we conducted the most taxon rich phylogenomic analysis using sets of genes inferred with different orthology inference m...
Codon usage bias, or the unequal use of synonymous codons, is observed across genes, genomes, and between species. It has been implicated in many cellular functions, such as translation dynamics and transcript stability, but can also be shaped by neutral forces. We characterized codon usage across 1,154 strains from 1,051 species from the fungal su...
Horizontal gene transfer (HGT) is a major driving force in the evolution of prokaryotic and eukaryotic genomes. Despite recent advances in distribution and ecological importance, the extensive pattern, especially in seed plants, and post‐transfer adaptation of HGT‐acquired genes in land plants remain elusive.
We systematically identified 1150 forei...
Many remarkable innovations have repeatedly occurred across vast evolutionary distances. When convergent traits emerge on the tree of life, they are sometimes driven by the same underlying gene families, while other times many different gene families are involved. Conversely, a gene family may be repeatedly recruited for a single trait or many diff...
Maximum likelihood (ML) phylogenetic inference is widely used in phylogenomics. As heuristic searches most likely find suboptimal trees, it is recommended to conduct multiple (e.g., ten) tree searches in phylogenetic analyses. However, beyond its positive role, how and to what extent multiple tree searches aid ML phylogenetic inference remains poor...
Gene gains and losses are a major driver of genome evolution; their precise characterization can provide insights into the origin and diversification of major lineages. Here, we examined gene family evolution of 1,154 genomes from nearly all known species in the medically and technologically important yeast subphylum Saccharomycotina. We found that...
Codon usage bias, or the unequal use of synonymous codons, is observed across genes, genomes, and between species. The biased use of synonymous codons has been implicated in many cellular functions, such as translation dynamics and transcript stability, but can also be shaped by neutral forces. The Saccharomycotina, the fungal subphylum containing...
Organisms exhibit extensive variation in ecological niche breadth, from very narrow (specialists) to very broad (generalists). Two general paradigms have been proposed to explain this variation: (i) trade-offs between performance efficiency and breadth and (ii) the joint influence of extrinsic (environmental) and intrinsic (genomic) factors. We ass...
How genomic differences contribute to phenotypic differences is a major question in biology. The recently characterized genomes, isolation environments, and qualitative patterns of growth on 122 sources and conditions of 1,154 strains from 1,049 fungal species (nearly all known) in the yeast subphylum Saccharomycotina provide a powerful, yet comple...
Siderophores are crucial for iron-scavenging in microorganisms. While many yeasts can uptake siderophores produced by other organisms, they are typically unable to synthesize siderophores themselves. In contrast, Wickerhamiella/Starmerella (W/S) clade yeasts gained the capacity to make the siderophore enterobactin following the remarkable horizonta...
The Saccharomycotina yeasts (“yeasts” hereafter) are a fungal clade of scientific, economic, and medical significance. Yeasts are highly ecologically diverse, found across a broad range of environments in every biome and continent on earth; however, little is known about what rules govern the macroecology of yeast species and their range limits in...
Eukaryotes have evolved sophisticated post‐translational modifications to regulate protein function and numerous biological processes, including ubiquitination controlled by the coordinated action of ubiquitin‐conjugating enzymes and deubiquitinating enzymes (Dubs). However, the function of deubiquitination in pathogenic fungi is largely unknown.
H...
PhyloFisher is a software package written primarily in Python3 that can be used for the creation, analysis, and visualization of phylogenomic datasets that consist of protein sequences from eukaryotic organisms. Unlike many existing phylogenomic pipelines, PhyloFisher comes with a manually curated database of 240 protein‐coding genes, a subset of a...
MicroRNAs (miRNAs) are a class of non-protein-coding short transcripts that provide a layer of post-transcriptional regulation essential to many plant biological processes. MiR858, which targets the transcripts of MYB transcription factors, can affect a range of secondary metabolic processes. Although miR858 and its 187-nt precursor have been well...
Siderophores are crucial for iron-scavenging in microorganisms. While many yeasts can uptake siderophores produced by other organisms, they are typically unable to synthesize siderophores themselves. In contrast, Wickerhamiella/Starmerella (W/S) clade yeasts gained the capacity to make the siderophore enterobactin following the remarkable horizonta...
Ancient divergences within Opisthokonta—a major lineage that includes organisms in the kingdoms Animalia, Fungi, and their unicellular relatives— remain contentious, hindering investigations of the evolutionary processes that gave rise to two kingdoms and the repeated emergence of iconic phenotypes like multicellularity. Here, we use genome-scale a...
Many distantly related organisms have convergently evolved traits and lifestyles that enable them to live in similar ecological environments. However, the extent of phenotypic convergence evolving through the same or distinct genetic trajectories remains an open question. Here, we leverage a comprehensive dataset of genomic and phenotypic data from...
The Saccharomycotina yeasts ("yeasts" hereafter) are a fungal clade of scientific, economic, and medical significance. Yeasts are highly ecologically diverse, found across a broad range of environments in every biome and continent on earth; however, little is known about what rules govern the macroecology of yeast species and their range limits in...
Ascochyta blight is a fungal disease affecting peas, causing significant damage to the plant and reducing crop yield. Host‒pathogen interactions can inform disease prevention and control strategies but remain poorly understood. Here, we generate a near-chromosome-level assembly for Didymella pinodella HNA18, a pathogenic fungus
that causes pea asco...
How genomic differences contribute to phenotypic differences across species is a major question in biology. The recently characterized genomes, isolation environments, and qualitative patterns of growth on 122 sources and conditions of 1,154 strains from 1,049 fungal species (nearly all known) in the subphylum Saccharomycotina provide a powerful, y...
Genome-scale data and the development of novel statistical phylogenetic approaches have greatly aided the reconstruction of a broad sketch of the tree of life and resolved many of its branches. However, incongruence - the inference of conflicting evolutionary histories - remains pervasive in phylogenomic data, hampering our ability to reconstruct a...
Organisms exhibit extensive variation in ecological niche breadth, from very narrow (specialists) to very broad (generalists). Paradigms proposed to explain this variation either invoke trade-offs between performance efficiency and breadth or underlying intrinsic or extrinsic factors. We assembled genomic (1,154 yeast strains from 1,049 species), m...
The subphylum Saccharomycotina is a lineage in the fungal phylum Ascomycota that exhibits levels of genomic diversity similar to those of plants and animals. The Saccharomycotina consist of more than 1 200 known species currently divided into 16 families, one order, and one class. Species in this subphylum are ecologically and metabolically diverse...
Wing polyphenism is found in a variety of insects and offers an attractive model system for studying the evolutionary significance of dispersal. The Forkhead box O (FoxO) transcription factor (TF) acts as a wing-morph switch that directs wing buds developing into long-winged (LW) or short-winged morphs in wing-dimorphic planthoppers, yet the regula...
As the most diverse group of animals on Earth, insects are key organisms in ecosystems. Horizontal gene transfer (HGT) refers to the transfer of genetic material between species by non‐reproductive means. HGT is a major evolutionary force in prokaryotic genome evolution, but its importance in different eukaryotic groups, such as insects, has only r...
Examination of the changes in order and arrangement of homologous genes is key for understanding the mechanisms of genome evolution in eukaryotes. Previous comparisons between eukaryotic genomes have revealed considerable conservation across species that diverged hundreds of millions of years ago (e.g., vertebrates,1, 2, 3 bilaterian animals,⁴,⁵ an...
Molecular evolution studies, such as phylogenomic studies and genome-wide surveys of selection, often rely on gene families of single-copy orthologs (SC-OGs). Large gene families with multiple homologs in 1 or more species—a phenomenon observed among several important families of genes such as transporters and transcription factors—are often ignore...
Plant-pathogenic fungi form intimate interactions with their associated bacterial microbiota during their entire life cycle. However, little is known about the structure, functions and interaction mechanisms of bacterial communities associated with fungal fruiting bodies (perithecia). Here we examined the bacterial microbiome of perithecia formed b...
Peroxisomes are universal eukaryotic organelles essential to plants and animals. Most peroxisomal matrix proteins carry Peroxisome Targeting Signal type 1 (PTS1), a C‐terminal tripeptide. Studies from various kingdoms have revealed influences from sequence upstream of the tripeptide on peroxisome targeting, supporting the view that positive charges...
Molecular evolution studies, such as phylogenomic studies and genome-wide surveys of positive selection, often rely on gene families of single-copy orthologs (SC-OGs). In contrast, large gene families with multiple homologs in one or more species - a phenomenon observed among several important families of genes such as transporters and transcriptio...
The in situ detection of the damaged aluminum alloy plate with bolted patch is numerically and experimentally studied based on a surface-mounted piezoelectric ceramic lead zirconate titanate (PZT) elements. The contact effects between the bolts, repair patch and host structure are considered. The convergence analysis, range-frequency analysis and p...
The DNA mismatch repair (MMR) pathway corrects mismatched bases produced during DNA replication and is highly conserved across the tree of life, reflecting its fundamental importance for genome integrity. Loss of function in one or a few MMR genes can lead to increased mutation rates and microsatellite instability, as seen in some human cancers. Wh...
Control of the boundary layer upstream and in the region of the engine inlet is an important consideration for airplane designers. One important aspect of this problem can be seen in the performance analysis of submerged inlets for which the design is based on boundary layer ingestion. With a large offset between the flush entrance and the compress...
In order to study the working mechanism of piezoelectric composite cantilever beam and improve the ability of predicting the power generation performance of piezoelectric composite cantilever vibration energy harvester (CVEH), the simple and high-precision analysis models of equivalent mass and equivalent stiffness of CVEH are proposed. The natural...
Phylogenomic analyses of hundreds of protein-coding genes aimed at resolving phylogenetic relationships is now a common practice. However, no software currently exists that includes tools for dataset construction and subsequent analysis with diverse validation strategies to assess robustness. Furthermore, there are no publicly available high-qualit...
Identifying our most distant animal relatives has emerged as one of the most challenging problems in phylogenetics. This debate has major implications for our understanding of the origin of multicellular animals and of the earliest events in animal evolution, including the origin of the nervous system. Some analyses identify sponges as our most dis...
The DNA mismatch repair (MMR) pathway corrects mismatched bases produced during DNA replication and is highly conserved across the tree of life, reflecting its fundamental importance for genome integrity. Loss of function in one or a few MMR genes can lead to increased mutation rates and microsatellite instability, as seen in some human cancers. Wh...
This study describes the aerodynamic efficiency of a forebody–inlet configuration and computational investigation of a drone system, capable of sustainable supersonic cruising at Mach 1.60. Because the whole drone configuration is formed around the induction system and the design is highly interrelated to the flow structure of forebody and inlet ef...
Topological conflict or incongruence is widespread in phylogenomic data. Concatenation- and coalescent-based approaches often result in incongruent topologies, but the causes of this conflict can be difficult to characterize. We examined incongruence stemming from conflict between likelihood-based signal (quantified by the difference in gene-wise l...
Dollo’s law posits that evolutionary losses are irreversible, thereby narrowing the potential paths of evolutionary change. While phenotypic reversals to ancestral states have been observed, little is known about their underlying genetic causes. The genomes of budding yeasts have been shaped by extensive reductive evolution, such as reduced genome...
Motivation:
Diverse disciplines in biology process and analyze multiple sequence alignments (MSAs) and phylogenetic trees to evaluate their information content, infer evolutionary events and processes, and predict gene function. However, automated processing of MSAs and trees remains a challenge due to the lack of a unified toolkit. To fill this g...
In this paper, the temperature memory effect (TME) in a commercial ethylene‐vinyl acetate copolymer (EVA) is characterized via differential scanning calorimetry (DSC) tests. Three temperatures, which are 35, 60, and 85°C representing temperatures below glass transition (Tg), within Tg and within melting (Tm), respectively, are included for the inve...
Phylogenomic studies using genome-scale amounts of data have greatly improved understanding of the tree of life. Despite the diversity, ecological significance, and biomedical and industrial importance of fungi, evolutionary relationships among several major lineages remain poorly resolved, especially those near the base of the fungal phylogeny. To...
The lever-type mechanism is one of the most widely used compliant mechanisms in precision engineering applications. To achieve large amplification ratios, the lever-type mechanisms are often connected in series. However, overstress during the deformation process of the mechanism appears frequently, resulting in the undesired deformation and the dis...
Highly divergent sites in multiple sequence alignments (MSAs), which can stem from erroneous inference of homology and saturation of substitutions, are thought to negatively impact phylogenetic inference. Thus, several different trimming strategies have been developed for identifying and removing these sites prior to phylogenetic inference. However...
used the alignment-free feature frequency profile (FFP) method to reconstruct a broad sketch of the tree of life (ToL). The FFP tree reports many relationships that strongly contradict the current consensus view of the ToL, including sister group relationships for plants + animals, Bacteria + Archaea, and Mollusca (incorrectly referred to as cnidar...
Ascomycota, the largest and most well-studied phylum of fungi, contains three subphyla: Saccharomycotina (budding yeasts), Pezizomycotina (filamentous fungi), and Taphrinomycotina (fission yeasts). Despite its importance, we lack a comprehensive genome-scale phylogeny or understanding of the similarities and differences in the mode of genome evolut...
15 There has been considerable debate about the placement of the root in the animal tree of life, which has 16 emerged as one of the most challenging problems in animal phylogenetics. This debate has major implications 17 for our understanding of the earliest events in animal evolution, including the origin of the nervous system. 18 Some phylogenet...
Diverse disciplines in biology process and analyze multiple sequence alignments (MSAs) and phylogenetic trees to evaluate their information content, infer evolutionary events and processes, and predict gene function. However, automated processing of MSAs and trees remains a challenge due to the lack of a unified toolkit. To fill this gap, we introd...
In wind tunnel tests, the cantilever sting is usually used to support aircraft models because of its simple structure and low aerodynamic interference. However, in some special conditions, big-amplitude and low-frequency vibration would occur easily on the model not only in the pitch direction but also in the yaw direction, resulting in inaccurate...
In this paper, a thermo-responsive shape memory effect in a polyvinyl chloride thermoset foam is characterized. Excellent shape recovery performance is observed in foam samples programmed both at room temperature and above their transition temperature. The conversion of porous structures in the foam from closed-cell to open-cell after a shape memor...
Phylogenomic studies based on genome-scale amounts of data have greatly improved understanding of the tree of life. Despite their diversity, ecological significance, and biomedical and industrial importance, large-scale phylogenomic studies of Fungi are lacking. Furthermore, several evolutionary relationships among major fungal lineages remain cont...
Dollo's law posits that evolutionary losses are irreversible, thereby narrowing the potential paths of evolutionary change. While phenotypic reversals to ancestral states have been observed, little is known about their underlying genetic causes. The genomes of budding yeasts have been shaped by extensive reductive evolution, such as reduced genome...
Choi and Kim (PNAS, 117: 3678-3686; first published February 4, 2020; https://doi.org/10.1073/pnas.1915766117) used the alignment-free Feature Frequency Profile (FFP) method to reconstruct a broad sketch of the tree of life based on proteome data from 4,023 taxa. The FFP-based reconstruction reports many relationships that strongly contradict the c...
Highly divergent sites in multiple sequence alignments, which stem from erroneous inference of homology and saturation of substitutions, are thought to negatively impact phylogenetic inference. Trimming methods aim to remove these sites before phylogenetic inference, but recent analysis suggests that doing so can worsen inference. We introduce Clip...
Gene regulatory networks (GRNs) drive developmental and cellular differentiation, and variation in their architectures gives rise to morphological diversity. Pioneering studies in Aspergillus fungi, coupled with subsequent work in other filamentous fungi, have shown that the GRN governed by the BrlA, AbaA, and WetA proteins controls the development...
Ascomycota, the largest and best-studied phylum of fungi, contains three subphyla: Saccharomycotina (budding yeasts), Pezizomycotina (filamentous fungi), and Taphrinomycotina (fission yeasts); organisms from all three subphyla have been invaluable as models in diverse fields (e.g., biotechnology, cell biology, genetics, and medicine). Despite its i...
The genomic data revolution has enabled biologists to develop innovative ways to infer key episodes in the history of life. Whether genome-scale data will eventually resolve all branches of the Tree of Life remains uncertain. However, through novel means of interrogating data, some explanations for why evolutionary relationships remain recalcitrant...
Angiosperms represent one of the most spectacular terrestrial radiations on the planet¹, but their early diversification and phylogenetic relationships remain uncertain2–5. A key reason for this impasse is the paucity of complete genomes representing early-diverging angiosperms. Here, we present high-quality, chromosomal-level genome assemblies of...
Gene regulatory networks (GRNs) drive developmental and cellular differentiation, and variation in their architectures gives rise to morphological diversity. Pioneering studies in Aspergillus fungi, coupled with subsequent work in other filamentous fungi, have shown that the GRN governed by the BrlA, AbaA, and WetA proteins controls the development...
Immunity genes have repeatedly experienced natural selection during mammalian evolution. Galectins are carbohydrate-binding proteins that regulate diverse immune responses, including maternal-fetal immune tolerance in placental pregnancy. Seven human galectins, four conserved across vertebrates and three specific to primates, are involved in placen...
Cell type in budding yeasts is determined by the genotype at the mating-type (MAT) locus, but yeast species differ widely in their mating compatibility systems and life cycles. Among sexual yeasts, heterothallic species are those in which haploid strains fall into two distinct and stable mating types (MATa and MATα), whereas homothallic species are...
The tangerine pathotype of the ascomycete fungus Alternaria alternata is the causal agent of citrus brown spot, which can result in significant losses of both yield and marketability for tangerines worldwide. A conditionally dispensable chromosome (CDC), which harbours the host‐selective ACT toxin gene cluster, is required for tangerine pathogenici...
The filamentous fungal family Aspergillaceae contains >1,000 known species, mostly in the genera Aspergillus and Penicillium. Several species are used in the food, biotechnology, and drug industries (e.g., Aspergillus oryzae and Penicillium camemberti), while others are dangerous human and plant pathogens (e.g., Aspergillus fumigatus and Penicilliu...
Cell-cycle checkpoints and DNA repair processes protect organisms from potentially lethal mutational damage. Compared to other budding yeasts in the subphylum Saccharomycotina, we noticed that a lineage in the genus Hanseniaspora exhibited very high evolutionary rates, low Guanine–Cytosine (GC) content, small genome sizes, and lower gene numbers. T...
The structural health monitoring system with electromechanical impedance technique is studied numerically and experimentally in this article. A three-dimensional numerical model is developed to account for the electromechanical coupling effect between the structures and the piezoelectric ceramic lead zirconate titanate transducers, and it is verifi...
Cell cycle checkpoints and DNA repair processes protect organisms from potentially lethal mutational damage. Compared to other budding yeasts in the subphylum Saccharomycotina, we noticed that a lineage in the genus Hanseniaspora exhibited very high evolutionary rates, low GC content, small genome sizes, and lower gene numbers. To better understand...
Operons are a hallmark of bacterial genomes, where they allow concerted expression of functionally related genes as single polycistronic transcripts. They are rare in eukaryotes, where each gene usually drives expression of its own independent messenger RNAs. Here, we report the horizontal operon transfer of a siderophore biosynthesis pathway from...
Immunity genes have repeatedly experienced natural selection during mammalian evolution. Galectins are carbohydrate-binding proteins that regulate diverse immune responses, including maternal-fetal immune tolerance in placental pregnancy. Seven human galectins, four conserved across vertebrates and three specific to primates, are involved in placen...
Significance
Evolutionary and comparative genomics, combined with reverse genetics, have the power to identify and characterize new biology. Here, we use these approaches in several nontraditional model species of budding yeasts to characterize a budding yeast secondary metabolite gene cluster, a set of genes responsible for production and reutiliz...
DNA glycosylases remove aberrant DNA nucleobases as the first enzymatic step of the base excision repair (BER) pathway. The alkyl‐DNA glycosylases AlkC and AlkD adopt a unique structure based on α‐helical HEAT repeats. Both enzymes identify and excise their substrates without a base‐flipping mechanism used by other glycosylases and nucleic acid pro...
Operons are a hallmark of bacterial genomes, where they allow concerted expression of multiple functionally related genes as single polycistronic transcripts. They are rare in eukaryotes, where each gene usually drives expression of its own independent messenger RNAs. Here we report the horizontal operon transfer of a catecholate-class siderophore...
Abbreviations
NT, nucleotide; AA, amino acid; CI, credible interval; RCV, relative composition variability; IC, internode certainty; GSF, gene support frequencies; GLS, gene-wise log-likelihood scores; DVMC, degree of violation of a molecular clock
The filamentous fungal family Aspergillaceae contains > 1,000 known species, mostly in the genera Asp...
The genetic code used in nuclear genes is almost universal, but here we report that it changed three times in parallel during the evolution of budding yeasts. All three changes were reassignments of the codon CUG, which is translated as serine (in 2 yeast clades), alanine (1 clade), or the 'universal' leucine (2 clades). The newly discovered Ser2 c...
Accession numbers or coordinates for the proteins used to construct the species phylogeny.
AI results for W/S species, C. infanticola, B. adeninivorans, and Su. lignohabitans.
Data used to construct the plots is presented in Figure 1.