Guojie ZhangZhejiang University · Evolutionary & Organismal Biology Research Center
Guojie Zhang
PhD
About
476
Publications
222,512
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
40,000
Citations
Introduction
Guojie Zhang currently works at the Department of Biology, University of Copenhagen. Guojie does research in Evolutionary Biology, Genetics and Neuroscience.
Additional affiliations
September 2017 - present
August 2012 - March 2017
April 2005 - present
Publications
Publications (476)
Ants are one of the most ecologically and evolutionarily successful groups of animals and exhibit a remarkable degree of phenotypic diversity. This success is largely attributed to the fact that all ants are eusocial and live in colonies with a reproductive division of labor between morphologically distinct queen and worker castes. Yet, despite ove...
Motivation
Recent advances in long-read sequencing technologies have significantly facilitated the production of high-quality genome assembly. The telomere-to-telomere (T2T) gapless assembly has become the new golden standard of genome assembly efforts. Several recent efforts have claimed to produce T2T-level reference genomes. However, a universal...
Sponge microbiomes are often highly diverse making it difficult to determine which lineages are important for maintaining host health and homeostasis. Characterising genomic traits associated with symbiosis can improve our knowledge of which lineages have adapted to their host and what functions they might provide. Here we examined five microbial f...
Phenotypic plasticity displayed by an animal in response to different environmental conditions is supposedly crucial for its survival and reproduction. The female adults of some ant lineages display phenotypic plasticity related to reproductive role. In pharaoh ant queens, insemination induces substantial physiological/behavioral changes and implic...
The first developmental table for an ant species, including embryonic, larval, and pupal development.
https://www.biorxiv.org/content/10.1101/2020.12.22.423970v2
Domestication represents one of the largest biological shifts of life on Earth, and for many animal species, behavioral selection is thought to facilitate early stages of the process. The gut microbiome of animals can respond to environmental changes and have diverse and powerful effects on host behavior. As such, we hypothesize that selection for...
Motivation
Recent advances in long-read sequencing technologies have significantly facilitated the production of high-quality genome assembly. The telomere-to-telomere (T2T) gapless assembly has become the new golden standard of genome assembly efforts. Several recent efforts have claimed to produce T2T level reference genomes. However, a universal...
Endogenous retroviruses (ERVs) are ancient retroviral remnants integrated in host genomes, and commonly deleted through unequal homologous recombination, leaving solitary long terminal repeats (solo-LTRs). This study, analysing the genomes of 362 bird species and their reptilian and mammalian outgroups, reveals an unusually higher level of solo-LTR...
The crab-eating macaques (Macaca fascicularis) and rhesus macaques (M. mulatta) are widely studied nonhuman primates in biomedical and evolutionary research. Despite their significance, the current understanding of the complex genomic structure in macaques and the differences between species requires substantial improvement. Here, we present a comp...
Genomes are typically mosaics of regions with different evolutionary histories. When speciation events are closely spaced in time, recombination makes the regions sharing the same history small, and the evolutionary history changes rapidly as we move along the genome. When examining rapid radiations such as the early diversification of Neoaves 66 M...
Despite tremendous efforts in the past decades, relationships among main avian lineages remain heavily debated without a clear resolution. Discrepancies have been attributed to diversity of species sampled, phylogenetic method and the choice of genomic regions1–3. Here we address these issues by analysing the genomes of 363 bird species⁴ (218 taxon...
Although previous studies have identified human-specific accelerated regions as playing a key role in the recent evolution of the human brain, the characteristics and cellular functions of rapidly evolving conserved elements (RECEs) in ancestral primate lineages remain largely unexplored. Here, based on large-scale primate genome assemblies, we ide...
Egg-laying mammals (monotremes) are considered “primitive” due to traits such as oviparity, cloaca, and incomplete homeothermy, all of which they share with reptiles. Two groups of monotremes, the terrestrial echidna (Tachyglossidae) and semiaquatic platypus (Ornithorhynchidae), have evolved highly divergent characters since their emergence in the...
Noncoding DNA is central to our understanding of human gene regulation and complex diseases1,2, and measuring the evolutionary sequence constraint can establish the functional relevance of putative regulatory elements in the human genome3–9. Identifying the genomic elements that have become constrained specifically in primates has been hampered by...
High genetic diversity is a good predictor of long-term population viability, yet some species persevere despite having low genetic diversity. Here we study the genomic erosion of the Seychelles paradise flycatcher (Terpsiphone corvina), a species that narrowly avoided extinction after having declined to 28 individuals in the 1960s. The species rec...
Aneuploidy compromises genomic stability, often leading to embryo inviability, and is frequently associated with tumorigenesis and aging. Different aneuploid chromosome stoichiometries lead to distinct transcriptomic and phenotypic changes, making it helpful to study aneuploidy in tightly controlled genetic backgrounds. By deploying the engineered...
The sponge microbiome underpins host function through provision and recycling of essential nutrients in a nutrient poor environment. Genomic data suggest that carbohydrate degradation, carbon fixation, nitrogen metabolism, sulphur metabolism and supplementation of B‐vitamins are central microbial functions. However, validation beyond the genomic po...
The black rhinoceros (Diceros bicornis L.) is a critically endangered species historically distributed across sub-Saharan Africa. Hunting and habitat disturbance have diminished both its numbers and distribution since the 19th century, but a poaching crisis in the late 20th century drove them to the brink of extinction. Genetic and genomic assessme...
Since the release of the complete human genome, the priority of human genomic study has now been shifting towards closing gaps in ethnic diversity. Here, we present a fully phased and well-annotated diploid human genome from a Han Chinese male individual (CN1), in which the assemblies of both haploids achieve the telomere-to-telomere (T2T) level. C...
Teleosts, the most species-rich group of jawed vertebrates, have evolved a remarkable diversity of primitive and specialized features that provide a valuable resource for understanding vertebrate evolution. Here, we collected a comprehensive genome dataset of fish, including 115 newly sequenced species, to fill genome gaps for seven orders. By comb...
The increased tameness to reduce avoidance of human in wild animals has been long proposed as the key step of animal domestication. The tameness is a complex behavior trait and largely determined by genetic factors. However, the underlying genetic mutations remain vague and how they influence the animal behaviors is yet to be explored. Behavior tes...
The biological mechanisms that underpin primate social evolution remain poorly understood. Asian colobines display a range of social organizations, which makes them good models for investigating social evolution. By integrating ecological, geological, fossil, behavioral, and genomic analyses, we found that colobine primates that inhabit colder envi...
Incomplete lineage sorting (ILS) causes the phylogeny of some parts of the genome to differ from the species tree. In this work, we investigate the frequencies and determinants of ILS in 29 major ancestral nodes across the entire primate phylogeny. We find up to 64% of the genome affected by ILS at individual nodes. We exploit ILS to reconstruct sp...
Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole-genome sequencing data for 809 individuals from 233 primate species and identified 4.3 mill...
The Y chromosome usually plays a critical role in determining male sex and comprises sequence classes that have experienced unique evolutionary trajectories. Here we generated 19 new primate sex chromosome assemblies, analysed them with 10 existing assemblies and report rapid evolution of the Y chromosome across primates. The pseudoautosomal bounda...
Understanding the mechanisms underlying phenotypic innovation is a key goal of comparative genomic studies. Here, we investigated the evolutionary landscape of lineage-specific accelerated regions (LinARs) across 49 primate species. Genomic comparison with dense taxa sampling of primate species significantly improved LinAR detection accuracy and re...
Comparative analysis of primate genomes within a phylogenetic context is essential for understanding the evolution of human genetic architecture and primate diversity. We present such a study of 50 primate species spanning 38 genera and 14 families, including 27 genomes first reported here, with many from previously less well represented groups, th...
The rich diversity of morphology and behavior displayed across primate species provides an informative context in which to study the impact of genomic diversity on fundamental biological processes. Analysis of that diversity provides insight into long-standing questions in evolutionary and conservation biology and is urgent given severe threats the...
Although species can arise through hybridization, compelling evidence for hybrid speciation has been reported only rarely in animals. Here, we present phylogenomic analyses on genomes from 12 macaque species and show that the fascicularis group originated from an ancient hybridization between the sinica and silenus groups ~3.45 to 3.56 million year...
Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole genome sequencing data for 809 individuals from 233 primate species, and identified 4.3 mil...
The rich diversity of morphology and behavior displayed across primate species provides an informative context in which to study the impact of genomic diversity on fundamental biological processes. Analysis of that diversity provides insight into long-standing questions in evolutionary and conservation biology, and is urgent given severe threats th...
Anticipating species’ responses to environmental change is a pressing mission in biodiversity conservation. Despite decades of research investigating how climate change may affect population sizes, historical context is lacking, and the traits that mediate demographic sensitivity to changing climate remain elusive. We use whole-genome sequence data...
The germline mutation rate determines the pace of genome evolution and is an evolving parameter itself¹. However, little is known about what determines its evolution, as most studies of mutation rates have focused on single species with different methodologies². Here we quantify germline mutation rates across vertebrates by sequencing and comparing...
Antarctic krill (Euphausia superba) is Earth's most abundant wild animal, and its enormous biomass is vital to the Southern Ocean ecosystem. Here, we report a 48.01-Gb chromosome-level Antarctic krill genome, whose large genome size appears to have resulted from inter-genic transposable element expansions. Our assembly reveals the molecular archite...
The availability of public genomic resources can greatly assist biodiversity assessment, conservation, and restoration efforts by providing evidence for scientifically informed management decisions. Here we survey the main approaches and applications in biodiversity and conservation genomics, considering practical factors, such as cost, time, prere...
The availability of public genomic resources can greatly assist biodiversity assessment, conservation, and restoration efforts by providing evidence for scientifically informed management decisions. Here we survey the main approaches and applications in biodiversity and conservation genomics, considering practical factors, such as cost, time, prere...
Despite evidence of declining biosphere integrity, we currently lack understanding of how the functional diversity associated with changes in abundance among ecological communities has varied over time and before widespread human disturbances. We combine morphological, ecological, and life-history trait data for >260 extant bird species with genomi...
Extensive adenosine-to-inosine (A-to-I) editing of nuclear-transcribed mRNAs is the hallmark of metazoan transcriptional regulation. Here, by profiling the RNA editomes of 22 species that cover major groups of Holozoa, we provide substantial evidence supporting A-to-I mRNA editing as a regulatory innovation originating in the last common ancestor o...
The chicken continues to hold its position as a leading model organism within many areas of research, as well as a being major source of protein for human consumption. The First Report on Chicken Genes and Chromosomes [Schmid et al., 2000], which was published in 2000, was the brainchild of the late, and sadly missed, Prof Michael Schmid of the Uni...
Polygamy and purifying selection in birds - supplementary material
High genetic diversity is often a good predictor of long-term population viability, yet some species persevere despite having low genetic diversity. Here we study the genomic erosion of the Seychelles paradise flycatcher (Terpsiphone corvina), a species that narrowly avoided extinction after having declined to 28 individuals in the 1960s. The speci...
Good genes theories of sexual selection predict that polygamy will be associated with more efficient removal of deleterious alleles (purifying selection), due to the alignment of sexual selection with natural selection. On the other hand, runaway selection theories expect no such alignment of natural and sexual selection, and may instead predict le...
Background
The Nile rat (Avicanthis niloticus) is an important animal model because of its robust diurnal rhythm, a cone-rich retina, and a propensity to develop diet-induced diabetes without chemical or genetic modifications. A closer similarity to humans in these aspects, compared to the widely used Mus musculus and Rattus norvegicus models, hold...
The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has benefitted society1,2. However, it still has many gaps and errors, and does not represent a biological genome as it is a blend of multiple individuals3,4. Recently, a high-quality telomere-to-telomere reference, CHM13, was g...
Ant colonies are higher-level organisms consisting of specialized reproductive and non-reproductive individuals that differentiate early in development, similar to germ–soma segregation in bilateral Metazoa. Analogous to diverging cell lines, developmental differentiation of individual ants has often been considered in epigenetic terms but the sets...
Lorises are a group of globally threatened strepsirrhine primates that exhibit many unusual physiological and behavioral features, including a low metabolic rate, slow movement, and hibernation. Here, we assembled a chromosome-level genome sequence of the pygmy loris ( Xanthonycticebus pygmaeus ) and resequenced whole genomes from 50 pygmy lorises...
Background
Many short-read genome assemblies have been found to be incomplete and contain mis-assemblies. The Vertebrate Genomes Project has been producing new reference genome assemblies with an emphasis on being as complete and error-free as possible, which requires utilizing long reads, long-range scaffolding data, new assembly algorithms, and m...
Manakins are a family of small suboscine passerine birds characterized by their elaborate courtship displays, non-monogamous mating system, and sexual dimorphism. This family has served as a good model for the study of sexual selection. Here we present genome assemblies of four manakin species, including Cryptopipo holochlora, Dixiphia pipra (also...
Aneuploidy compromises genomic stability, often leading to embryo inviability, and is frequently associated with tumorigenesis and aging. Different aneuploid chromosome stoichiometries lead to distinct transcriptomic and phenotypic changes, making it helpful to study aneuploidy in tightly controlled genetic backgrounds. By deploying the engineered...
The rate of mutation accumulation in germline cells can be affected by cell replication and/or DNA damage, which are further related to life history traits such as generation time and body mass. Leveraging the existing datasets of 233 neoavian bird species, here, we investigated whether generation time and body mass contribute to the interspecific...
Anticipating species’ responses to environmental change is a pressing mission in biodiversity conservation. Despite decades of research investigating how climate change may affect population sizes, historical context is lacking and the traits which mediate demographic sensitivity to changing climate remain elusive. We use whole-genome sequence data...
Ant colonies with permanent division of labour between castes and highly distinct roles of the sexes have been conceptualized to be superorganisms, but the cellular and molecular mechanisms that mediate caste/sex-specific behavioural specialization have remained obscure. Here we characterized the brain cell repertoire of queens, gynes (virgin queen...
Penguins lost the ability to fly more than 60 million years ago, subsequently evolving a hyper-specialized marine body plan. Within the framework of a genome-scale, fossil-inclusive phylogeny, we identify key geological events that shaped penguin diversification and genomic signatures consistent with widespread refugia/recolonization during major c...
Transcriptomic diversity greatly contributes to the fundamentals of disease, lineage-specific biology, and environmental adaptation. However, much of the actual isoform repertoire contributing to shaping primate evolution remains unknown. Here, we combined deep long- and short-read sequencing complemented with mass spectrometry proteomics in a pane...
Egg-laying mammals (monotremes) are a sister clade of therians (placental mammals and marsupials) and a key clade to understand mammalian evolution. They are classified into platypus and echidna, which exhibit distinct ecological features such as habitats and diet. Chemosensory genes, which encode sensory receptors for taste and smell, are believed...
Significance
The controversy over the taxonomic identity of the eggs exploited by Australia’s first people around 50,000 y ago is resolved. The birds that laid these eggs are extinct, and distinguishing between two main candidates, a giant flightless “mihirung” Genyornis and a large megapode Progura , had proven impossible using morphological and g...
Incomplete lineage sorting (ILS) makes ancestral genetic polymorphisms persist during rapid speciation events, inducing incongruences between gene trees and species trees. ILS has complicated phylogenetic inference in many lineages, including hominids. However, we lack empirical evidence that ILS leads to incongruent phenotypic variation. Here, we...
Ant colonies are higher-level organisms consisting of specialized reproductive and non-reproductive individuals that differentiate early in development, similar to germ-soma segregation in bilateral Metazoa. Analogous to diverging cell lines, developmental differentiation of individual ants has often been considered in epigenetic terms, but the set...
The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has greatly benefited society. However, it still has many gaps and errors, and does not represent a biological human genome since it is a blend of multiple individuals. Recently, a high-quality telomere-to-telomere reference gen...
Three principal methods are under discussion as possible pathways to “true” de-extinction; i.e., back-breeding, cloning, and genetic engineering.¹,² Of these, while the latter approach is most likely to apply to the largest number of extinct species, its potential is constrained by the degree to which the extinct species genome can be reconstructed...
Ants are highly diverse social insects living in colonies consisted of up to millions of individuals with reproductive division of labors. Due to the interests in disclosing the genetic and epigenetic regulation mechanisms underlying the distinct developmental trajectories between castes and division of labor in colonies, many ant species have rece...
Life on Earth has evolved from initial simplicity to the astounding complexity we experience today. Bacteria and archaea have largely excelled in metabolic diversification, but eukaryotes additionally display abundant morphological innovation. How have these innovations come about and what constraints are there on the origins of novelty and the con...
November 2020 marked 2 y since the launch of the Earth BioGenome Project (EBP), which aims to sequence all known eukaryotic species in a 10-y timeframe. Since then, significant progress has been made across all aspects of the EBP roadmap, as outlined in the 2018 article describing the project’s goals, strategies, and challenges (1). The launch phas...
A global international initiative, such as the Earth BioGenome Project (EBP), requires both agreement and coordination on standards to ensure that the collective effort generates rapid progress toward its goals. To this end, the EBP initiated five technical standards committees comprising volunteer members from the global genomics scientific commun...