
Daniel Rokhsar- University of California, Berkeley
Daniel Rokhsar
- University of California, Berkeley
About
350
Publications
155,055
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
70,054
Citations
Introduction
Skills and Expertise
Current institution
Publications
Publications (350)
Although the adult pentaradial body plan of echinoderms evolved from a bilateral ancestor, identifying axial homologies between the morphologically divergent echinoderms and their bilaterian relatives has been an enduring problem in zoology. The expression of conserved bilaterian patterning genes in echinoderms provides a molecular framework for re...
Identifying populations at highest risk from climate change is a critical component of conservation efforts. However, vulnerability assessments are usually applied at the species level, even though intraspecific variation in exposure, sensitivity and adaptive capacity play a crucial role in determining vulnerability. Genomic data can inform intrasp...
Deuterostomes are a monophyletic group of animals that includes Hemichordata, Echinodermata (together called Ambulacraria), and Chordata. The diversity of deuterostome body plans has made it challenging to reconstruct their ancestral condition and to decipher the genetic changes that drove the diversification of deuterostome lineages. Here, we gene...
As the only surviving lineages of jawless fishes, hagfishes and lampreys provide a crucial window into early vertebrate evolution1–3. Here we investigate the complex history, timing and functional role of genome-wide duplications4–7 and programmed DNA elimination8,9 in vertebrates in the light of a chromosome-scale genome sequence for the brown hag...
Frogs are an ecologically diverse and phylogenetically ancient group of anuran amphibians that include important vertebrate cell and developmental model systems, notably the genus Xenopus. Here we report a high-quality reference genome sequence for the western clawed frog, Xenopus tropicalis, along with draft chromosome-scale sequences of three dis...
Cassava (Manihot esculenta Crantz) is a food and industrial storage root crop with substantial potential to contribute to managing risk associated with climate change due to its inherent resilience and in providing a biodegradable option in manufacturing. In Africa, cassava production is challenged by two viral diseases, cassava brown streak diseas...
Deuterostomes are an animal superphylum that includes Hemichordata and Echinodermata (together Ambulacraria) and Chordata. The diversity of deuterostome body plans has made it challenging to reconstruct their ancestral condition and to decipher the genetic changes that drove the diversification of deuterostome lineages. Here, we generate chromosome...
The origin of the pentaradial body plan of echinoderms from a bilateral ancestor is one of the most enduring zoological puzzles1,2. Because echinoderms are defined by morphological novelty, even the most basic axial comparisons with their bilaterian relatives are problematic. To revisit this classical question, we used conserved anteroposterior axi...
Hybridization brings together chromosome sets from two or more distinct progenitor species. Genome duplication associated with hybridization, or allopolyploidy, allows these chromosome sets to persist as distinct subgenomes during subsequent meioses. Here, we present a general method for identifying the subgenomes of a polyploid based on shared anc...
Cephalopods are remarkable among invertebrates for their cognitive abilities, adaptive camouflage, novel structures, and propensity for recoding proteins through RNA editing. Due to the lack of genetically tractable cephalopod models, however, the mechanisms underlying these innovations are poorly understood. Genome editing tools such as CRISPR-Cas...
A central question in evolutionary biology is whether sponges or ctenophores (comb jellies) are the sister group to all other animals. These alternative phylogenetic hypotheses imply different scenarios for the evolution of complex neural systems and other animal-specific traits1–6. Conventional phylogenetic approaches based on morphological charac...
As the only surviving lineages of jawless fishes, hagfishes and lampreys provide a critical window into early vertebrate evolution. Here, we investigate the complex history, timing, and functional role of genome-wide duplications in vertebrates in the light of a chromosome-scale genome of the brown hagfish Eptatretus atami. Using robust chromosome-...
Skates are cartilaginous fish whose body plan features enlarged wing-like pectoral fins, enabling them to thrive in benthic environments1,2. However, the molecular underpinnings of this unique trait remain unclear. Here we investigate the origin of this phenotypic innovation by developing the little skate Leucoraja erinacea as a genomically enabled...
Cassava (Manihot esculenta) is a starchy root crop that supports over a billion people in tropical and subtropical regions of the world. This staple, however, produces the neurotoxin cyanide and requires processing for safe consumption. Excessive consumption of insufficiently processed cassava, in combination with protein-poor diets, can have neuro...
Chromosomal rearrangements can initiate and drive cancer progression, yet it has been challenging to evaluate their impact, especially in genetically heterogeneous solid cancers. To address this problem we developed HiDENSEC, a new computational framework for analyzing chromatin conformation capture in heterogeneous samples, which can infer somatic...
The origin of the pentaradial body plan of echinoderms from a bilateral ancestor is one of the most enduring zoological puzzles. Since echinoderms are defined by morphological novelty, even the most basic axial comparisons with their bilaterian relatives are problematic. Here, we used conserved antero-posterior (AP) axial molecular markers to deter...
De novo genome assembly, i.e., rebuilding the sequence of an unknown genome from redundant and erroneous short sequences, is a key but computationally intensive step in many genomics pipelines. The exponential growth of genomic data is increasing the computational demand and requires scalable, high-performance approaches. In this work, we present a...
Key message
We demystify recent advances in genome assemblies for the heterozygous staple crop cassava (Manihot esculenta), and highlight key cassava genomic resources.
Abstract
Cassava, Manihot esculenta Crantz, is a crop of societal and agricultural importance in tropical regions around the world. Genomics provides a platform for accelerated imp...
A correction to this paper has been published: https://doi.org/10.1007/s11103-021-01139-7
Although the camera-type eyes of cephalopods and vertebrates are a canonical example of convergent morphological evolution, the cellular and molecular mechanisms underlying this convergence remain obscure. We used genomics and single cell transcriptomics to study these mechanisms in the visual system of the bobtail squid Euprymna berryi , an emergi...
Cephalopods are known for their large nervous systems, complex behaviors and morphological innovations. To investigate the genomic underpinnings of these features, we assembled the chromosomes of the Boston market squid, Doryteuthis (Loligo) pealeii, and the California two-spot octopus, Octopus bimaculoides , and compared them with those of the Haw...
The nutrient-rich tubers of the greater yam, Dioscorea alata L., provide food and income security for millions of people around the world. Despite its global importance, however, greater yam remains an orphan crop. Here, we address this resource gap by presenting a highly contiguous chromosome-scale genome assembly of D. alata combined with a dense...
Skates are cartilaginous fish whose novel body plan features remarkably enlarged wing-like pectoral fins that allow them to thrive in benthic environments. The molecular underpinnings of this unique trait, however, remain elusive. Here we investigate the origin of this phenotypic innovation by developing the little skate Leucoraja erinacea as a gen...
Animal genomes show networks of deeply conserved gene linkages whose phylogenetic scope and chromosomal context remain unclear. Here, we report chromosome-scale conservation of synteny among bilaterians, cnidarians, and sponges and use comparative analysis to reconstruct ancestral chromosomes across major animal groups. Comparisons among diverse me...
Frogs are an ecologically diverse and phylogenetically ancient group of living amphibians that include important vertebrate cell and developmental model systems, notably the genus Xenopus . Here we report a high-quality reference genome sequence for the western clawed frog, Xenopus tropicalis , along with draft chromosome-scale sequences of three d...
Cassava ( Manihot esculenta Crantz) is a starchy root crop that supports over a billion people in tropical and subtropical regions of the world. This staple, however, produces toxic cyanogenic compounds and requires processing for safe consumption. Excessive consumption of insufficiently processed cassava, in combination with protein-poor diets, ca...
Meiosis is conserved across eukaryotes yet varies in the details of its execution. Here we describe a new comparative model system for molecular analysis of meiosis, the nematode Pristionchus pacificus , a distant relative of the widely studied model organism Caenorhabditis elegans . P. pacificus shares many anatomical and other features that facil...
Cephalopods have recently moved into the research focus due to the growing number of sequenced genomes, molecular tools, and laboratory culture (Albertin & Simakov, 2020). Genome data now allows us to ask how the many known novelties of cephalopod morphology are reflected in their genomes and gene regulation. A crucial gap in this understanding ha...
The origin and dispersal of cultivated and wild mandarin and related citrus are poorly understood. Here, comparative genome analysis of 69 new east Asian genomes and other mainland Asian citrus reveals a previously unrecognized wild sexual species native to the Ryukyu Islands: C. ryukyuensis sp. nov. The taxonomic complexity of east Asian mandarins...
Bobtail and bottletail squid are small cephalopods with striking anti-predatory defensive mechanisms, bioluminescence, and complex morphology; that inhabit nektobenthic and pelagic environments around the world’s oceans. Yet, the evolution and diversification of these animals remain unclear. Here, we used shallow genome sequencing of thirty-two bob...
A pan-genome is the nonredundant collection of genes and/or DNA sequences in a species. Numerous studies have shown that plant pan-genomes are typically much larger than the genome of any individual and that a sizable fraction of the genes in any individual are present in only some genomes. The construction and interpretation of plant pan-genomes a...
The nutrient-rich tubers of the greater yam Dioscorea alata L. provide food and income security for millions of people around the world. Despite its global importance, however, greater yam remains an "orphan crop." Here we address this resource gap by presenting a highly-contiguous chromosome-scale genome assembly of greater yam combined with a den...
Long-term climate change and periodic environmental extremes threaten food and fuel security1 and global crop productivity2–4. Although molecular and adaptive breeding strategies can buffer the effects of climatic stress and improve crop resilience5, these approaches require sufficient knowledge of the genes that underlie productivity and adaptatio...
Although its sequence was recently determined in a genomic tour de force ,{Edger 2019} the ancestry of the cultivated octoploid strawberry Fragaria x ananassa remains controversial.{Liston 2020; Edger 2020} Polyploids that arise by hybridization generally have chromosome sets, or subgenomes, of distinct ancestry.{Stebbins 1947; Garsmeur 2014} The c...
Miscanthus is a perennial wild grass that is of global importance for paper production, roofing, horticultural plantings, and an emerging highly productive temperate biomass crop. We report a chromosome-scale assembly of the paleotetraploid M. sinensis genome, providing a resource for Miscanthus that links its chromosomes to the related diploid Sor...
Trifoliate orange (Poncirus trifoliata), a deciduous close relative of evergreen Citrus, has important traits for citrus production including tolerance/resistance to citrus greening disease (Huanglongbing, HLB) and other major diseases, and cold tolerance. It has been one of the most important rootstocks and one of the most valuable sources of resi...
Closely related muntjac deer show striking karyotype differences. Here we describe chromosome-scale genome assemblies for Chinese and Indian muntjacs, Muntiacus reevesi (2n = 46) and Muntiacus muntjak vaginalis (2n = 6/7), and analyze their evolution and architecture. The genomes show extensive collinearity with each other and with other deer and c...
Our understanding of polyploid genome evolution is constrained because we cannot know the exact founders of a particular polyploid. To differentiate between founder effects and post polyploidization evolution, we use a pan-genomic approach to study the allotetraploid Brachypodium hybridum and its diploid progenitors. Comparative analysis suggests t...
Metagenome sequence datasets can contain terabytes of reads, too many to be coassembled together on a single shared-memory computer; consequently, they have only been assembled sample by sample (multiassembly) and combining the results is challenging. We can now perform coassembly of the largest datasets using MetaHipMer, a metagenome assembler des...
Although it is widely believed that early vertebrate evolution was shaped by ancient whole-genome duplications, the number, timing and mechanism of these events remain elusive. Here, we infer the history of vertebrates through genomic comparisons with a new chromosome-scale sequence of the invertebrate chordate amphioxus. We show how the karyotypes...
Miscanthus is a perennial wild grass that is of global importance for paper production, roofing, horticultural plantings, and an emerging highly productive temperate biomass crop. We report a chromosome-scale assembly of the paleotetraploid M. sinensis genome, providing a resource for Miscanthus that links its chromosomes to the related diploid Sor...
Bobtail squid are emerging models for host–microbe interactions, behavior, and development, yet their species diversity and distribution remain poorly characterized. Here, we combine mitochondrial and transcriptome sequences with morphological analysis to describe three species of bobtail squid (Sepiolidae: Sepiolinae) from the Ryukyu archipelago,...
Introduction Dioscorea alata (water yam) is the most widely distributed species globally because of its agronomic flexibility and productive potential. D. alata faces a number of constraints that significantly reduce its potential to support rural development and meet consumers' needs as an affordable nutritional product. These constraints are ushe...
Recent advances in long-read sequencing enable the characterization of genome structure and its intra- and inter-species variation at a resolution that was previously impossible. Detecting overlaps between reads is integral to many long-read genomics pipelines, such as de novo genome assembly. While longer reads simplify genome assembly and improve...
Despite their recent divergence, muntjac deer show striking karyotype differences. Here we describe new chromosome-scale genome assemblies for the Chinese and Indian muntjacs, Muntiacus reevesi (2n=46) and Muntiacus muntjak (2n=6/7), and analyze their evolution and architecture. We identified six fusion events shared by both species relative to the...
We present a platinum-quality genome assembly for the model grass Setaria viridis, and high quality genomic sequences of 600+ wild accessions (average 42.6x coverage). Presence-absence variation (PAV) and single-nucleotide polymorphisms (SNPs) identify several subpopulations in North America. Using genome-wide association mapping plus CRISPR-Cas9 t...
Following publication of the original article [1], the authors reported that the Availability of data and materials section required updating. The updated text reads as follows:
Background:
Genomic variation is widespread, and both neutral and selective processes can generate similar patterns in the genome. These processes are not mutually exclusive, so it is difficult to infer the evolutionary mechanisms that govern population and species divergence. Boechera stricta is a perennial relative of Arabidopsis thaliana native...
The Western clawed frog Xenopus tropicalis is a diploid model system for both frog genetics and developmental biology, complementary to the paleotetraploid X. laevis. Here we report a chromosome-scale assembly of the X. tropicalis genome, improving the previously published draft genome assembly through the use of new assembly algorithms, additional...
Acoel-regeneration regulatory landscapes
Some animals, including some types of worms, can undergo whole-body regeneration and replace virtually any missing cell type. Gehrke et al. sequenced and assembled the genome of Hofstenia miamia , a regenerative acoel worm species (see the Perspective by Alonge and Schatz). They identified a variable motif c...
A persistent concern with CRISPR-Cas9 gene editing has been the potential to generate mutations at off-target genomic sites. While CRISPR-engineering mice to delete a ~360 bp intronic enhancer, here we discovered a founder line that had marked immune dysregulation caused by a 24 kb tandem duplication of the sequence adjacent to the on-target deleti...
Chaetognaths (arrow worms) are an enigmatic group of marine animals whose phylogenetic position remains elusive, in part because they display a mix of developmental and morphological characters associated with other groups [1, 2]. In particular, it remains unclear whether they are a sister group to protostomes [1, 2], one of the principal animal su...
Animal–microbe associations are critical drivers of evolutionary innovation, yet the origin of specialized symbiotic organs remains largely unexplored. We analyzed the genome of Euprymna scolopes, a model cephalopod, and observed large-scale genomic reorganizations compared with the ancestral bilaterian genome. We report distinct evolutionary signa...
Environmental stress is a major driver of ecological community dynamics and agricultural productivity. This is especially true for soil water availability, because drought is the greatest abiotic inhibitor of worldwide crop yields. Here, we test the genetic basis of drought responses in the genetic model for C4 perennial grasses, Panicum hallii, th...
Closely related to the model plant Arabidopsis thaliana, the genus Boechera is known to contain both sexual and apomictic species or accessions. Boechera retrofracta is a diploid sexually reproducing species and is thought to be an ancestral parent species of apomictic species. Here we report the de novo assembly of the B. retrofracta genome using...
Closely related to the model plant Arabidopsis thaliana, the genus Boechera is known to contain both sexual and apomictic species or accessions. Boechera retrofracta is a diploid sexually reproducing species and is thought to be an ancestral parent species of the apomictic species Boechera divaricarpa. Here we report the de novo assembly of the B....
The genus Citrus, comprising some of the most widely cultivated fruit crops worldwide, includes an uncertain number of species. Here we describe ten natural citrus species, using genomic, phylogenetic and biogeographic analyses of 60 accessions representing diverse citrus germ plasms, and propose that citrus diversified during the late Miocene epoc...
Key message:
QTL consistent across seasons were detected for resistance to cassava brown streak disease induced root necrosis and foliar symptoms. The CMD2 locus was detected in an East African landrace, and comprised two QTL. Cassava production in Africa is compromised by cassava brown streak disease (CBSD) and cassava mosaic disease (CMD). To re...
In Fig. 5 of the version of this Article originally published, the final number on the x axes of each panel was incorrectly written as 1.5; it should have read 7.5. This has now been corrected in all versions of the Article.
Genetic mapping of quantitative trait loci (QTL) for resistance to cassava brown streak disease (CBSD), cassava mosaic disease (CMD), and cassava green mite (CGM) was performed using an F1 cross developed between the Tanzanian landrace, Kiroba, and a breeding line, AR37-80. The population was evaluated for two consecutive years in two sites in Tanz...
De novo whole genome assembly reconstructs genomic sequence from short, overlapping, and potentially erroneous DNA segments and is one of the most important computations in modern genomics. This work presents HipMER, a high-quality end-to-end de novo assembler designed for extreme scale analysis, via efficient parallelization of the Meraculous code...
Fixed chromosomal inversions can reduce gene flow and promote speciation in two ways: by suppressing recombination and by carrying locally favoured alleles at multiple loci. However, it is unknown whether favoured mutations slowly accumulate on older inversions or if young inversions spread because they capture pre-existing adaptive quantitative tr...
While many short read assemblers attempt to simplify the de Brujin graph by identifying and resolving variant-induced bubbles to produce a haploid mosaic result, this approach is only viable when variants are relatively rare and the bubbles are well defined in a graph context. We observed that diploid genomes with very high levels of heterozygosity...