Erez Lieberman Aiden’s research while affiliated with Baylor College of Medicine and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (212)


Figure 4. Volcano plot of genes differentially expressed between northern and southern groups, 1065 controlling for seasonal differences. Positive log2 fold change values correspond to genes 1066 differentially over-expressed in the southern group, while negative log2 fold change values 1067 correspond to genes differentially under-expressed in the southern group. ADAM20 is not plotted 1068 here, but see Fig. S5. 1069
Chromosome-length genome assembly of Uta stansburiana and gene expression data reveal fast pace-of-life comes with environmental stability
  • Preprint
  • File available

May 2025

·

39 Reads

·

·

·

[...]

·

Dan L. Edwards

Uta stansburiana are an emerging model system for studying sexual selection, polymorphism, and the evolution of pace-of-life syndromes (POLS) whose distribution covers variable environments and a wide latitudinal gradient. POLS are suites of traits causing variation of life history along a slow maturing-fast maturing continuum. We present a high-quality chromosome-level reference genome for U. stansburiana and pair it with RNA-seq gene expression data to demonstrate, for the first time, the molecular basis for pace-of-life differences between locations with higher and lower climate seasonality and sexual size dimorphism (SSD). Our assembly is 2.1 Gbp, has scaffold N50 of 320 Mbp, includes 104 scaffolds, and has an L50 of 3. The assembly comprises six macrochromosomes and 11 microchromosomes. We annotated 20,350 genes for the assembly and found a repeat element composition of 49.23%, similar to work in other phrynosomatid lizards. RNA-seq data demonstrate differential expression in genes associated with pace-of-life divergence including those related to stress, sexual reproduction, and cell proliferation/carcinogenesis between distinctive environments. Our results provide the first differential gene expression evidence of environmentally mediated pace-of-life processes related to different degrees of SSD in U. stansburiana and demonstrate the utility of RNA-seq gene expression data in detecting POLS.

Download

Chromosome-length genome assembly of the critically endangered Mountain bongo (Tragelaphus eurycerus isaaci): A resource for conservation and comparative genomics

May 2025

·

18 Reads

G3 Genes Genomes Genetics

The Mountain bongo (Tragelaphus eurycerus isaaci), a critically endangered tragelaphine antelope native to the montane forests of Kenya, faces significant threats from habitat loss and hunting. Although the Mountain bongo is a flagship species in Kenya, the majority are found in small, isolated populations of less than 100 animals total, making it a species of high conservation concern. In this report, we present a chromosome-length draft genome assembly for the Mountain bongo, generated using a combination of linked-read and proximity ligation (Hi-C) sequencing techniques. The assembly resulted in a 2.96 Gb sized genome with a contig N50 of 79.5 kb and a scaffold N50 of 192 Mb. Assembly completeness was 95.1% based on 12,234 Benchmarking Universal Single-Copy Orthologs (BUSCO) and annotation revealed 29,820 protein coding genes, of which 27,761 were functionally annotated, and a repetitive content of 47.31%. Synteny analysis against the domestic cattle (Bos taurus) genome assembly revealed numerous chromosomal rearrangements between the two species. Our analysis also revealed insights into the evolutionary and demographic history of the Mountain bongo, offering valuable information for conservation management. We also assembled and annotated the mitochondrial genome which showed <1% differences from the Lowland bongo subspecies, T. e. eurycerus. By integrating genomic data with traditional conservation methods, this reference lays the foundation to evaluate and preserve genetic diversity of both in-situ and ex-situ populations of the Mountain bongo amidst growing environmental pressures.


Chromosome-level genomics and historical museum collections reveal new insights into the population structure and chromosome evolution of waterbuck

March 2025

·

29 Reads

Advances in the sequencing and assembly of chromosome-level genome assemblies has enabled the study of non-model animals, providing further insights into the evolution of genomes and chromosomes. Here, we present the waterbuck (Kobus ellipsiprymnus) as an emerging model antelope for studying population dynamics and chromosome evolution. Antelope evolutionary history has been shaped by Robertsonian (Rb) fusions, with waterbuck also showing variation in karyotype due to two polymorphic Rb fusions. These polymorphisms are variable between and within the two recognised subspecies, the common and defassa waterbuck. To provide new insights into waterbuck evolution, we firstly assembled a chromosome-level genome assembly for the defassa subspecies using PacBio HiFi and Hi-C sequencing. We then utilised museum collections to carry out whole genome sequencing (WGS) of 24 historical waterbuck skins from both subspecies. Combined with a previous WGS dataset (n = 119), this represents the largest study of waterbuck populations to date. We found novel population structure and gene flow between waterbuck populations and regions across the genome with high genomic differentiation between the two subspecies. Several of these regions were found around the centromeres of fixed and polymorphic Rb fusions, exhibiting signatures of low recombination and local population structure. Interestingly, these regions contain genes involved in development, fertility, and recombination. Our results highlight the importance of assembling genomes to the chromosome-level, the utility and value of historical collections in sampling a wide-ranging species to uncover fine-scale population structure, and the potential impacts of Rb fusions on genomic differentiation and the recombination landscape.


Map of the study region. Murray–Darling Basin (MDB) is shaded in grey. Major rivers and streams are in light grey, and major rivers sampled are in dark grey. Circles are individual sampling locations; circles in the lower panel are colour‐coded by reach. River reach codes are in Table 1.
Network diagram representing first‐order kin pairs (full‐sibling or parent–offspring relationships) of golden perch detected between river reaches. The thickness of line is indicative of the number of sibling pairs. River reaches are arranged geographically; reaches in the northern parts of the Murray–Darling Basin are in purple, and reaches in the southern parts of the MDB are in blue. Numbers of within‐reach first‐order kin pairs are given in parentheses.
Colour scale represents the number of golden perch stocked directly into the sampled reach from 1990 to 2019 (i.e., intensity of stocking). Size of the circle represents the effective population size (Ne). Plotted Ne values are the corrected LD method estimates. *Indicates that Ne estimates returned by the LD method were infinite. ^Indicates that the upper credible intervals for both the LD and colony methods spanned infinity.
of results of STRUCTURE analysis for K = 7 in golden perch from the Murray–Darling Basin. In panel (A), individuals are plotted as columns and are arranged along the X axis according to their natal origin as ‘wild’, ‘stocked’ or ‘unknown’ (inferred based on integrated otolith and genetic data). The Q values on the Y‐axis represent the proportional individual assignment (A) and mean individual assignment per natal origin category (B) to each of the seven genetic clusters in the Structure analysis. Assignment to the primary genetic cluster (Q1; orange genetic cluster) was associated with an individual being of ‘wild’ origin.
Results of hierarchical Bayesian regression model to explore changes in patterns of kin‐associated population genetic substructure (a proxy for stocking effects) in golden perch over time. Plot shows fitted relationship between birth year of individuals and their admixture coefficient for the primary genetic cluster (q1; proxy for wild origin) in the Structure analysis. Light grey and dark grey shading represent 80% and 95% credible intervals, respectively. Data points are also plotted.
Otolith and Genomic Data Reveal Temporal Insights Into Stocking Across a Large River Basin in a Mobile, Long‐Lived Australian Freshwater Fish Species

March 2025

·

71 Reads

Freshwater ecosystems and their biota are under increasing pressure from anthropogenic stressors. In response to declining fish stocks, hatchery and stocking programmes are widely implemented as core components of restoration and management strategies, with positive outcomes for some wild populations. Despite this, stocking remains contentious due to potential genetic and ecological risks to wild populations. Monitoring and evaluation of stocking outcomes are critical to ensuring the long‐term sustainability of wild populations, but identification of stocked individuals post‐release remains a key challenge, particularly for mobile species. In this study, we combined otolith (natal origin and age) and genomic data to identify stocked individuals and evaluate the genetic implications of stocking for a culturally and socioeconomically important and mobile freshwater fish, golden perch Macquaria ambigua (family: Percichthyidae), across Australia's Murray–Darling Basin (MDB). We also generated a chromosome‐level genome assembly. Many close kin were detected across the MDB, increasing in prevalence over recent decades and mostly of hatchery origin. Rivers with many close kin were associated with low effective population sizes (Ne < 100). Genetic signatures of stocking varied according to local context, being most pronounced in but not restricted to rivers considered functionally isolated for management purposes. Where fish are stocked into rivers that are part of the connected metapopulation, there is scope to modify current stocking practices to avoid over‐representation of related stocked individuals. Increased focus on the genetic diversity of stocked fish is likely to promote the long‐term persistence of golden perch in the wild.


Figure 2. Heterozygosity and demographic history reconstructed from the reference genome of the stone marten individual.
Carnivora species studied using stone marten chromosome painting probes (Zoo FISH experiments).
Chromosome-length genome assembly of the stone marten ( Martes foina , Mustelidae): A new view on one of the cornerstones in carnivore cytogenetics

January 2025

·

90 Reads

Journal of Heredity

The stone marten (Martes foina) is an important species for cytogenetic studies in the order Carnivora. ZooFISH probes created from its chromosomes provided a strong and clean signal in chromosome painting experiments and were valuable for studying the evolution of carnivoran genome architecture. The research revealed that the stone marten chromosome set is similar to the presumed ancestral karyotype of the Carnivora, which added an additional value for the species. Using linked-read and Hi-C sequencing, we generated a chromosome-length genome assembly of a male stone marten (Gansu province, China) from a primary cell line. The stone marten assembly had a length of 2.42 Gbp, scaffold N50 of 144 Mbp, and a 96.2% BUSCO completeness score. We identified 19 chromosomal scaffolds (2n=38) and assigned them chromosome ids based on chromosome painting data. Annotation identified 20,087 protein-coding gene models, of which 18,283 were assigned common names. Comparison of the stone marten assembly with the cat, dog, and human genomes revealed several small syntenic blocks absent on the published painting maps. Finally, we assessed the heterozygosity and its distribution over the chromosomes. The detected low heterozygosity level (0.4 hetSNPs/kbp) and the presence of long RoHs require further research and a new evaluation of the conservation status of the stone marten in China. Combined with available carnivoran genomes in large scale synteny analysis, the stone marten genome will highlight new features and events in carnivoran evolution, hidden from cytogenetic approaches.


The Sensory Shark: High-quality Morphological, Genomic and Transcriptomic Data for the Small-spotted Catshark Scyliorhinus Canicula Reveal the Molecular Bases of Sensory Organ Evolution in Jawed Vertebrates

December 2024

·

120 Reads

·

2 Citations

Molecular Biology and Evolution

Cartilaginous fishes (chondrichthyans: chimaeras and elasmobranchs -sharks, skates and rays) hold a key phylogenetic position to explore the origin and diversifications of jawed vertebrates. Here, we report and integrate reference genomic, transcriptomic and morphological data in the small-spotted catshark Scyliorhinus canicula to shed light on the evolution of sensory organs. We first characterise general aspects of the catshark genome, confirming the high conservation of genome organisation across cartilaginous fishes, and investigate population genomic signatures. Taking advantage of a dense sampling of transcriptomic data, we also identify gene signatures for all major organs, including chondrichthyan specializations, and evaluate expression diversifications between paralogs within major gene families involved in sensory functions. Finally, we combine these data with 3D synchrotron imaging and in situ gene expression analyses to explore chondrichthyan-specific traits and more general evolutionary trends of sensory systems. This approach brings to light, among others, novel markers of the ampullae of Lorenzini electro-sensory cells, a duplication hotspot for crystallin genes conserved in jawed vertebrates, and a new metazoan clade of the transient-receptor potential (TRP) family. These resources and results, obtained in an experimentally tractable chondrichthyan model, open new avenues to integrate multiomics analyses for the study of elasmobranchs and jawed vertebrates.


The genome of the Australian water dragon ( Intellagama lesueurii ), an agamid model for urban adaptation

October 2024

·

27 Reads

Journal of Heredity

Squamate reptiles are a highly diverse and intriguing group of tetrapods, offering valuable insights into the evolution of amniotes. The Australian water dragon (Intellagama lesueurii) is a member of the Agamidae, and sister to the core mesic Australian endemic radiation (Amphibolurinae). The species is renowned for its urban adaptability and complex social systems. We report a 1.8 Gb chromosome-length genome assembly together with the annotation of 23,675 protein-coding genes. Comparative analysis with other squamate genomes highlights gene family expansions associated with immune function, energetic homeostasis, and wound healing. This reference genome will serve as a valuable resource for studies of evolution and environmental resilience in lizards.


Prediction and functional interpretation of inter-chromosomal genome architecture from DNA sequence with TwinC

September 2024

·

12 Reads

Three-dimensional nuclear DNA architecture comprises well-studied intra-chromosomal (cis) folding and less characterized inter-chromosomal (trans) interfaces. Current predictive models of 3D genome folding can effectively infer pairwise cis-chromatin interactions from the primary DNA sequence but generally ignore trans contacts. There is an unmet need for robust models of trans-genome organization that provide insights into their underlying principles and functional relevance. We present TwinC, an interpretable convolutional neural network model that reliably predicts trans contacts measurable through genome-wide chromatin conformation capture (Hi- C). TwinC uses a paired sequence design from replicate Hi-C experiments to learn single base pair relevance in trans interactions across two stretches of DNA. The method achieves high predictive accuracy (AUROC=0.80) on a cross-chromosomal test set from Hi-C experiments in heart tissue. Mechanistically, the neural network learns the importance of compartments, chromatin accessibility, clustered transcription factor binding and G-quadruplexes in forming trans contacts. In summary, TwinC models and interprets trans genome architecture, shedding light on this poorly understood aspect of gene regulation.


Snail plot summaries of assembly statistics. (a) Lutzomyia longipalpis assembly ASM2433408v1. (b) Phlebotomus papatasi assembly JANPWV01. Both plots were generated using blobtoolkit⁴³.
Blobplots of base coverage against GC proportion. (a) Lutzomyia longipalpis assembly ASM2433408v1. (b) Phlebotomus papatasi assembly JANPWV01 with no-hits filtered out. Both plots were generated using blobtoolkit⁴³.
Hi-C contact maps. (a) Lutzomyia longipalpis (b) Phlebotomus papatasi. Chromosome-length Hi-C contact maps visualized in Juicebox⁴⁴.
Improved high quality sand fly assemblies enabled by ultra low input long read sequencing

August 2024

·

97 Reads

·

4 Citations

Scientific Data

Phlebotomine sand flies are the vectors of leishmaniasis, a neglected tropical disease. High-quality reference genomes are an important tool for understanding the biology and eco-evolutionary dynamics underpinning disease epidemiology. Previous leishmaniasis vector reference sequences were limited by sequencing technologies available at the time and inadequate for high-resolution genomic inquiry. Here, we present updated reference assemblies of two sand flies, Phlebotomus papatasi and Lutzomyia longipalpis. These chromosome-level assemblies were generated using an ultra-low input library protocol, PacBio HiFi long reads, and Hi-C technology. The new P. papatasi reference has a final assembly span of 351.6 Mb and contig and scaffold N50s of 926 kb and 111.8 Mb, respectively. The new Lu. longipalpis reference has a final assembly span of 147.8 Mb and contig and scaffold N50s of 1.09 Mb and 40.6 Mb, respectively. Benchmarking Universal Single-Copy Orthologue (BUSCO) assessments indicated 94.5% and 95.6% complete single copy insecta orthologs for P. papatasi and Lu. longipalpis. These improved assemblies will serve as an invaluable resource for future genomic work on phlebotomine sandflies.


Figure 2. Kmer histogram and Hi-C contact map. (A) Genomescope Profile of kmers derived from HiFi reads. (B) Hi-C contact map of the 14 chromosome-length scaffolds,
Current stonefly genome assemblies and quality statistics as of 2024
An unusually large genome from an unusually large stonefly: A chromosome-length genome assembly for the giant salmonfly, Pteronarcys californica (Plecoptera: Pteronarcyidae)

August 2024

·

159 Reads

Journal of Heredity

Pteronarcys californica (Newport 1848) is commonly referred to as the giant salmonfly and is the largest species of stonefly (Insecta: Plecoptera) in the western United States. Historically, it was widespread and abundant in western rivers, but populations have experienced a substantial decline in the past few decades, becoming locally extirpated in numerous rivers in Utah, Colorado, and Montana. Although previous research has explored the ecological variables conducive to the survivability of populations of the giant salmonfly, a lack of genomic resources hampers exploration of how genetic variation is spread across extant populations. To accelerate research on this imperiled species, we present a de novo chromosomal-length genome assembly of P. californica generated from PacBio HiFi sequencing and Hi-C chromosome conformation capture. Our assembly includes 14 predicted pseudo chromosomes and 98.8% of Insecta universal core orthologs. At 2.40 gigabases, the P. californica assembly is the largest of available stonefly assemblies, highlighting at least 9.5-fold variation in assembly size across the order. Repetitive elements (REs) account for much of the genome size increase in P. californica relative to other stonefly species, with the content of Class I retroelements alone exceeding the entire assembly size of all but two other species studied. We also observed preliminary suborder-specific trends in genome size that merit testing with more robust taxon sampling.


Citations (66)


... The results included the family of genes encoding succinate receptors (SUCNR1) with two lineage-specific duplicates in elasmobranchs (Fig. 5B). This method also identified tandem gene clusters previously reported in elasmobranch genomes including protocadherin (Pcdh; Hara et al., 2018), oxytocin/vasopressin (Hara et al., 2018), VLDLR (Ohishi et al., 2023), Lefty (reviewed in Kuraku, 2024), -crystallin (Mayeur et al., 2024), and Wnt1-6b-10b . CC-BY-NC-ND 4.0 International license made available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. ...

Reference:

Tracing genome size dynamics in sharks and rays with inclusive sequence analysis by the Squalomix Consortium
The Sensory Shark: High-quality Morphological, Genomic and Transcriptomic Data for the Small-spotted Catshark Scyliorhinus Canicula Reveal the Molecular Bases of Sensory Organ Evolution in Jawed Vertebrates

Molecular Biology and Evolution

... These technologies routinely produce reads in the 10-100 kb range and avoid PCR-associated artifacts. LRS has been successfully applied for genome polishing [107], sequencing of repetitive chromosome regions [113], and even for whole-genome assembly from single sandflies [114]. Novel applications include RNA isoform sequencing and epigenetic modifications measurements, combined with single-cell sequencing approaches [112,115,116]. ...

Improved high quality sand fly assemblies enabled by ultra low input long read sequencing

Scientific Data

... However, several studies have questioned the role of cohesin in gene regulation [18,19]. This discrepancy is potentially resolved by recent results indicating that EP pairs separated by more than 50-100 kb tend to be regulated by cohesin, whereas more proximal pairs are cohesin-independent [20,21]. Notably, cohesin is important for the regulation of many cell identity genes, which are often controlled by a distal SE [21]. ...

Cohesin-mediated 3D contacts tune enhancer-promoter regulation

... 35 Recently, PaleoHi-C revealed remarkable preservation of 3D genome architecture of a 52,000 year-old woolly mammoth (Mammuthus primigenius), proving the success of Hi-C on ancient, extinct samples. 36 of these questions. For example, polymer collapse effects due to irreversible crosslinking between monomers on contact probabilities for different crosslinker concentrations were numerically modelled, and experimental contact probability curves for different crosslinking concentrations measured as a function of time in Hi-C experiments, studying polymer concentration evolution during crosslinking-induced collapse. ...

Three-dimensional genome architecture persists in a 52,000-year-old woolly mammoth skin sample

Cell

... glabra and V. radiata var. sublobata 18 ; and a chickpea (Cicer arietinum) super-pangenome including wild species such as C. reticulatum, C. judaicum and C. pinnatifidium 19,20 . ...

Cicer super-pangenome provides insights into species evolution and agronomic trait loci for crop improvement in chickpea

Nature Genetics

... We identified a total of 10 and 14 peaks located in clusters 1 and 2, respectively (Fig. 2B). To further filter our data, we reanalyzed ATAC-seq data from sugar glider skin tissue (36), which, with the exception of Camp7, does not express cathelicidin genes (33). Thus, we could exclude shared open chromatin regions between these tissues as plausible regulators of cathelicidin gene expression. ...

Emx2 underlies the development and evolution of marsupial gliding membranes

Nature

... Conversely, some research focuses on using DNA sequences or epigenetic information, to predict the genomes's hierarchical structure and chromatin interaction patterns [41][42][43]. Specifically, models based on DNA sequence information or epigenetic signals, include CoRNN for A/B compartments prediction [44], TAD-Lactuca [45], and pTADS [46] for predicting TADs and their boundaries. ChINN [47], CharID [48], and 3DepiLoop [49] for predicting chromatin loops or interactions in open chromatin regions. ...

Predicting A/B compartments from histone modifications using deep learning

iScience

... SPOP governs the stability of numerous substrates that can directly or indirectly affect PD-L1 mRNA expression. Other research has indicated that B-type lamin degradation induces substantial changes in chromatin conformation, particularly near the nuclear periphery [43], implying a complex relationship between LMNB2 and PD-L1. Lamin proteins interact with chromatin and contribute to nuclear organization and structure [34]. ...

Depletion of lamins B1 and B2 promotes chromatin mobility and induces differential gene expression by a mesoscale-motion-dependent mechanism

Genome Biology

... To evaluate performance at more proximal versus more distal links, we computed average enrichment linking methods across cell types and datasets with genomic distance, training on fine-mapped eQTL data to assign a single probabilistic score to each candidate SNP-gene link. We evaluate the performance of pgBoost and existing methods using sets of SNP-gene links derived from eQTL 23,24 , activity-by-contact (ABC) 8,9 , CRISPR [25][26][27][28][29][30][31][32][33] and GWAS 34,35 data. We also investigate whether restricting to single-cell data from a focal cell type can improve power to detect links relevant to that cell type. ...

An encyclopedia of enhancer-gene regulatory interactions in the human genome

... Following the Spt2 region delimitation utilizing polymorphisms identified from CI5791 and Golden Promise genome alignment, flanking regions of the Spt2 interval of Morex V3 [31] were used as seeds to BLAST against each barley genome within the barley pangenome [64], additionally released pseudochromosome level genomes of cultivars Golden Promise [32,64], AAC Synergy [65], Clipper, Stirling [66], Haruna Nijo [67], Lasa Goumang [68], Zangqing320 [69], wild barley OUH602 [70], landrace Hatiexi [71] and landrace CI5791 from this study. The Spt2 regions from the sequences of all the barley lines were compared to assess the gene space diversity. ...

Genomic signatures of barley breeding for environmental adaptation to the new continents