Zhaohui S Qin

Zhaohui S Qin
Emory University | EU · Department of Biostatistics and Bioinformatics

PhD

About

296
Publications
28,562
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
17,331
Citations
Introduction
My research is in the areas of statistical genetics, genomics and bioinformatics. Motivated by scientific needs in these area and new technologies, I am undertaking both methodological and collaborative research. A major focus of my research is on developing and evaluating model-based methods to analyze genetics and genomics data, in particular, next generation sequencing data, and on developing software so that the methods can be easily adopted by the research community.
Additional affiliations
August 2010 - present
Emory University
August 2003 - July 2010
University of Michigan
Position
  • Professor (Assistant)
January 2003 - December 2010
University of Michigan
Education
September 1995 - June 2000
University of Michigan
Field of study
  • Statistics
March 1995 - June 1995
Utah State University
Field of study
  • Statistics
September 1990 - June 1994
Peking University
Field of study
  • Probability and Statistics

Publications

Publications (296)
Article
Full-text available
Background Targeting mitochondrial oncoproteins presents a new concept in the development of effective cancer therapeutics. ATAD3A is a nuclear-encoded mitochondrial enzyme contributing to mitochondrial dynamics, cholesterol metabolism, and signal transduction. However, its impact and underlying regulatory mechanisms in cancers remain ill-defined....
Article
The rising global HIV-1 burden urgently requires vaccines capable of providing heterologous protection. Here, we developed a clade C HIV-1 vaccine consisting of priming with modified vaccinia Ankara (MVA) and boosting with cyclically permuted trimeric gp120 (CycP-gp120) protein, delivered either orally using a needle-free injector or through parent...
Poster
Targeting mitochondria as a cancer therapeutic strategy has attracted more and more attention in the recent years. ATAD3A is identified as a nuclear-encoded mitochondrial enzyme that regulates mitochondrial dynamics, cholesterol metabolism, and signal transduction. However, its impact and underlying regulatory mechanisms in cancers remain to be det...
Preprint
A major question in systems biology is how to identify the core gene regulatory circuit that governs the decision-making of a biological process. Here, we develop a computational platform for constructing core transcription-factor regulatory networks, named NetAct, using both transcriptomics data and literature-based transcription factor-target dat...
Article
Full-text available
Transmitted/founder (T/F) HIV-1 envelope proteins (Envs) from infected individuals that developed neutralization breadth are likely to possess inherent features desirable for vaccine immunogen design. To explore this premise, we conducted an immunization study in rhesus macaques (RM) using T/F Env sequences from two human subjects, one of which dev...
Article
Full-text available
A/B compartments are observed in Hi-C data and coincide with eu/hetero-chromatin. However, many genomic regions are ambiguous under A/B compartment scheme. We develop MOSAIC (MOdularity and Singular vAlue decomposition-based Identification of Compartments), an accurate compartmental state detection scheme. MOSAIC reveals that those ambiguous region...
Article
Given most tissues are consist of abundant and diverse (sub-)cell types, an important yet unaddressed problem in bulk RNA-seq analysis is to identify at which (sub-)cell type(s) the differential expression occurs. Single-cell RNA-sequencing (scRNA-seq) technologies can answer the question, but they are often labor-intensive and cost-prohibitive. He...
Article
Lung cancer remains the leading cause of cancer deaths worldwide despite advances in knowledge in cancer biology and options of various targeted therapies. Efforts in identifying innovative and effective therapies are still highly appreciated. Targeting bromodomain and extra terminal (BET) proteins that function as epigenetic readers and master tra...
Article
Full-text available
Introduction Integrating brain imaging with large scale omics data may identify novel mechanisms of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD). We integrated and analyzed brain magnetic resonance imaging (MRI) with cerebrospinal fluid (CSF) metabolomics to elucidate metabolic mechanisms and create a “metabolic map” of the br...
Article
Full-text available
Cancer genomic, transcriptomic, and proteomic profiling has generated extensive data that necessitate the development of tools for its analysis and dissemination. We developed UALCAN to provide a portal for easy exploring, analyzing, and visualizing these data, allowing users to integrate the data to better understand the gene, proteins, and pathwa...
Article
Full-text available
Papillary renal cell carcinoma (pRCC) is the most heterogenous renal cell carcinoma. Patient survival varies and no effective therapies for advanced pRCC exist. Histological and molecular characterization studies have highlighted the heterogeneity of pRCC tumours. Recent studies identified the proximal tubule (PT) cell as a cell-of-origin for pRCC....
Article
Full-text available
The collection of expression quantitative trait loci (eQTLs) is an important resource to study complex traits through understanding where and how transcriptional regulations are controlled by genetic variations in the non-coding regions of the genome. Previous studies have focused on associating eQTLs with traits to identify the roles of trait-rela...
Article
Full-text available
Understanding the impact of non-coding sequence variants on complex diseases is an essential problem. We present a novel ensemble learning framework-CASAVA, to predict genomic loci in terms of disease category-specific risk. Using disease-associated variants identified by GWAS as training data, and diverse sequencing-based genomics and epigenomics...
Preprint
Chromosome conformation capture-based experiments have shown that eukaryotic chromosomes are partitioned into A and B compartments conventionally identified by the first eigenvector (EV1) of dimension reduction methods. However, many genomic regions show marginal EV1 values, indicating the ambiguity of A/B compartment scheme on these regions. We de...
Article
Full-text available
The majority of the single nucleotide variants (SNVs) identified by genome-wide association studies (GWAS) fall outside of the protein-coding regions. Elucidating the functional implications of these variants has been a major challenge. A possible mechanism for functional non-coding variants is that they disrupted the canonical transcription factor...
Article
Background: Metabolic differences between human papillomavirus (HPV)-associated head and neck squamous cell carcinoma (HNSCC) and smoking-associated HNSCC may partially explain differences in prognosis. The former relies on mitochondrial oxidative phosphorylation (OXPHOS) while the latter relies on glycolysis. These differences have not been studi...
Preprint
The rapid proliferation of single-cell RNA-sequencing (scRNA-seq) datasets have revealed cell heterogeneity at unprecedented scales. Several deconvolution methods have been developed to decompose bulk experiments to reveal cell type contributions. However, these methods lack power in identifying the accurate cell type composition when having a cons...
Article
Full-text available
Alzheimer’s disease (AD) is influenced by both genetic and environmental factors; thus, brain epigenomic alterations may provide insights into AD pathogenesis. Multiple array-based Epigenome-Wide Association Studies (EWASs) have identified robust brain methylation changes in AD; however, array-based assays only test about 2% of all CpG sites in the...
Article
Full-text available
Identifying biomarkers to predict the clinical outcomes of individual patients is a fundamental problem in clinical oncology. Multiple single-gene biomarkers have already been identified and used in clinics. However, multiple oncogenes or tumor-suppressor genes are involved during the process of tumorigenesis. Additionally, the efficacy of single-g...
Article
Full-text available
Elucidating transcription mediated by the glucocorticoid receptor (GR) is crucial for understanding the role of glucocorticoids (GCs) in the treatment of diseases. Podocyte is a useful model for studying GR regulation because GCs are the primary medication for podocytopathy. In this study, we integrated data from transcriptome, transcription factor...
Article
Background Transcriptional programs control cell fate, and identifying their components is critical for understanding diseases caused by cell lesion, such as podocytopathy. Although many transcription factors (TFs) are necessary for cell-state maintenance in glomeruli, their roles in transcriptional regulation are not well understood. Methods The...
Article
Full-text available
Background Cell type-specific transcriptional programming results from the combinatorial interplay between the repertoire of active regulatory elements. Disease-associated variants disrupt such programming, leading to altered expression of downstream regulated genes and the onset of pathological states. However, due to the non-linear regulatory pro...
Article
Full-text available
Adolescent exposure to chronic stress, a risk factor for mood disorders in adulthood, sensitizes the neuroinflammatory response to a subsequent immune challenge. We previously showed that chronic adolescent stress (CAS) in rats led to distinct patterns of neuroimmune priming in adult male and female rats. However, sex differences in the neuroimmune...
Article
Full-text available
A Correction to this paper has been published: https://doi.org/10.1038/s41467-020-19873-9
Article
Full-text available
Human papilloma virus (HPV) causes a subset of head and neck squamous cell carcinomas (HNSCC) of the oropharynx. We combined targeted DNA-and genome-wide RNA-sequencing to identify genetic variants and gene expression signatures respectively from patients with HNSCC including oropharyngeal squamous cell carcinomas (OPSCC). DNA and RNA were purified...
Article
Full-text available
http://deepblue.lib.umich.edu/bitstream/2027.42/173434/1/12859_2020_Article_3580.pdf
Article
Full-text available
Polyglutamine expansion in proteins can cause selective neurodegeneration, although the mechanisms are not fully understood. In Huntington’s disease (HD), proteolytic processing generates toxic N-terminal huntingtin (HTT) fragments that preferentially kill striatal neurons. Here, using CRISPR/Cas9 to truncate full-length mutant HTT in HD140Q knock-...
Article
Full-text available
While our understanding of cellular and molecular processes has grown exponentially, issues related to the cell microenvironment and cellular heterogeneity have sparked a new debate concerning the cell identity. Cell composition (chromatin and nuclear architecture) poses a strong risk for dynamic changes in the diseased condition. Since chromatin a...
Article
Full-text available
Identification of repeat-associated non-AUG (RAN) translation in trinucleotide (CAG) repeat diseases has led to the emerging concept that CAG repeat diseases are caused by nonpolyglutamine products. Nonetheless, the in vivo contribution of RAN translation to the pathogenesis of CAG repeat diseases remains elusive. Via CRISPR/Cas9-mediated genome ed...
Article
Motivation: The availability of thousands of genome-wide ChIP-Seq datasets across hundreds of transcriptional factors (TFs) and cell lines provides an unprecedented opportunity to jointly analyze large-scale transcription factor binding in vivo, making possible the discovery of the potential interaction and cooperation among different TFs. The int...
Article
Full-text available
Identifying effective pharmacological treatments for heart failure (HF) patients remains critically important. Given that the development of drugs de novo is expensive and time consuming, drug repositioning has become an increasingly important branch. In the present study, we propose a two-step drug repositioning pipeline and investigate the novel...
Article
Motivation: Annotating a given genomic locus or a set of genomic loci is an important yet challenging task. This is especially true for the non-coding part of the genome which is enormous yet poorly understood. Since gene set enrichment analyses have demonstrated to be effective approach to annotate a set of genes, the same idea can be extended to...
Article
Full-text available
DNA molecules are highly compacted in the eukaryotic nucleus where distal regulatory elements reach their targets through three-dimensional chromosomal interactions. G-quadruplexes, stable four-stranded non-canonical DNA structures, can change local chromatin organization through the exclusion of nucleosomes. However, the relationship between G-qua...
Article
Full-text available
The temporal order of DNA replication is regulated during development and is highly correlated with gene expression, histone modifications and 3D genome architecture. We tracked changes in replication timing, gene expression, and chromatin conformation capture (Hi-C) A/B compartments over the first two cell cycles during differentiation of human em...
Article
Full-text available
Asthma is a complex disease with striking disparities across racial and ethnic groups. Despite its relatively high burden, representation of individuals of African ancestry in asthma genome-wide association studies (GWAS) has been inadequate, and true associations in these underrepresented minority groups have been inconclusive. We report the resul...
Article
Full-text available
Motivation: The replication timing (RT) program has been linked to many key biological processes including cell fate commitment, 3D chromatin organization and transcription regulation. Significant technology progress now allows to characterize the RT program in the entire human genome in a high-throughput and high-resolution fashion. These experim...
Article
There are significant correlations among different types of genetic, genomic and epigenomic features within the genome. These correlations make the in silico feature prediction possible through statistical or machine learning models. With the accumulation of a vast amount of high-throughput data, feature prediction has gained significant interest l...
Preprint
Full-text available
The replication timing (RT) program has been linked to many key biological processes including cell fate commitment, 3D chromatin organization and transcription regulation. Significant technology progress now allows to characterize the RT program in the entire human genome in a high-throughput and high-resolution fashion. These experiments suggest...
Article
The recently developed Hi-C technology enables a genome-wide view of chromosome spatial organizations, and has shed deep insights into genome structure and genome function. However, multiple sources of uncertainties make downstream data analysis and interpretation challenging. Specifically, statistical models for inferring three-dimensional (3D) ch...
Article
The EWS/ETS fusion transcription factors drive Ewing sarcoma (EWS) by orchestrating an oncogenic transcription program. Therapeutic targeting of EWS/ETS has been unsuccessful; however, identifying mediators of the EWS/ETS function could offer new therapeutic options. Here we describe the dependency of EWS/ETS-driven transcription upon chromatin rea...
Article
Full-text available
Background Sorted merging of genomic data is a common data operation necessary in many sequencing-based studies. It involves sorting and merging genomic data from different subjects by their genomic locations. In particular, merging a large number of Variant Call Format (VCF) files is frequently required in large scale whole genome sequencing or wh...