Aziz Khan

Stanford University · Stanford Cancer Institute

PhD (Bioinformatics/Computational Biology)

Publications: 46
Reads: 11,580
Citations: 4,644
Citations since 2016: 4,622 (44 research items)
[Chart: citations per year, 2016 to 2022; vertical axis 0 to 1,200]

Publications (46)
Article
Full-text available
JASPAR (http://jaspar.genereg.net) is an open-access database of curated, non-redundant transcription factor (TF)-binding profiles stored as position frequency matrices (PFMs) and TF flexible models (TFFMs) for TFs across multiple species in six taxonomic groups. In the 2018 release of JASPAR, the CORE collection has been expanded with 322 new PFMs...
Preprint
Full-text available
Background: Super-enhancers are clusters of transcriptional enhancers densely occupied by the Mediators, transcription factors and chromatin regulators. They control the expression of cell identity genes and disease associated genes. Current studies demonstrated the possibility of multiple factors with important roles in super-enhancer formation; h...
Article
Full-text available
Super-enhancers and stretch enhancers represent classes of transcriptional enhancers that have been shown to control the expression of cell identity genes and carry disease- and trait-associated variants. Specifically, super-enhancers are clusters of enhancers defined based on the binding occupancy of master transcription factors, chromatin regulat...
Article
Ductal carcinoma in situ (DCIS) is the most common precursor of invasive breast cancer (IBC), with variable propensity for progression. We perform multiscale, integrated molecular profiling of DCIS with clinical outcomes by analyzing 774 DCIS samples from 542 patients with 7.3 years median follow-up from the Translational Breast Cancer Research Con...
Preprint
Full-text available
Aberrant shifts in DNA methylation have long been regarded as an early marker for cancer onset and progression. To chart DNA methylation changes that occur during the transformation from normal healthy colon tissue to malignant colorectal cancer (CRC), we collected over 50 samples from 15 familial adenomatous polyposis (FAP) and non-FAP colorectal...
Preprint
The earliest events during human tumor initiation are poorly characterized but may hold clues as to how to detect and prevent malignancy. Here we model this occult process by engineering TP53 deficiency in primary human gastric organoids and performing experimental evolution in multiple clonally derived cultures over two years, thereby defining cau...
Article
Full-text available
The imminent release of tissue atlases combining multichannel microscopy with single-cell sequencing and other omics data from normal and diseased specimens creates an urgent need for data and metadata standards to guide data deposition, curation and release. We describe a Minimum Information about Highly Multiplexed Tissue Imaging (MITI) standard...
Article
Background. DCIS consists of a molecularly heterogeneous group of premalignant lesions, with variable risk of invasive progression. Understanding biomarkers for invasive progression could help individualize treatment recommendations based upon tumor biology. As part of the NCI Human Tumor Atlas Network (HTAN), we conducted comprehensive genomic ana...
Article
Full-text available
JASPAR (http://jaspar.genereg.net/) is an open-access database containing manually curated, non-redundant transcription factor (TF) binding profiles for TFs across six taxonomic groups. In this 9th release, we expanded the CORE collection with 341 new profiles (148 for plants, 101 for vertebrates, 85 for urochordates, and 7 for insects), which corr...
Article
Full-text available
Background Transcription factors (TFs) bind specifically to TF binding sites (TFBSs) at cis-regulatory regions to control transcription. It is critical to locate these TF-DNA interactions to understand transcriptional regulation. Efforts to predict bona fide TFBSs benefit from the availability of experimental data mapping DNA binding regions of TFs...
Preprint
Full-text available
Familial adenomatous polyposis (FAP) is a genetic disease causing hundreds of premalignant polyps in affected patients, leading to colorectal cancer (CRC), and is an ideal model to study early transition to CRC. We performed deep multi-omic profiling of 135 normal mucosal, benign and dysplastic polyps and adenocarcinoma samples from 6 FAP patients...
Article
Full-text available
Conferences are a pivotal part of the scientific enterprise, but large in-person meetings have several disadvantages. As the pandemic experience has shown, online meetings are a viable alternative. Accelerating efforts to improve conferences in virtual formats can lead to a more equitable and sustainable conference culture.
Article
Full-text available
Since the Black Lives Matter movement rose to mainstream prominence, the academic enterprise has started recognizing the systematic racism present in science. However, there have been relatively few efforts to make sure that the language used to communicate science is inclusive. Here, I quantify the number of research articles published between 200...
Preprint
Full-text available
Transcription factors (TFs) bind specifically to TF binding sites (TFBSs) at cis-regulatory regions to control transcription. Hence, it is critical to locate these TF-DNA interactions to understand transcriptional regulation. The availability of datasets generated by chromatin immunoprecipitation followed by sequencing (ChIP-seq) empowers our effor...
Article
Full-text available
Motivation: Accurate motif enrichment analyses depend on the choice of background DNA sequences used, which should ideally match the sequence composition of the foreground sequences. It is important to avoid false positive enrichment due to sequence biases in the genome, such as GC-bias. Therefore, relying on an appropriate set of background seque...
Preprint
Full-text available
Scientific conferences provide valuable opportunities for researchers across career stages and disciplines to present their latest work and to network with their peers. The advent of the internet has opened new possibilities for interaction, collaboration and networking, yet the uptake of tools enabling remote participation at scientific meetings h...
Article
Crucial transitions in cancer—including tumor initiation, local expansion, metastasis, and therapeutic resistance—involve complex interactions between cells within the dynamic tumor ecosystem. Transformative single-cell genomics technologies and spatial multiplex in situ methods now provide an opportunity to interrogate this complexity at unprecede...
Article
Full-text available
JASPAR (http://jaspar.genereg.net) is an open-access database of curated, non-redundant transcription factor (TF)-binding profiles stored as position frequency matrices (PFMs) for TFs across multiple species in six taxonomic groups. In this 8th release of JASPAR, the CORE collection has been expanded with 245 new PFMs (169 for vertebrates, 42 for p...
Article
RNA-binding proteins (RBPs) regulate post-transcriptional gene expression by recognizing short and degenerate sequence motifs in their target transcripts, but precisely defining their binding specificity remains challenging. Crosslinking and immunoprecipitation (CLIP) allows for mapping of the exact protein-RNA crosslink sites, which frequently res...
Article
Full-text available
O-GlcNAc transferase (OGT) is overexpressed in aggressive prostate cancer. OGT modifies intra-cellular proteins via single sugar conjugation (O-GlcNAcylation) to alter their activity. We recently discovered the first fast-acting OGT inhibitor OSMI-2. Here, we probe the stability and function of the chromatin O-GlcNAc and identify transcription fact...
Article
Full-text available
Super-enhancers (SEs) are clusters of transcriptional enhancers which control the expression of cell identity and disease-associated genes. Current studies demonstrated the role of multiple factors in SE formation; however, a systematic analysis to assess the relative predictive importance of chromatin and sequence features of SEs and their constit...
Preprint
Full-text available
RNA-binding proteins (RBPs) regulate post-transcriptional gene expression by recognizing short and degenerate sequence elements in their target transcripts. Despite the expanding list of RBPs with in vivo binding sites mapped genomewide using crosslinking and immunoprecipitation (CLIP), defining precise RBP binding specificity remains challenging....
Preprint
Full-text available
Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is the most popular assay to identify genomic regions, called ChIP-seq peaks, that are bound in vivo by transcription factors (TFs). These regions are derived from direct TF-DNA interactions, indirect binding of the TF to the DNA (through a co-binding partner), nonspecific binding to t...
Article
Full-text available
GIVE is a framework and library for creating portable and personalized genome browsers. It makes visualizing genomic data as easy as building a laboratory homepage.
Preprint
Full-text available
Background Super-enhancers and stretch enhancers represent classes of transcriptional enhancers that have been shown to control the expression of cell identity genes and carry disease- and trait-associated variants. Specifically, super-enhancers are clusters of enhancers defined based on the binding occupancy of master transcription factors (TFs),...
Article
Scientists spend considerable effort reformatting their research articles before each new submission. A widely adopted format‐free submission process would save them valuable time to do research instead.
Article
JASPAR is a widely used open-access database of curated, non-redundant transcription factor binding profiles. Currently, data from JASPAR can be retrieved as flat files or by using programming language-specific interfaces. Here, we present a programming language-independent application programming interface (API) to access JASPAR data using the Rep...
Preprint
Full-text available
JASPAR is a widely used open-access database of curated, non-redundant transcription factor binding profiles. Currently, data from JASPAR can be retrieved as flat files or by using programming language-specific interfaces. Here, we present a programming language-independent application programming interface (API) to access JASPAR data using the Rep...
Article
Full-text available
The above paper was corrected to add a reference to its companion paper: Chéneby, J., Gheorghe, M., Artufel, M., Mathelier, A., Ballester, B. (2017) ReMap 2018: An updated atlas of regulatory regions from an integrative analysis of DNA-binding ChIP-seq experiments. Nucleic Acids Res., doi: 10.1093/nar/gkx1092.
Preprint
Full-text available
We present Bioconda (https://bioconda.github.io), a distribution of bioinformatics software for the lightweight, multi-platform and language-agnostic package manager Conda. Currently, Bioconda offers a collection of over 3000 software packages, which is continuously maintained, updated, and extended by a growing global community of more than 200 co...
Poster
Full-text available
A common task for scientists relies on comparing lists of genes or genomic regions derived from high-throughput sequencing experiments. While several tools exist to intersect and visualize sets of genes, similar tools dedicated to the visualization of genomic region sets are currently limited. To address this gap, we have developed the Intervene to...
Article
Full-text available
Background A common task for scientists relies on comparing lists of genes or genomic regions derived from high-throughput sequencing experiments. While several tools exist to intersect and visualize sets of genes, similar tools dedicated to the visualization of genomic region sets are currently limited. Results To address this gap, we have develo...
Preprint
Full-text available
Background A common task for scientists relies on comparing lists of genes or genomic regions derived from high-throughput sequencing experiments. While several tools exist to intersect and visualize sets of genes, similar tools dedicated to the visualization of genomic region sets are currently limited. Results To address this gap, we have develo...
Article
Full-text available
Super-enhancers are clusters of transcriptional enhancers that drive cell-type-specific gene expression and are crucial to cell identity. Many disease-associated sequence variations are enriched in super-enhancer regions of disease-relevant cell types. Thus, super-enhancers can be used as potential biomarkers for disease diagnosis and therapeutics....
Poster
Full-text available
Enhancers are cis-regulatory elements of DNA that enhance the transcription of target genes and play a key role in development and diseases. Recently, a new class of enhancers named “super-enhancers” has been discovered, which refers to large clusters of transcriptional enhancers that can drive cell-type-specific gene expression and are crucial in...
Article
Full-text available
This study investigates a new method for adaptive watermark strength optimization in the Discrete Cosine Transform (DCT) domain. Watermarking is a method to embed secret information in the host medium. Optimal strength selection for the watermark is the most critical aspect of the whole process and has attracted many researchers in the recent past. In...
