About
19
Publications
4,568
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
2,308
Citations
Publications
Publications (19)
Many fields, including Natural Language Processing (NLP), have recently witnessed the benefit of pre-training with large generic datasets to improve the accuracy of prediction tasks. However, there exist key differences between the longitudinal healthcare data (e.g., claims) and NLP tasks, which makes direct application of NLP pre-training methods...
The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) provides a unified model to integrate disparate real-world data (RWD) sources. An integral part of the OMOP CDM is the Standardized Vocabularies (henceforth referred to as the OMOP vocabulary), which enables organization and standardization of medical concepts across vari...
The availability of high-quality RNA-sequencing and genotyping data of post-mortem brain collections from consortia such as CommonMind Consortium (CMC) and the Accelerating Medicines Partnership for Alzheimer’s Disease (AMP-AD) Consortium enable the generation of a large-scale brain cis-eQTL meta-analysis. Here we generate cerebral cortical eQTL fr...
Systemic lupus erythematosus (SLE) is a chronic, remitting, and relapsing, inflammatory disease involving multiple organs, which exhibits abnormalities of both the innate and adaptive immune responses. A limited number of transcriptomic studies have characterized the gene pathways involved in SLE in an attempt to identify the key pathogenic drivers...
2020, The Author(s). The availability of high-quality RNA-sequencing and genotyping data of post-mortem brain collections from consortia such as CommonMind Consortium (CMC) and the Accelerating Medicines Partnership for Alzheimer’s Disease (AMP-AD) Consortium enable the generation of a large-scale brain cis-eQTL meta-analysis. Here we generate cere...
Background:
Activation of microglia, the resident immune cells of the central nervous system, is a prominent pathological hallmark of Alzheimer's disease (AD). However, the gene expression changes underlying microglia activation in response to tau pathology remain elusive. Furthermore, it is not clear how murine gene expression changes relate to h...
Background
The Legumes (Fabaceae) are an economically and ecologically important group of plant species with the conspicuous capacity for symbiotic nitrogen fixation in root nodules, specialized plant organs containing symbiotic microbes. With the aim of understanding the underlying molecular mechanisms leading to nodulation, many efforts are under...
Background
Terminal repeat retrotransposons in miniature (TRIMs) are a unique group of small long terminal repeat retrotransposons that are difficult to identify. Thus far, only a few TRIMs have been characterized in the euphyllophytes, and their evolutionary and biological significance as well as their transposition mechanisms are poorly understoo...
Even though vast amounts of genome-wide gene expression data have become available in plants, it remains a challenge to effectively mine this information for the discovery of genes and gene networks, for instance those that control agronomically important traits. These networks reflect potential interactions among genes and, therefore, can lead to...
With the development of high-throughput genomic technologies, large, genome-wide datasets have been collected and integration of these datasets should provide large-scale, multi-dimensional and insightful views of biological systems. We developed a method for gene association network construction based on gene expression data that integrates a vari...
Terminal-repeat retrotransposons in miniature (TRIMs) are structurally similar to long terminal repeat (LTR) retrotransposons except that they are extremely small and difficult to identify. Thus far, only a few TRIMs have been characterized in the euphyllophytes and the evolutionary and biological impacts and transposition mechanism of TRIMs are po...
Legume plants regulate the number of nitrogen-fixing root nodules they form via a process called the Autoregulation of Nodulation (AON). Despite being one of the most economically important and abundantly consumed legumes, little is known about the AON pathway of common bean (Phaseolus vulgaris). We used comparative- and functional-genomic approach...
Key message:
The Co - x anthracnose R gene of common bean was fine-mapped into a 58 kb region at one end of chromosome 1, where no canonical NB-LRR-encoding genes are present in G19833 genome sequence. Anthracnose, caused by the phytopathogenic fungus Colletotrichum lindemuthianum, is one of the most damaging diseases of common bean, Phaseolus vul...
In higher eukaryotes, centromeres are typically composed of megabase-sized array of satellite repeats that evolve rapidly and homogenize within a species' genome. Despite the importance of centromeres, our knowledge is limited to a few model species. We conducted comprehensive analysis of common bean (Phaseolus vulgaris) centromeric satellite DNA u...
A number of next-generation sequencing (NGS) technologies such as Roche/454, Illumina and AB SOLiD have recently become available. These technologies are capable of generating hundreds of thousands or tens of millions of short DNA sequence reads at a relatively low cost. These NGS technologies, now referred as second-generation sequencing (SGS) tec...
Legumes and many nonleguminous plants enter symbiotic interactions with microbes, and it is poorly understood how host plants respond to promote beneficial, symbiotic microbial interactions while suppressing those that are deleterious or pathogenic. Trans-acting siRNAs (tasiRNAs) negatively regulate target transcripts and are characterized by siRNA...
Pigeonpea is an important legume food crop grown primarily by smallholder farmers in many semi-arid tropical regions of the world. We used the Illumina next-generation sequencing platform to generate 237.2 Gb of sequence, which along with Sanger-based bacterial artificial chromosome end sequences and a genetic map, we assembled into scaffolds repre...
Anthracnose, caused by the phytopathogenic fungus Colletotrichum lindemuthianum, is one of the most important diseases of common bean, Phaseolus vulgaris. Various specific resistance (R) genes, named Co-, and conferring race-specific resistance to different strains of C. lindemuthianum have been identified. The Andean cultivar JaloEEP558 was report...
Common bean (Phaseolus vulgaris) is an agronomically important legume especially in developing countries. Common bean is very diverse, consisting of two major geographically distinct gene pools, Andean and Mesoamerican, each with three or four races. Currently, the genome sequence of Andean domesticated accession G19833 is in progress, thus enablin...