Yupeng Li

Yupeng Li
Independent Researcher

About

19
Publications
4,568
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
2,308
Citations

Publications

Publications (19)
Article
Full-text available
Many fields, including Natural Language Processing (NLP), have recently witnessed the benefit of pre-training with large generic datasets to improve the accuracy of prediction tasks. However, there exist key differences between the longitudinal healthcare data (e.g., claims) and NLP tasks, which makes direct application of NLP pre-training methods...
Article
Full-text available
The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) provides a unified model to integrate disparate real-world data (RWD) sources. An integral part of the OMOP CDM is the Standardized Vocabularies (henceforth referred to as the OMOP vocabulary), which enables organization and standardization of medical concepts across vari...
Article
Full-text available
The availability of high-quality RNA-sequencing and genotyping data of post-mortem brain collections from consortia such as CommonMind Consortium (CMC) and the Accelerating Medicines Partnership for Alzheimer’s Disease (AMP-AD) Consortium enable the generation of a large-scale brain cis-eQTL meta-analysis. Here we generate cerebral cortical eQTL fr...
Article
Full-text available
Systemic lupus erythematosus (SLE) is a chronic, remitting, and relapsing, inflammatory disease involving multiple organs, which exhibits abnormalities of both the innate and adaptive immune responses. A limited number of transcriptomic studies have characterized the gene pathways involved in SLE in an attempt to identify the key pathogenic drivers...
Article
2020, The Author(s). The availability of high-quality RNA-sequencing and genotyping data of post-mortem brain collections from consortia such as CommonMind Consortium (CMC) and the Accelerating Medicines Partnership for Alzheimer’s Disease (AMP-AD) Consortium enable the generation of a large-scale brain cis-eQTL meta-analysis. Here we generate cere...
Article
Full-text available
Background: Activation of microglia, the resident immune cells of the central nervous system, is a prominent pathological hallmark of Alzheimer's disease (AD). However, the gene expression changes underlying microglia activation in response to tau pathology remain elusive. Furthermore, it is not clear how murine gene expression changes relate to h...
Article
Full-text available
Background The Legumes (Fabaceae) are an economically and ecologically important group of plant species with the conspicuous capacity for symbiotic nitrogen fixation in root nodules, specialized plant organs containing symbiotic microbes. With the aim of understanding the underlying molecular mechanisms leading to nodulation, many efforts are under...
Article
Full-text available
Background Terminal repeat retrotransposons in miniature (TRIMs) are a unique group of small long terminal repeat retrotransposons that are difficult to identify. Thus far, only a few TRIMs have been characterized in the euphyllophytes, and their evolutionary and biological significance as well as their transposition mechanisms are poorly understoo...
Article
Even though vast amounts of genome-wide gene expression data have become available in plants, it remains a challenge to effectively mine this information for the discovery of genes and gene networks, for instance those that control agronomically important traits. These networks reflect potential interactions among genes and, therefore, can lead to...
Article
Full-text available
With the development of high-throughput genomic technologies, large, genome-wide datasets have been collected and integration of these datasets should provide large-scale, multi-dimensional and insightful views of biological systems. We developed a method for gene association network construction based on gene expression data that integrates a vari...
Preprint
Full-text available
Terminal-repeat retrotransposons in miniature (TRIMs) are structurally similar to long terminal repeat (LTR) retrotransposons except that they are extremely small and difficult to identify. Thus far, only a few TRIMs have been characterized in the euphyllophytes and the evolutionary and biological impacts and transposition mechanism of TRIMs are po...
Article
Legume plants regulate the number of nitrogen-fixing root nodules they form via a process called the Autoregulation of Nodulation (AON). Despite being one of the most economically important and abundantly consumed legumes, little is known about the AON pathway of common bean (Phaseolus vulgaris). We used comparative- and functional-genomic approach...
Article
Full-text available
Key message: The Co - x anthracnose R gene of common bean was fine-mapped into a 58 kb region at one end of chromosome 1, where no canonical NB-LRR-encoding genes are present in G19833 genome sequence. Anthracnose, caused by the phytopathogenic fungus Colletotrichum lindemuthianum, is one of the most damaging diseases of common bean, Phaseolus vul...
Article
In higher eukaryotes, centromeres are typically composed of megabase-sized array of satellite repeats that evolve rapidly and homogenize within a species' genome. Despite the importance of centromeres, our knowledge is limited to a few model species. We conducted comprehensive analysis of common bean (Phaseolus vulgaris) centromeric satellite DNA u...
Article
A number of next-generation sequencing (NGS) technologies such as Roche/454, Illumina and AB SOLiD have recently become available. These technologies are capable of generating hundreds of thousands or tens of millions of short DNA sequence reads at a relatively low cost. These NGS technologies, now referred as second-generation sequencing (SGS) tec...
Article
Full-text available
Legumes and many nonleguminous plants enter symbiotic interactions with microbes, and it is poorly understood how host plants respond to promote beneficial, symbiotic microbial interactions while suppressing those that are deleterious or pathogenic. Trans-acting siRNAs (tasiRNAs) negatively regulate target transcripts and are characterized by siRNA...
Article
Full-text available
Pigeonpea is an important legume food crop grown primarily by smallholder farmers in many semi-arid tropical regions of the world. We used the Illumina next-generation sequencing platform to generate 237.2 Gb of sequence, which along with Sanger-based bacterial artificial chromosome end sequences and a genetic map, we assembled into scaffolds repre...
Conference Paper
Anthracnose, caused by the phytopathogenic fungus Colletotrichum lindemuthianum, is one of the most important diseases of common bean, Phaseolus vulgaris. Various specific resistance (R) genes, named Co-, and conferring race-specific resistance to different strains of C. lindemuthianum have been identified. The Andean cultivar JaloEEP558 was report...
Conference Paper
Common bean (Phaseolus vulgaris) is an agronomically important legume especially in developing countries. Common bean is very diverse, consisting of two major geographically distinct gene pools, Andean and Mesoamerican, each with three or four races. Currently, the genome sequence of Andean domesticated accession G19833 is in progress, thus enablin...

Network

Cited By