Praveen Kumar

AstraZeneca | AZ · Discovery Sciences

Doctor of Philosophy

About

54
Publications
7,652
Reads
2,638
Citations
Research Experience
August 2020 - present
AstraZeneca
Position
  • Senior Bioinformatician - Proteomics
December 2019 - August 2020
University of Minnesota Twin Cities
Position
  • Post-Doctoral Associate
Description
  • Tim Griffin's Lab; Galaxy-P Team
September 2016 - November 2019
University of Minnesota
Position
  • Research Assistant
Description
  • Tim Griffin's Lab; Galaxy-P Team
Education
September 2013 - November 2019
University of Minnesota Twin Cities
Field of study
  • Bioinformatics and Computational Biology
September 2013 - August 2016
University of Minnesota Twin Cities
Field of study
  • Bioinformatics and Computational Biology
July 2007 - May 2009
Pondicherry University
Field of study
  • Bioinformatics

Publications

Article
The Earth Microbiome Project (EMP) aided in understanding the role of microbial communities and the influence of collective genetic material (the ‘microbiome’) and microbial diversity patterns across the habitats of our planet. With the evolution of new sequencing technologies, researchers can now investigate the microbiome and map its influence on...
Article
metaQuantome is a software suite that enables the quantitative analysis, statistical evaluation, and visualization of mass-spectrometry-based metaproteomics data. In the latest update of this software, we have provided several extensions, including a step-by-step training guide, the ability to perform statistical analysis on samples from multiple c...
Article
The Human Microbiome Project (HMP) aided in understanding the role of microbial communities and the influence of collective genetic material (the ‘microbiome’) in human health and disease. With the evolution of new sequencing technologies, researchers can now investigate the microbiome and map its influence on human health. Advances in bioinformati...
Article
Full-text available
To gain a thorough appreciation of microbiome dynamics, researchers characterize the functional relevance of expressed microbial genes or proteins. This can be accomplished through metaproteomics, which characterizes the protein expression of microbiomes. Several software tools exist for analyzing microbiomes at the functional level by measu...
Article
Full-text available
For mass spectrometry-based peptide and protein quantification, label-free quantification (LFQ) based on precursor mass peak (MS1) intensities is considered reliable due to its dynamic range, reproducibility, and accuracy. LFQ enables peptide-level quantitation, which is useful in proteomics (analyzing peptides carrying post-translational modificat...
Article
Multi-omics approaches focused on mass-spectrometry (MS)-based data, such as metaproteomics, utilize genomic and/or transcriptomic sequencing data to generate a comprehensive protein sequence database. These databases can be very large, containing millions of sequences, which reduces the sensitivity of matching tandem mass spectrometry (MS/MS) data...
Preprint
Full-text available
For mass spectrometry-based peptide and protein quantification, label-free quantification (LFQ) based on precursor mass peak (MS1) intensities is considered reliable due to its dynamic range, reproducibility, and accuracy. In LFQ workflows, protein abundance changes are inferred from peptide-level information, including microbial peptides (for meta...
Article
Full-text available
Background: Proteogenomics integrates genomics, transcriptomics, and mass spectrometry (MS)-based proteomics data to identify novel protein sequences arising from gene and transcript sequence variants. Proteogenomic data analysis requires integration of disparate ‘omic software tools, as well as customized tools to view and interpret results. The fl...
Preprint
Full-text available
To gain a thorough appreciation of microbiome dynamics, researchers characterize the functional role of expressed microbial genes/proteins. This can be accomplished through metaproteomics, which characterizes the protein complement of the microbiome. Several software tools exist for analyzing microbiomes at the functional level by measuring their c...
Article
Workflows for large-scale (MS)-based shotgun proteomics can potentially lead to costly errors in the form of incorrect peptide spectrum matches (PSMs). To improve robustness of these workflows, we have investigated the use of the precursor mass discrepancy (PMD) to detect and filter potentially false PSMs that have, nonetheless, a high confidence s...
Preprint
Background: Proteogenomics integrates genomics, transcriptomics and mass spectrometry (MS)-based proteomics data to identify novel protein sequences arising from gene and transcript sequence variants. Proteogenomic data analysis requires integration of disparate omic software tools, as well as customized tools to view and interpret results. The fle...
Preprint
Full-text available
Multi-omics approaches focused on mass-spectrometry (MS)-based data, such as metaproteomics, utilize genomic and/or transcriptomic sequencing data to generate a comprehensive protein sequence database. These databases can be very large, containing millions of sequences, which reduces the sensitivity of matching tandem mass spectrometry (MS/MS) data...
Preprint
Full-text available
Workflows for large-scale (MS)-based shotgun proteomics can potentially lead to costly errors in the form of incorrect peptide spectrum matches (PSMs). To improve robustness of these workflows, we have investigated the use of the precursor mass discrepancy (PMD) to detect and filter potentially false PSMs that have, nonetheless, a high confidence s...
Article
Full-text available
Microbiome research offers promising insights into the impact of microorganisms on biological systems. Metaproteomics, the study of microbial proteins at the community level, integrates genomic, transcriptomic, and proteomic data to determine the taxonomic and functional state of a microbiome. However, standard metaproteomics software is subject to...
Poster
Full-text available
The effect of microbiota on human health, disease and environment has been demonstrated through metagenomics and metaproteomics research. Metaproteomics is capable of analyzing the proteins expressed by microorganisms and provides information regarding the functions of the individual community members. While it is important to identify proteins, fu...
Chapter
Affinity proteomics (AP-MS) is growing in importance for characterizing protein-protein interactions (PPIs) in the form of protein complexes and signaling networks. The AP-MS approach necessitates several different software tools, integrated into reproducible and accessible workflows. However, if the scientist (e.g., a bench biologist) lacks a comp...
Article
Full-text available
Galaxy provides an accessible platform where multi-step data analysis workflows integrating disparate software can be run, even by researchers with limited programming expertise. Applications of such sophisticated workflows are many, including those which integrate software from different ‘omic domains (e.g. genomics, proteomics, metabolomics). In...
Article
Next-generation sequencing technologies, coupled with advances in mass spectrometry-based proteomics, have facilitated system-wide quantitative profiling of expressed mRNA transcripts and proteins. Proteo-transcriptomic analysis compares the relative abundance levels of transcripts and their corresponding proteins, illuminating discordant gene prod...
Article
Full-text available
Galaxy provides an accessible platform where multi-step data analysis workflows integrating disparate software can be run, even by researchers with limited programming expertise. Applications of such sophisticated workflows are many, including those which integrate software from different ‘omic domains (e.g. genomics, proteomics, metabolomics). In...
Article
The chromosome-centric human proteome project (C-HPP) seeks to comprehensively characterize all protein products coded by the genome, including those expressed sequence variants confirmed via proteogenomics methods. The closely related biology and disease human proteome project (B/D-HPP) seeks to understand the biological and pathological associati...
Poster
Full-text available
Protein/peptide-level quantification (either labeled or label-free) is routinely used in shotgun proteomics data analysis for determining the abundance of proteins in a given sample. However, accurate, rapid and robust label-free quantification is still a major challenge in the field of quantitative proteomics. Label-free quantification (LFQ) based...
Poster
Full-text available
Mass spectrometry (MS)-based quantitative proteomics provides information regarding protein expression and abundance in a given sample. Protein/peptide-level quantitation (either labeled or label-free) is routinely used in analysis of shotgun proteomics data. For multi-omics studies such as proteogenomics and metaproteomics, peptide-detection and...
Article
Full-text available
The impact of microbial communities, also known as the microbiome, on human health and the environment is receiving increased attention. Studying translated gene products (proteins) and comparing metaproteomic profiles may elucidate how microbiomes respond to specific environmental stimuli, and interact with host organisms. Characterizing proteins...
Article
Full-text available
Proteogenomics has emerged as a valuable approach in cancer research, which integrates genomic and transcriptomic data with mass spectrometry-based proteomics data to directly identify expressed, variant protein sequences that may have functional roles in cancer. This approach is computationally intensive, requiring integration of disparate softwar...
Article
Full-text available
Cellular function and diversity are orchestrated by complex interactions of fundamental biomolecules including DNA, RNA and proteins. Technological advances in genomics, epigenomics, transcriptomics and proteomics have enabled massively parallel and unbiased measurements. Such high-throughput technologies have been extensively used to carry out bro...
Article
Pichia pastoris is a widely used eukaryotic host for production of recombinant proteins. We performed a proteogenomic analysis using high resolution Fourier transform mass spectrometry to characterize the proteome of the GS115 strain. Our analysis resulted in identification of 46889 unique peptides mapping to 3914 unique protein groups, which corre...
Article
Full-text available
Accurate annotation of protein-coding genes is one of the primary tasks upon completion of whole genome sequencing of any organism. In this study, we used an integrated transcriptomic and proteomic strategy to validate and improve the existing zebrafish genome annotation. We undertook high resolution mass spectrometry-based proteomic profiling of t...
Article
Full-text available
The PIK3CA gene is frequently mutated in human cancers. Here we carry out a SILAC-based quantitative phosphoproteomic analysis using isogenic knockin cell lines containing 'driver' oncogenic mutations of PIK3CA to dissect the signalling mechanisms responsible for oncogenic phenotypes induced by mutant PIK3CA. From 8,075 unique phosphopeptides ident...
Article
Full-text available
miRNAs regulate gene expression by binding to cognate mRNAs causing mRNA degradation or translational repression. Mass spectrometry-based proteomic analysis is being widely used to identify miRNA targets. The miR-200b miRNA cluster is often overexpressed in multiple cancer types, but the identity of the targets remains elusive. Using SILAC-based an...
Article
Full-text available
Anopheles gambiae has a well-adapted system for host localization, feeding, and mating behavior, which are all governed by neuronal processes in the brain. However, there are no published reports characterizing the brain proteome to elucidate neuronal signaling mechanisms in the vector. To this end, a large-scale mapping of the brain prote...
Article
Full-text available
Among the neglected tropical diseases, leishmaniasis is one of the most devastating, resulting in significant mortality and contributing to nearly 2 million disability-adjusted life years. Cutaneous leishmaniasis is a debilitating disorder caused by the kinetoplastid protozoan parasite Leishmania major, which results in disfiguration and s...
Article
Full-text available
The availability of human genome sequence has transformed biomedical research over the past decade. However, an equivalent map for the human proteome with direct measurements of proteins and peptides does not exist yet. Here we present a draft map of the human proteome using high-resolution Fourier-transform mass spectrometry. In-depth proteomic pr...
Article
Full-text available
Cryptococcus neoformans, a basidiomycetous fungus of universal occurrence, is a significant opportunistic human pathogen causing meningitis. Owing to an increase in the number of immunosuppressed individuals along with emergence of drug-resistant strains, C. neoformans is gaining importance as a pathogen. Although whole genome sequencing of three...
Article
Gastric cancer is a commonly occurring cancer in Asia and one of the leading causes of cancer deaths. However, there is no reliable blood-based screening test for this cancer. Identifying proteins secreted from tumor cells could lead to the discovery of clinically useful biomarkers for early detection of gastric cancer. A SILAC-based quantitative p...
Article
The kinetoplastid protozoan parasite, Leishmania donovani, is the causative agent of kala azar or visceral leishmaniasis. Kala azar is a severe form of leishmaniasis that is fatal in the majority of untreated cases. Studies on proteomic analysis of L. donovani thus far have been carried out using homology-based identification based on...
Article
Full-text available
Introduction: Rabies is a fatal acute viral disease of the central nervous system, which is a serious public health problem in Asian and African countries. Based on the clinical presentation, rabies can be classified into encephalitic (furious) or paralytic (numb) rabies. Early diagnosis of this disease is particularly important as rabies is invaria...
Data
Differentially expressed proteins identified in rabies. Description of data: List of 94 proteins that are differentially expressed in rabies compared to normal brain tissues.
Data
Functional analysis of the proteins by ingenuity pathway analysis. Description of data: Additional file 3: Table S3 provides functional annotations of identified proteins as well as p-value with FDR correction.
Data
Combined list of peptides for proteins identified using Spectrum Mill and Mascot. Description of data: A list of 402 peptides and their fold changes compared to normal are summarized in Additional file 2: Table S2.
Conference Paper
Full-text available
MicroRNAs (miRNAs) are small non-coding RNAs that regulate gene expression and protein synthesis. To characterize functions of miRNAs and to assess their potential applications, we carried out an integrated multi-omics analysis to study miR-145, a miRNA that has been shown to suppress tumor growth. We employed gene expression profiling, miRNA profi...
Article
Full-text available
Introduction: Tuberculous meningitis is a frequent extrapulmonary disease caused by Mycobacterium tuberculosis and is associated with high mortality rates and severe neurological sequelae. In an earlier study employing DNA microarrays, we had identified genes that were differentially expressed at the transcript level in human brain tissue from cases...
Data
Table S2. A complete list of proteins identified in TBM.
Data
Table S1. List of TBM and control samples used in the present study.
Data
Table S3. A complete list of peptides identified in TBM.
Article
PURPOSE: Gastric cancer is a commonly occurring cancer in Asia and one of the leading causes of cancer deaths. However, there is no reliable blood-based screening test for this cancer. Identifying proteins secreted from tumor cells could lead to the discovery of clinically useful biomarkers for early detection of gastric cancer. EXPERIMENTAL DESIGN...
Article
Mangifera indica (Mango) is an important fruit crop in tropical countries with India being the leading producer in the world. Substantial research efforts are being devoted to produce fruit that have desirable characteristics including those that pertain to taste, hardiness and resistance to pests. Characterization of the genome and proteome of man...
Article
Full-text available
The genome sequencing of H37Rv strain of Mycobacterium tuberculosis was completed in 1998 followed by the whole genome sequencing of a clinical isolate, CDC1551 in 2002. Since then, the genomic sequences of a number of other strains have become available making it one of the better studied pathogenic bacterial species at the genomic level. However,...
Article
Candida glabrata is a common opportunistic human pathogen leading to significant mortality in immunosuppressed and immunodeficient individuals. We carried out proteomic analysis of C. glabrata using high resolution Fourier transform mass spectrometry with MS resolution of 60,000 and MS/MS resolution of 7500. On the basis of 32,453 unique peptides i...
Article
Full-text available
The genome sequencing of H37Rv strain of Mycobacterium tuberculosis was completed in 1998 followed by the whole genome sequencing of a clinical isolate, CDC1551 in 2002. Since then, the genomic sequences of a number of other strains have become available making it one of the better studied pathogenic bacterial species at the genomic level. However,...
Article
Oligodendrocytes (OLs) are glial cells of the central nervous system, which produce myelin. Cultured OLs provide immense therapeutic opportunities for treating a variety of neurological conditions. One of the most promising sources for such therapies is human embryonic stem cells (ESCs) as well as providing a model to study human OL development. Fo...
Article
Full-text available
Esophageal squamous cell carcinoma (ESCC) is among the top ten most frequent malignancies worldwide. In this study, our objective was to identify potential biomarkers for ESCC through a quantitative proteomic approach using the isobaric tags for relative and absolute quantitation (iTRAQ) approach. We compared the protein expression profiles of ESCC...
Article
Full-text available
The study of the human urinary proteome has the potential to offer significant insights into normal physiology as well as disease pathology. The information obtained from such studies could be applied to the diagnosis of various diseases. The high sensitivity, resolution, and mass accuracy of the latest generation of mass spectrometers provide an...
