Subina MehtaUniversity of Minnesota Twin Cities | UMN · Department of Biochemistry, Molecular Biology and Biophysics (CBS)
Subina Mehta
Master of Science
About
54
Publications
6,194
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
989
Citations
Introduction
Publications
Publications (54)
Metaproteomics is an increasingly popular methodology that provides information regarding the metabolic functions of specific microbial taxa and has potential for contributing to ocean ecology and biogeochemical studies. A blinded multi-laboratory intercomparison was conducted to assess comparability and reproducibility of taxonomic and functional...
Proteogenomics is a growing “multi-omics” research area that combines mass spectrometry–based proteomics and high-throughput nucleotide sequencing technologies. Proteogenomics has helped in genomic annotation for organisms whose complete genome sequences became available by using high-throughput DNA sequencing technologies. Apart from genome annota...
Background
Obstructive lung disease (OLD) is increasingly prevalent among persons living with HIV (PLWH). However, the role of proteases in HIV-associated OLD remains unclear.
Methods
We combined proteomics and peptidomics to comprehensively characterize protease activities. We combined mass spectrometry (MS) analysis on bronchoalveolar lavage flu...
The upper respiratory tract (URT) is home to a diverse range of microbial species. Respiratory infections disturb the microbial flora in the URT, putting people at risk of secondary infections. The potential dangers and clinical effects of bacterial and fungal coinfections with SARS-CoV-2 support the need to investigate the microbiome of the URT us...
Airway microbiota are known to contribute to lung diseases, such as cystic fibrosis (CF), but their contributions to pathogenesis are still unclear. To improve our understanding of host-microbe interactions, we have developed an integrated analytical and bioinformatic mass spectrometry (MS)-based metaproteomics workflow to analyze clinical bronchoa...
Clinical metaproteomics has the potential to offer insights into the host-microbiome interactions underlying diseases. However, the field faces challenges in characterizing microbial proteins found in clinical samples, usually present at low abundance relative to the host proteins. As a solution, we have developed an integrated workflow coupling ma...
Background: Obstructive lung disease (OLD) is increasingly prevalent among persons living with HIV (PLWH). However, the role of proteases in HIV-associated OLD remains unclear.
Methods: We combined proteomics and peptidomics to comprehensively characterize protease activities. We combined mass spectrometry (MS) analysis on bronchoalveolar lavage fl...
Metaproteomics is an increasingly popular methodology that provides information regarding the metabolic functions of specific microbial taxa and has potential for contributing to ocean ecology and biogeochemical studies. A blinded multi-laboratory intercomparison was conducted to assess comparability and reproducibility of taxonomic and functional...
Clinical metaproteomics has the potential to offer insights into the host-microbiome interactions underlying diseases. However, the field faces challenges in characterizing microbial proteins found in clinical samples, which are usually present at low abundance relative to the host proteins. As a solution, we have developed an integrated workflow c...
Introduction:
Continuous advances in mass spectrometry (MS) technologies have enabled deeper and more reproducible proteome characterization and a better understanding of biological systems when integrated with other 'omics data. Bioinformatic resources meeting the analysis requirements of increasingly complex MS-based proteomic data and associate...
During the COVID-19 pandemic, impaired immunity and medical interventions resulted in cases of secondary infections. The clinical difficulties and dangers associated with secondary infections in patients necessitate the exploration of their microbiome. Metaproteomics is a powerful approach to study the taxonomic composition and functional status of...
Background
‘Omics methods have empowered scientists to tackle the complexity of microbial communities on a scale not attainable before. Individually, omics analyses can provide great insight; while combined as “meta-omics”, they enhance the understanding of which organisms occupy specific metabolic niches, how they interact, and how they utilize en...
Background:
Clinical bronchoalveolar lavage fluid (BALF) samples are rich in biomolecules, including proteins, and useful for molecular studies of lung health and disease. However, mass spectrometry (MS)-based proteomic analysis of BALF is challenged by the dynamic range of protein abundance, and potential for interfering contaminants. A robust, M...
Respiratory infections disrupt the microbiota in the upper respiratory tract (URT), putting patients at a risk for subsequent infections. During the pandemic, cases of COVID-19 were aggravated by secondary infections because of impaired immunity and medical interventions, which was clearly evident in the second wave of COVID-19 in India. The potent...
There is an ongoing explosion of scientific datasets being generated, brought on by recent technological advances in many areas of the natural sciences. As a result, the life sciences have become increasingly computational in nature, and bioinformatics has taken on a central role in research studies. However, basic computational skills, data analys...
Background
Clinical Bronchoalveolar lavage fluid (BALF) samples are rich in biomolecules, including proteins, and useful for molecular studies of lung health and disease. However, mass spectrometry (MS)-based proteomic analysis of BALF is challenged by the dynamic range of protein abundance, and potential for interfering contaminants. A robust, MS-...
The Coronavirus disease 2019 (COVID-19) pandemic caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) resulted in a major health crisis worldwide with its continuously emerging new strains, resulting in new viral variants that drive “waves” of infection. PCR or antigen detection assays have been routinely used to detect clinic...
There is an ongoing explosion of scientific datasets being generated, brought on by recent technological advances in many areas of the natural sciences. As a result, the life sciences have become increasingly computational in nature, and bioinformatics has taken on a central role in research studies. However, basic computational skills, data analys...
Intestinal proteases mediate digestion and immune signalling, while increased gut proteolytic activity disrupts the intestinal barrier and generates visceral hypersensitivity, which is common in irritable bowel syndrome (IBS). However, the mechanisms controlling protease function are unclear. Here we show that members of the gut microbiota suppress...
Galaxy is a mature, browser accessible workbench for scientific computing. It enables scientists to share, analyze and visualize their own data, with minimal technical impediments. A thriving global community continues to use, maintain and contribute to the project, with support from multiple national infrastructure providers that enable freely acc...
Chronic inflammation of the colon causes genomic and/or transcriptomic events, which can lead to expression of non-canonical protein sequences contributing to oncogenesis. To better understand these mechanisms, Rag2−/−Il10−/− mice were infected with Helicobacter hepaticus to induce chronic inflammation of the cecum and the colon. Transcriptomic dat...
Background
The Coronavirus Disease 2019 (COVID-19) global pandemic has had a profound, lasting impact on the world's population. A key aspect to providing care for those with COVID-19 and checking its further spread is early and accurate diagnosis of infection, which has been generally done via methods for amplifying and detecting viral RNA molecul...
The Earth Microbiome Project (EMP) aided in understanding the role of microbial communities and the influence of collective genetic material (the ‘microbiome’) and microbial diversity patterns across the habitats of our planet. With the evolution of new sequencing technologies, researchers can now investigate the microbiome and map its influence on...
metaQuantome is a software suite that enables the quantitative analysis, statistical evaluation. and visualization of mass-spectrometry-based metaproteomics data. In the latest update of this software, we have provided several extensions, including a step-by-step training guide, the ability to perform statistical analysis on samples from multiple c...
The Coronavirus Disease 2019 (COVID19) global pandemic has had a profound, lasting impact on the world's population. A key aspect to providing care for those with COVID19 and checking its further spread is early and accurate diagnosis of infection, which has been generally done via methods for amplifying and detecting viral RNA molecules. Detection...
The Human Microbiome Project (HMP) aided in understanding the role of microbial communities and the influence of collective genetic material (the ‘microbiome’) in human health and disease. With the evolution of new sequencing technologies, researchers can now investigate the microbiome and map its influence on human health. Advances in bioinformati...
In this Letter, we reanalyze published mass spectrometry data sets of clinical samples with a focus on determining the coinfection status of individuals infected with SARS-CoV-2 coronavirus. We demonstrate the use of ComPIL 2.0 software along with a metaproteomics workflow within the Galaxy platform to detect cohabitating potential pathogens in COV...
To gain a thorough appreciation of microbiome dynamics, researchers characterize the functional relevance of expressed microbial genes or proteins. This can be accomplished through metaproteomics, which characterizes the protein expression of microbiomes. Several software tools exist for analyzing microbiomes at the functional level by measuring th...
For mass spectrometry-based peptide and protein quantification, label-free quantification (LFQ) based on precursor mass peak (MS1) intensities is considered reliable due to its dynamic range, reproducibility, and accuracy. LFQ enables peptide-level quantitation, which is useful in proteomics (analyzing peptides carrying post-translational modificat...
Multi-omics approaches focused on mass-spectrometry (MS)-based data, such as metaproteomics, utilize genomic and/or transcriptomic sequencing data to generate a comprehensive protein sequence database. These databases can be very large, containing millions of sequences, which reduces the sensitivity of matching tandem mass spectrometry (MS/MS) data...
For mass spectrometry-based peptide and protein quantification, label-free quantification (LFQ) based on precursor mass peak (MS1) intensities is considered reliable due to its dynamic range, reproducibility, and accuracy. In LFQ workflows, protein abundance changes are inferred from peptide-level information, including microbial peptides (for meta...
Background
Proteogenomics integrates genomics, transcriptomics, and mass spectrometry (MS)-based proteomics data to identify novel protein sequences arising from gene and transcript sequence variants. Proteogenomic data analysis requires integration of disparate ‘omic software tools, as well as customized tools to view and interpret results. The fl...
To gain a thorough appreciation of microbiome dynamics, researchers characterize the functional role of expressed microbial genes/proteins. This can be accomplished through metaproteomics, which characterizes the protein complement of the microbiome. Several software tools exist for analyzing microbiomes at the functional level by measuring their c...
Workflows for large-scale (MS)-based shotgun proteomics can potentially lead to costly errors in the form of incorrect peptide spectrum matches (PSMs). To improve robustness of these workflows, we have investigated the use of the precursor mass discrepancy (PMD) to detect and filter potentially false PSMs that have, nonetheless, a high confidence s...
Background:
Proteogenomics integrates genomics, transcriptomics and mass spectrometry (MS)-based proteomics data to identify novel protein sequences arising from gene and transcript sequence variants. Proteogenomic data analysis requires integration of disparate omic software tools, as well as customized tools to view and interpret results. The fle...
Multi-omics approaches focused on mass-spectrometry (MS)-based data, such as metaproteomics, utilize genomic and/or transcriptomic sequencing data to generate a comprehensive protein sequence database. These databases can be very large, containing millions of sequences, which reduces the sensitivity of matching tandem mass spectrometry (MS/MS) data...
Workflows for large-scale (MS)-based shotgun proteomics can potentially lead to costly errors in the form of incorrect peptide spectrum matches (PSMs). To improve robustness of these workflows, we have investigated the use of the precursor mass discrepancy (PMD) to detect and filter potentially false PSMs that have, nonetheless, a high confidence s...
Microbiome research offers promising insights into the impact of microorganisms on biological systems. Metaproteomics, the study of microbial proteins at the community level, integrates genomic, transcriptomic, and proteomic data to determine the taxonomic and functional state of a microbiome. However, standard metaproteomics software is subject to...
The effect of microbiota on human health, disease and environment has been demonstrated through metagenomics and metaproteomics research. Metaproteomics is capable of analyzing the proteins expressed by microorganisms and provides information regarding the functions of the individual community members. While it is important to identify proteins, fu...
Affinity proteomics (AP-MS) is growing in importance for characterizing protein-protein interactions (PPIs) in the form of protein complexes and signaling networks. The AP-MS approach necessitates several different software tools, integrated into reproducible and accessible workflows. However, if the scientist (e.g., a bench biologist) lacks a comp...
Galaxy provides an accessible platform where multi-step data analysis workflows integrating disparate software can be run, even by researchers with limited programming expertise. Applications of such sophisticated workflows are many, including those which integrate software from different ‘omic domains (e.g. genomics, proteomics, metabolomics). In...
Next-generation sequencing technologies, coupled with advances in mass spectrometry-based proteomics, have facilitated system-wide quantitative profiling of expressed mRNA transcripts and proteins. Proteo-transcriptomic analysis compares the relative abundance levels of transcripts and their corresponding proteins, illuminating discordant gene prod...
moFF is a modular and operating system independent tool for quantitative analysis of label-free mass-spectrometry based proteomics data. The moFF workflow, comprising matching-between-runs and apex quantification, can be applied to any upstream search engine’s output, along with the corresponding Thermo or mzML raw file. We here present moFF 2.0, w...
Galaxy provides an accessible platform where multi-step data analysis workflows integrating disparate software can be run, even by researchers with limited programming expertise. Applications of such sophisticated workflows are many, including those which integrate software from different ‘omic domains (e.g. genomics, proteomics, metabolomics). In...
The chromosome-centric human proteome project (C-HPP) seeks to comprehensively characterize all protein products coded by the genome, including those expressed sequence variants confirmed via proteogenomics methods. The closely related biology and disease human proteome project (B/D-HPP) seeks to understand the biological and pathological associati...
Protein/peptide-level quantification (either labeled or label-free) is routinely used in shotgun proteomics data analysis for determining the abundance of proteins in a given sample. However, accurate, rapid and robust label-free quantification is still a major challenge in the field of quantitative proteomics. Label-free quantification (LFQ) based...
Mass Spectrometry (MS) based quantitative proteomics provides information regarding protein expression and abundance in a given sample. Protein / Peptide level quantitation (either labeled or label-free) is routinely used in analysis of shotgun proteomics data. For multi-omics studies such as proteogenomics and metaproteomics, peptide-detection and...
The impact of microbial communities, also known as the microbiome, on human health and the environment is receiving increased attention. Studying translated gene products (proteins) and comparing metaproteomic profiles may elucidate how microbiomes respond to specific environmental stimuli, and interact with host organisms. Characterizing proteins...
Background: Noroviruses (NoV) from genogroup II, genotype 4 (GII.4) strains have historically caused the majority of NoV outbreaks worldwide and are associated with increased hospitalization and mortality rates relative to other NoV genotypes. The emergence of the NoV GII.4 Sydney 2012 variant strain, first identified in Sydney Australia in March 2...