Robi D Mitra

Washington University in St. Louis, Saint Louis, MO, USA

Are you Robi D Mitra?

Claim your profile

Publications (26)288.87 Total impact

  • Article: Ultralow protein adsorbing coatings from clickable PEG nanogel solutions: Benefits of attachment under salt-induced phase separation conditions and comparison with PEG/albumin nanogel coatings.
    [show abstract] [hide abstract]
    ABSTRACT: Clickable nanogel solutions were synthesized by using the copper catalyzed azide/alkyne cycloaddition (CuAAC) to partially polymerize solutions of azide and alkyne functionalized poly(ethylene glycol) (PEG) monomers. Coatings were fabricated using a second click reaction: a UV thiol-yne attachment of the nanogel solutions to mercaptosilanated glass. Because the CuAAC reaction was effectively halted by the addition of a copper-chelator, we were able to prevent bulk gelation and limit the coating thickness to a single monolayer of nanogels in the absence of the solution reaction. This enabled the inclusion of kosmotropic salts, which caused the PEG to phase-separate and nearly double the nanogel packing density, as confirmed by Quartz Crystal Microbalance with Dissipation (QCM-D). Protein adsorption was analyzed by single molecule counting with total internal reflection fluorescence (TIRF) microscopy and cell adhesion assays. Coatings formed from the phase-separated clickable nanogel solutions attached with salt adsorbed significantly less fibrinogen than other 100% PEG coatings tested, as well as poly-L-lysine-g-PEG (PLL-g-PEG) coatings. However, PEG/albumin nanogel coatings still outperformed the best 100% PEG clickable nanogel coatings. Additional surface crosslinking of the clickable nanogel coating in the presence of copper further reduced levels of fibrinogen adsorption closer to those of PEG/albumin nanogel coatings. However, this step negatively impacted long-term resistance to cell adhesion and dramatically altered the morphology of the coating by atomic force microscopy (AFM). The main benefit of the click strategy is that the partially polymerized solutions are stable almost indefinitely, allowing attachment in the phase-separated state without danger of bulk gelation, and thus, producing the best performing 100% PEG coating that we have studied to date.
    Langmuir 02/2013; · 4.19 Impact Factor
  • Article: Single ABCA3 Mutations Increase Risk for Neonatal Respiratory Distress Syndrome.
    [show abstract] [hide abstract]
    ABSTRACT: BACKGROUND AND OBJECTIVE:Neonatal respiratory distress syndrome (RDS) due to pulmonary surfactant deficiency is heritable, but common variants do not fully explain disease heritability.METHODS:Using next-generation, pooled sequencing of race-stratified DNA samples from infants ≥34 weeks' gestation with and without RDS (n = 513) and from a Missouri population-based cohort (n = 1066), we scanned all exons of 5 surfactant-associated genes and used in silico algorithms to identify functional mutations. We validated each mutation with an independent genotyping platform and compared race-stratified, collapsed frequencies of rare mutations by gene to investigate disease associations and estimate attributable risk.RESULTS:Single ABCA3 mutations were overrepresented among European-descent RDS infants (14.3% of RDS vs 3.7% of non-RDS; P = .002) but were not statistically overrepresented among African-descent RDS infants (4.5% of RDS vs 1.5% of non-RDS; P = .23). In the Missouri population-based cohort, 3.6% of European-descent and 1.5% of African-descent infants carried a single ABCA3 mutation. We found no mutations among the RDS infants and no evidence of contribution to population-based disease burden for SFTPC, CHPT1, LPCAT1, or PCYT1B.CONCLUSIONS:In contrast to lethal neonatal RDS resulting from homozygous or compound heterozygous ABCA3 mutations, single ABCA3 mutations are overrepresented among European-descent infants ≥34 weeks' gestation with RDS and account for ∼10.9% of the attributable risk among term and late preterm infants. Although ABCA3 mutations are individually rare, they are collectively common among European- and African-descent individuals in the general population.
    PEDIATRICS 11/2012; · 4.47 Impact Factor
  • Article: Detection of rare genomic variants from pooled sequencing using SPLINTER.
    [show abstract] [hide abstract]
    ABSTRACT: As DNA sequencing technology has markedly advanced in recent years(2), it has become increasingly evident that the amount of genetic variation between any two individuals is greater than previously thought(3). In contrast, array-based genotyping has failed to identify a significant contribution of common sequence variants to the phenotypic variability of common disease(4,5). Taken together, these observations have led to the evolution of the Common Disease / Rare Variant hypothesis suggesting that the majority of the "missing heritability" in common and complex phenotypes is instead due to an individual's personal profile of rare or private DNA variants(6-8). However, characterizing how rare variation impacts complex phenotypes requires the analysis of many affected individuals at many genomic loci, and is ideally compared to a similar survey in an unaffected cohort. Despite the sequencing power offered by today's platforms, a population-based survey of many genomic loci and the subsequent computational analysis required remains prohibitive for many investigators. To address this need, we have developed a pooled sequencing approach(1,9) and a novel software package(1) for highly accurate rare variant detection from the resulting data. The ability to pool genomes from entire populations of affected individuals and survey the degree of genetic variation at multiple targeted regions in a single sequencing library provides excellent cost and time savings to traditional single-sample sequencing methodology. With a mean sequencing coverage per allele of 25-fold, our custom algorithm, SPLINTER, uses an internal variant calling control strategy to call insertions, deletions and substitutions up to four base pairs in length with high sensitivity and specificity from pools of up to 1 mutant allele in 500 individuals. Here we describe the method for preparing the pooled sequencing library followed by step-by-step instructions on how to use the SPLINTER package for pooled sequencing analysis (http://www.ibridgenetwork.org/wustl/splinter). We show a comparison between pooled sequencing of 947 individuals, all of whom also underwent genome-wide array, at over 20kb of sequencing per person. Concordance between genotyping of tagged and novel variants called in the pooled sample were excellent. This method can be easily scaled up to any number of genomic loci and any number of individuals. By incorporating the internal positive and negative amplicon controls at ratios that mimic the population under study, the algorithm can be calibrated for optimal performance. This strategy can also be modified for use with hybridization capture or individual-specific barcodes and can be applied to the sequencing of naturally heterogeneous samples, such as tumor DNA.
    Journal of Visualized Experiments 01/2012;
  • Source
    Article: Sensitive single-molecule protein quantification and protein complex detection in a microarray format.
    Lee A Tessler, Robi D Mitra
    [show abstract] [hide abstract]
    ABSTRACT: Single-molecule protein analysis provides sensitive protein quantitation with a digital read-out and is promising for studying biological systems and detecting biomarkers clinically. However, current single-molecule platforms rely on the quantification of one protein at a time. Conventional antibody microarrays are scalable to detect many proteins simultaneously, but they rely on less sensitive and less quantitative quantification by the ensemble averaging of fluorescent molecules. Here, we demonstrate a single-molecule protein assay in a microarray format enabled by an ultra-low background surface and single-molecule imaging. The digital read-out provides a highly sensitive, low femtomolar limit of detection and four orders of magnitude of dynamic range through the use of hybrid digital-analog quantification. From crude cell lysate, we measured levels of p53 and MDM2 in parallel, proving the concept of a digital antibody microarray for use in proteomic profiling. We also applied the single-molecule microarray to detect the p53-MDM2 protein complex in cell lysate. Our study is promising for development and application of single-molecule protein methods because it represents a technological bridge between single-plex and highly multiplex studies.
    Proteomics 12/2011; 11(24):4731-5. · 4.43 Impact Factor
  • Article: Nanogel surface coatings for improved single-molecule imaging substrates.
    [show abstract] [hide abstract]
    ABSTRACT: Surfaces that resist protein adsorption are important for many bioanalytical applications. Bovine serum albumin (BSA) coatings and multi-arm poly(ethylene glycol) (PEG) coatings display low levels of non-specific protein adsorption and have enabled highly quantitative single-molecule (SM) protein studies. Recently, a method was developed for coating a glass with PEG-BSA nanogels, a promising hybrid of these two low-background coatings. We characterized the nanogel coating to determine its suitability for SM protein experiments. SM adsorption counting revealed that nanogel-coated surfaces exhibit lower protein adsorption than covalently coupled BSA surfaces and monolayers of multi-arm PEG, so this surface displays one of the lowest degrees of protein adsorption yet observed. Additionally, the nanogel coating was resistant to DNA adsorption, underscoring the utility of the coating across a variety of SM experiments. The nanogel coating was found to be compatible with surfactants, whereas the BSA coating was not. Finally, applying the coating to a real-world study, we found that single ligand molecules could be tethered to this surface and detected with high sensitivity and specificity by a digital immunoassay. These results suggest that PEG-BSA nanogel coatings will be highly useful for the SM analysis of proteins.
    Journal of The Royal Society Interface 02/2011; 8(63):1400-8. · 4.40 Impact Factor
  • Source
    Article: High-throughput discovery of rare insertions and deletions in large cohorts.
    [show abstract] [hide abstract]
    ABSTRACT: Pooled-DNA sequencing strategies enable fast, accurate, and cost-effect detection of rare variants, but current approaches are not able to accurately identify short insertions and deletions (indels), despite their pivotal role in genetic disease. Furthermore, the sensitivity and specificity of these methods depend on arbitrary, user-selected significance thresholds, whose optimal values change from experiment to experiment. Here, we present a combined experimental and computational strategy that combines a synthetically engineered DNA library inserted in each run and a new computational approach named SPLINTER that detects and quantifies short indels and substitutions in large pools. SPLINTER integrates information from the synthetic library to select the optimal significance thresholds for every experiment. We show that SPLINTER detects indels (up to 4 bp) and substitutions in large pools with high sensitivity and specificity, accurately quantifies variant frequency (r = 0.999), and compares favorably with existing algorithms for the analysis of pooled sequencing data. We applied our approach to analyze a cohort of 1152 individuals, identifying 48 variants and validating 14 of 14 (100%) predictions by individual genotyping. Thus, our strategy provides a novel and sensitive method that will speed the discovery of novel disease-causing rare variants.
    Genome Research 11/2010; 20(12):1711-8. · 13.61 Impact Factor
  • Article: TATA is a modular component of synthetic promoters.
    [show abstract] [hide abstract]
    ABSTRACT: The expression of most genes is regulated by multiple transcription factors. The interactions between transcription factors produce complex patterns of gene expression that are not always obvious from the arrangement of cis-regulatory elements in a promoter. One critical element of promoters is the TATA box, the docking site for the RNA polymerase holoenzyme. Using a synthetic promoter system coupled to a thermodynamic model of combinatorial regulation, we analyze the effects of different strength TATA boxes on various aspects of combinatorial cis-regulation. The thermodynamic model explains 75% of the variance in gene expression in synthetic promoter libraries with different strength TATA boxes, suggesting that many of the salient aspects of cis-regulation are captured by the model. Our results demonstrate that the effect of changing the TATA box on gene expression is the same for all synthetic promoters regardless of the arrangement of cis-regulatory sites we studied. Our analysis also showed that in our synthetic system the strength of the RNA polymerase-TATA interaction does not alter the combinatorial interactions between transcription factors, or between transcription factors and RNA polymerase. Finally, we show that although stronger TATA boxes increase expression in a predictable fashion, stronger TATA boxes have very little effect on noise in our synthetic promoters, regardless of the arrangement of cis-regulatory sites. Our results support a modular model of promoter function, where cis-regulatory elements can be mixed and matched (programmed) with outcomes on expression that are predictable based on the rules of simple protein-protein and protein-DNA interactions.
    Genome Research 10/2010; 20(10):1391-7. · 13.61 Impact Factor
  • Article: Cardiac signaling genes exhibit unexpected sequence diversity in sporadic cardiomyopathy, revealing HSPB7 polymorphisms associated with disease.
    [show abstract] [hide abstract]
    ABSTRACT: Sporadic heart failure is thought to have a genetic component, but the contributing genetic events are poorly defined. Here, we used ultra-high-throughput resequencing of pooled DNAs to identify SNPs in 4 biologically relevant cardiac signaling genes, and then examined the association between allelic variants and incidence of sporadic heart failure in 2 large Caucasian populations. Resequencing of DNA pools, each containing DNA from approximately 100 individuals, was rapid, accurate, and highly sensitive for identifying common and rare SNPs; it also had striking advantages in time and cost efficiencies over individual resequencing using conventional Sanger methods. In 2,606 individuals examined, we identified a total of 129 separate SNPs in the 4 cardiac signaling genes, including 23 nonsynonymous SNPs that we believe to be novel. Comparison of allele frequencies between 625 Caucasian nonaffected controls and 1,117 Caucasian individuals with systolic heart failure revealed 12 SNPs in the cardiovascular heat shock protein gene HSPB7 with greater proportional representation in the systolic heart failure group; all 12 SNPs were confirmed in an independent replication study. These SNPs were found to be in tight linkage disequilibrium, likely reflecting a single genetic event, but none altered amino acid sequence. These results establish the power and applicability of pooled resequencing for comparative SNP association analysis of target subgenomes in large populations and identify an association between multiple HSPB7 polymorphisms and heart failure.
    The Journal of clinical investigation 01/2010; 120(1):280-9. · 15.39 Impact Factor
  • Article: Identifying genetic determinants needed to establish a human gut symbiont in its habitat.
    [show abstract] [hide abstract]
    ABSTRACT: The human gut microbiota is a metabolic organ whose cellular composition is determined by a dynamic process of selection and competition. To identify microbial genes required for establishment of human symbionts in the gut, we developed an approach (insertion sequencing, or INSeq) based on a mutagenic transposon that allows capture of adjacent chromosomal DNA to define its genomic location. We used massively parallel sequencing to monitor the relative abundance of tens of thousands of transposon mutants of a saccharolytic human gut bacterium, Bacteroides thetaiotaomicron, as they established themselves in wild-type and immunodeficient gnotobiotic mice, in the presence or absence of other human gut commensals. In vivo selection transforms this population, revealing functions necessary for survival in the gut: we show how this selection is influenced by community composition and competition for nutrients (vitamin B(12)). INSeq provides a broadly applicable platform to explore microbial adaptation to the gut and other ecosystems.
    Cell host & microbe 09/2009; 6(3):279-89. · 13.02 Impact Factor
  • Source
    Article: Protein quantification in complex mixtures by solid phase single-molecule counting.
    [show abstract] [hide abstract]
    ABSTRACT: Here we present a procedure for quantifying single protein molecules affixed to a surface by counting bound antibodies. We systematically investigate many of the parameters that have prevented the robust single-molecule detection of surface-immobilized proteins. We find that a chemically adsorbed bovine serum albumin surface facilitates the efficient detection of single target molecules with fluorescent antibodies, and we show that these antibodies bind for lengths of time sufficient for imaging billions of individual protein molecules. This surface displays a low level of nonspecific protein adsorption so that bound antibodies can be directly counted without employing two-color coincidence detection. We accurately quantify protein abundance by counting bound antibody molecules and perform this robustly in real-world serum samples. The number of antibody molecules we quantify relates linearly to the number of immobilized protein molecules (R(2) = 0.98), and our precision (1-5% CV) facilitates the reliable detection of small changes in abundance (7%). Thus, our procedure allows for single, surface-immobilized protein molecules to be detected with high sensitivity and accurately quantified by counting bound antibody molecules. Promisingly, we can probe flow cells multiple times with antibodies, suggesting that in the future it should be possible to perform multiplexed single-molecule immunoassays.
    Analytical Chemistry 08/2009; 81(17):7141-8. · 5.86 Impact Factor
  • Source
    Article: Intra-tumor heterogeneity of MLH1 promoter methylation revealed by deep single molecule bisulfite sequencing.
    [show abstract] [hide abstract]
    ABSTRACT: A single tumor may contain cells with different somatic mutations. By characterizing this genetic heterogeneity within tumors, advances have been made in the prognosis, treatment and understanding of tumorigenesis. In contrast, the extent of epigenetic intra-tumor heterogeneity and how it influences tumor biology is under-explored. We have characterized epigenetic heterogeneity within individual tumors using next-generation sequencing. We used deep single molecule bisulfite sequencing and sample-specific DNA barcodes to determine the spectrum of MLH1 promoter methylation across an average of 1000 molecules in each of 33 individual samples in parallel, including endometrial cancer, matched blood and normal endometrium. This first glimpse, deep into each tumor, revealed unexpectedly heterogeneous patterns of methylation at the MLH1 promoter within a subset of endometrial tumors. This high-resolution analysis allowed us to measure the clonality of methylation in individual tumors and gain insight into the accumulation of aberrant promoter methylation on both alleles during tumorigenesis.
    Nucleic Acids Research 07/2009; 37(14):4603-12. · 8.03 Impact Factor
  • Article: Nested Patch PCR for highly multiplexed amplification of genomic loci.
    Katherine E Varley, Robi D Mitra
    [show abstract] [hide abstract]
    ABSTRACT: Nested Patch polymerase chain reaction (PCR) amplifies a large number (greater than 90) of targeted loci from genomic DNA simultaneously in the same reaction. These amplified loci can then be sequenced on a second-generation sequencing machine to detect single nucleotide polymorphisms (SNPs) and mutations. The reaction is highly specific: 90% of sequencing reads match targeted loci. Nested Patch PCR can be performed on many samples in parallel, and by using sample-specific DNA barcodes, these can be pooled and sequenced in a single reaction. Thus, the Nested Patch PCR protocol that is described here provides an easy workflow to identify SNPs and mutations across many targeted loci for many samples in parallel.
    Cold Spring Harbor Protocols 07/2009; 2009(7):pdb.prot5252. · 4.63 Impact Factor
  • Article: Quantification of rare allelic variants from pooled genomic DNA.
    [show abstract] [hide abstract]
    ABSTRACT: We report a targeted, cost-effective method to quantify rare single-nucleotide polymorphisms from pooled human genomic DNA using second-generation sequencing. We pooled DNA from 1,111 individuals and targeted four genes to identify rare germline variants. Our base-calling algorithm, SNPSeeker, derived from large deviation theory, detected single-nucleotide polymorphisms present at frequencies below the raw error rate of the sequencing platform.
    Nature Methods 04/2009; 6(4):263-5. · 19.28 Impact Factor
  • Source
    Article: 'Calling Cards' method for high-throughput identification of targets of yeast DNA-binding proteins.
    [show abstract] [hide abstract]
    ABSTRACT: We present a protocol for a novel method for identifying the targets of DNA-binding proteins in the genome of the yeast Saccharomyces cerevisiae. This is accomplished by engineering a DNA-binding protein so that it leaves behind in the genome a permanent mark -- a 'calling card' -- that provides a record of that protein's visit to that region of the genome. The calling card is the yeast Ty5 retrotransposon, whose integrase interacts with the Sir4 protein. If Sir4 is fused to a DNA-binding protein, it recruits the Ty5 integrase, which directs insertion of a Ty5 calling card into the genome. The calling card along with the flanking genomic DNA is harvested by inverse PCR and its genomic location is determined by hybridization of the product to a DNA microarray. This method provides a straightforward alternative to the 'ChIP-chip' method for determining the targets of DNA-binding proteins. This protocol takes approximately 2 weeks to complete.
    Nature Protocol 02/2008; 3(10):1569-77. · 8.36 Impact Factor
  • Article: Bedraggled, a putative transporter, influences the tissue polarity complex during the R3/R4 fate decision in the Drosophila eye.
    [show abstract] [hide abstract]
    ABSTRACT: The tissue polarity pathway is required for the establishment of epithelial polarity in a variety of vertebrate and invertebrate organs. Core tissue polarity proteins act in a dynamically regulated complex to direct the polarization of the Drosophila eye. We report the identification and characterization of bedraggled (bdg), a novel gene that regulates one output of the tissue polarity pathway--the establishment of the R3/R4 photoreceptor fates. bdg encodes a novel, putative transporter protein and interacts genetically with all of the core polarity genes to influence the specification of the R3 and R4 cell fates. Finally, bdg is required for both viability and the initial stages of imaginal disc development.
    Genetics 10/2007; 177(1):313-28. · 4.01 Impact Factor
  • Source
    Article: Long-range polony haplotyping of individual human chromosome molecules.
    [show abstract] [hide abstract]
    ABSTRACT: We report a method for multilocus long-range haplotyping on human chromosome molecules in vitro based on the DNA polymerase colony (polony) technology. By immobilizing thousands of intact chromosome molecules within a polyacrylamide gel on a microscope slide and performing multiple amplifications from single molecules, we determined long-range haplotypes spanning a 153-Mb region of human chromosome 7 and found evidence of rare mitotic recombination events in human lymphocytes. Furthermore, the parallel nature of DNA polony technology allows efficient haplotyping on pooled DNAs from a population on one slide, with a throughput three orders of magnitudes higher than current molecular haplotyping methods. Linkage disequilibrium statistics established by our pooled DNA haplotyping method are more accurate than statistically inferred haplotypes. This haplotyping method is well suited for candidate gene-based association studies as well as for investigating the pattern of recombination in mammalian cells.
    Nature Genetics 04/2006; 38(3):382-7. · 35.53 Impact Factor
  • Article: Accurate multiplex polony sequencing of an evolved bacterial genome.
    [show abstract] [hide abstract]
    ABSTRACT: We describe a DNA sequencing technology in which a commonly available, inexpensive epifluorescence microscope is converted to rapid nonelectrophoretic DNA sequencing automation. We apply this technology to resequence an evolved strain of Escherichia coli at less than one error per million consensus bases. A cell-free, mate-paired library provided single DNA molecules that were amplified in parallel to 1-micrometer beads by emulsion polymerase chain reaction. Millions of beads were immobilized in a polyacrylamide gel and subjected to automated cycles of sequencing by ligation and four-color imaging. Cost per base was roughly one-ninth as much as that of conventional sequencing. Our protocols were implemented with off-the-shelf instrumentation and reagents.
    Science 10/2005; 309(5741):1728-32. · 31.20 Impact Factor
  • Source
    Article: Advanced sequencing technologies: methods and goals.
    Nature Reviews Genetics 06/2004; 5(5):335-44. · 38.08 Impact Factor
  • Article: Digital quantitative measurements of gene expression.
    [show abstract] [hide abstract]
    ABSTRACT: One of the primary goals of functional genomics is to provide a quantitative understanding of gene function. However, the success of this enterprise is dependent on the accuracy and precision of the functional genomic data. A novel approach, digital analysis of gene expression (DAGE) described herein, is an accurate and precise technology for measuring digital gene expression on a relative or absolute scale by simply counting the number of transcripts of a gene being expressed at a given time. The result is a greatly improved technology sensitive enough for identifying and quantifying small (but biologically important and statistically relevant) changes in gene expression. Fourteen genes involved in galactose metabolism in Saccharomyces cerevisiae were analyzed for their expression levels in glucose and galactose minimal media. The quantitative expression results were characterized in terms of distributional and accuracy attributes; they were also in general agreement (in terms of direction of change) with corresponding results obtained using microarray technology. DAGE is likely to have profound implications in the field of functional genomics because the gene expression measurements are digital in nature and therefore more accurate than any other technologies.
    Biotechnology and Bioengineering 05/2004; 86(2):117-24. · 3.95 Impact Factor
  • Source
    Article: Fluorescent in situ sequencing on polymerase colonies.
    [show abstract] [hide abstract]
    ABSTRACT: Integration of DNA isolation, amplification, and sequencing can be achieved by the use of polymerase colonies (polonies) and cycles of fluorescent dNTP incorporation. In this paper, we present four advances that bring us closer to sequencing genomes cost-effectively using the polony technology. First, a polymerase trapping technique enables efficient nucleotide extension by DNA polymerase in a polyacrylamide matrix and eliminates loss of enzyme during sequencing cycles. Next, we present two novel types of reversibly dye-labeled nucleotide analogues, show that DNA polymerase can incorporate these analogues, and demonstrate that the dyes can be removed by thiol reduction or light exposure. Using these nucleotides, we have sequenced multiple polonies in parallel. In addition, we have found that a high density of polonies can be achieved with minimal overlap between adjacent polonies by limiting the concentration of free primer in the polony amplification reactions. Finally, we have developed software for automated image alignment and sequence calling.
    Analytical Biochemistry 10/2003; 320(1):55-65. · 3.00 Impact Factor