Next generation sequencing for TCR repertoire profiling: Platform-specific features and correction algorithms

Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry RAS, Moscow, Russia.
European Journal of Immunology (Impact Factor: 4.52). 11/2012; 42(11). DOI: 10.1002/eji.201242517
Source: PubMed

ABSTRACT The TCR repertoire is a mirror of the human immune system that reflects processes caused by infections, cancer, autoimmunity, and aging. Next generation sequencing (NGS) is becoming a powerful tool for deep TCR profiling; yet, questions abound regarding the methodological approaches for sample preparation and correct data interpretation. Accumulated PCR and sequencing errors along with library preparation bottlenecks and uneven PCR efficiencies lead to information loss, biased quantification, and generation of huge artificial TCR diversity. Here, we compare Illumina, 454, and Ion Torrent platforms for individual TCR profiling, evaluate the rate and character of errors, and propose advanced platform-specific algorithms to correct massive sequencing data. These developments are applicable to a wide variety of next generation sequencing applications. We demonstrate that advanced correction allows the removal of the majority of artificial TCR diversity with concomitant rescue of most of the sequencing information. Thus, this correction enhances the accuracy of clonotype identification and quantification as well as overall TCR diversity measurements.

Download full-text


Available from: Ilgar Z Mamedov, Jun 29, 2015
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Hemoglobinopathies, such as sickle cell disease (SCD) and beta-thalassemia major (TM), are severe diseases and the most common autosomal recessive condition worldwide and in particular in Oman. Early screening and diagnosis of carriers are the key for primary prevention. Once a country-wide population screening program is mandated by law, a sequencing technology that can rapidly confirm or identify disease-causing mutations for a large number of patients in a short period of time will be necessary. While Sanger sequencing is the standard protocol for molecular diagnosis, next generation sequencing starts to become available to reference laboratories. Using the Ion Torrent PGM sequencer, we have analyzed a cohort of 297 unrelated Omani cases and reliably identified mutations in the beta-globin (HBB) gene. Our model study has shown that Ion Torrent PGM can rapidly sequence such a small gene in a large number of samples using a barcoded uni-directional or bi-directional sequence methodology, reducing cost, workload and providing accurate diagnosis. Based on our results we believe that the Ion Torrent PGM sequencing platform, able to analyze hundreds of patients simultaneously for a single disease gene can be a valid molecular screening alternative to ABI sequencing in the diagnosis of hemoglobinopathies and other genetic disorders in the near future.
    Blood Cells Molecules and Diseases 05/2014; 53(3). DOI:10.1016/j.bcmd.2014.05.002 · 2.33 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Vaccines, like drugs and medical procedures, are increasingly amenable to individualization or personalization, often based on novel data resulting from high throughput "omics" technologies. As a result of these technologies, 21st century vaccinology will increasingly see the abandonment of a "one size fits all" approach to vaccine dosing and delivery, as well as the abandonment of the empiric "isolate-inactivate-inject" paradigm for vaccine development. In this review, we discuss the immune response network theory and its application to the new field of vaccinomics and adversomics, and illustrate how vaccinomics can lead to new vaccine candidates, new understandings of how vaccines stimulate immune responses, new biomarkers for vaccine response, and facilitate the understanding of what genetic and other factors might be responsible for rare side effects due to vaccines. Perhaps most exciting will be the ability, at a systems biology level, to integrate increasingly complex high throughput data into descriptive and predictive equations for immune responses to vaccines. Herein, we discuss the above with a view toward the future of vaccinology.
    Seminars in Immunology 06/2013; 25(2). DOI:10.1016/j.smim.2013.04.007 · 6.12 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: A major feature of an adaptive immune system is its ability to generate B- and T-cell clones capable of recognizing and neutralizing specific antigens. These clones recognize antigens with the help of the surface molecules, called antigen receptors, acquired individually during the clonal development process. In order to ensure a response to a broad range of antigens, the number of different receptor molecules is extremely large, resulting in a huge clonal diversity of both B- and T-cell receptor populations and making their experimental comparisons statistically challenging. To facilitate such comparisons, we propose a flexible parametric model of multivariate count data and illustrate its use in a simultaneous analysis of multiple antigen receptor populations derived from mammalian T-cells. The model relies on a representation of the observed receptor counts as a multivariate Poisson abundance mixture (mPAM). A Bayesian parameter fitting procedure is proposed, based on the complete posterior likelihood, rather than the conditional one used typically in similar settings. The new procedure is shown to be considerably more efficient than its conditional counterpart (as measured by the Fisher information) in the regions of mPAM parameter space relevant to modeling T-cell data.
    Journal of Theoretical Biology 03/2013; DOI:10.1016/j.jtbi.2013.02.009 · 2.30 Impact Factor