The 1000 Genomes Project: data management and community access.

European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
Nature Methods (Impact Factor: 23.57). 04/2012; 9(5):459-62. DOI: 10.1038/nmeth.1974
Source: PubMed

ABSTRACT The 1000 Genomes Project was launched as one of the largest distributed data collection and analysis projects ever undertaken in biology. In addition to the primary scientific goals of creating both a deep catalog of human genetic variation and extensive methods to accurately discover and characterize variation using new sequencing technologies, the project makes all of its data publicly available. Members of the project data coordination center have developed and deployed several tools to enable widespread data access.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Human identity testing is critical to the fields of forensics, paternity, and hematopoietic stem cell transplantation. Most bone marrow (BM) engraftment testing currently uses microsatellites or short tandem repeats that are resolved by capillary electrophoresis. Single-nucleotide polymorphisms (SNPs) are theoretically a better choice among polymorphic DNA; however, ultrasensitive detection of SNPs using next-generation sequencing is currently not possible because of its inherently high error rate. We circumvent this problem by analyzing blocks of closely spaced SNPs, or haplotypes. As proof-of-principle, we chose the HLA-A locus because it is highly polymorphic and is already genotyped to select proper donors for BM transplant recipients. We aligned common HLA-A alleles and identified a region containing 18 closely spaced SNPs, flanked by nonpolymorphic DNA for primer placement. Analysis of cell line mixtures shows that the assay is accurate and precise, and has a lower limit of detection of approximately 0.01%. The BM from a series of hematopoietic stem cell transplantation patients who tested as all donor by short tandem repeat analysis demonstrated 0% to 1.5% patient DNA. Comprehensive analysis of the human genome using the 1000 Genomes database identified many additional loci that could be used for this purpose. This assay may prove useful to identify hematopoietic stem cell transplantation patients destined to relapse, microchimerism associated with solid organ transplantation, forensic applications, and possibly patient identification.
    The Journal of molecular diagnostics: JMD 09/2014; 16(5):495-503. · 3.48 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: With diminishing costs of next generation sequencing (NGS), whole genome analysis becomes a standard tool for identifying genetic causes of inherited diseases. Commercial NGS service providers in general not only provide raw genomic reads, but further deliver SNP calls to their clients. However, the question for the user arises whether to use the SNP data as is, or process the raw sequencing data further through more sophisticated SNP calling pipelines with more advanced algorithms.
    BMC Research Notes 01/2014; 7(1):747.
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: It is assumed that DNA sequences are conserved in the diverse cell types present in a multicellular organism like the human being. Thus, in order to compare the sequences in the genome of DNA from different individuals, nucleic acid is commonly isolated from a single tissue. In this regard, blood cells are widely used for this purpose because of their availability. Thus blood DNA has been used to study genetic familiar diseases that affect other tissues and organs, such as the liver, heart, and brain. While this approach is valid for the identification of familial diseases in which mutations are present in parental germinal cells and, therefore, in all the cells of a given organism, it is not suitable to identify sporadic diseases in which mutations might occur in specific somatic cells. This review addresses somatic DNA variations in different tissues or cells (mainly in the brain) of single individuals and discusses whether the dogma of DNA invariance between cell types is indeed correct. We will also discuss how single nucleotide somatic variations arise, focusing on the presence of specific DNA mutations in the brain.
    Front. Aging Neurosci. 01/2014; 6:323.

Full-text (2 Sources)

Available from
Jun 5, 2014