Automating sequence-based detection and genotyping of SNPs from diploid samples

Department of Statistics, University of Washington, Seattle, Washington 98195, USA.
Nature Genetics (Impact Factor: 29.65). 04/2006; 38(3):375-81. DOI: 10.1038/ng1746
Source: PubMed

ABSTRACT The detection of sequence variation, for which DNA sequencing has emerged as the most sensitive and automated approach, forms the basis of all genetic analysis. Here we describe and illustrate an algorithm that accurately detects and genotypes SNPs from fluorescence-based sequence data. Because the algorithm focuses particularly on detecting SNPs through the identification of heterozygous individuals, it is especially well suited to the detection of SNPs in diploid samples obtained after DNA amplification. It is substantially more accurate than existing approaches and, notably, provides a useful quantitative measure of its confidence in each potential SNP detected and in each genotype called. Calls assigned the highest confidence are sufficiently reliable to remove the need for manual review in several contexts. For example, for sequence data from 47-90 individuals sequenced on both the forward and reverse strands, the highest-confidence calls from our algorithm detected 93% of all SNPs and 100% of high-frequency SNPs, with no false positive SNPs identified and 99.9% genotyping accuracy. This algorithm is implemented in a software package, PolyPhred version 5.0, which is freely available for academic use.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The majority of the world's cacao for chocolate manufacture is produced in West Africa. Cocoa breeding programs in West Africa need genetic markers to reduce the time needed for improving cocoa by screening seedlings for the presence of the markers rather than mature plants for the phenotypic traits (i.e., marker-assisted selection [MAS]). For MAS to be successful, the breeder must have both access to markers linked to desired traits and a convenient marker-assay system that can be performed locally. In this study, microsatellite markers that flanked disease resistance quantitative trait loci (QTL) but could not be assayed conveniently in West Africa were converted using a genome walking method into single nucleotide polymorphism (SNP) markers that could be assayed locally. The SNP and microsatellite markers were equally effective in identifying off-types in two different mapping populations of cacao. Also, SNPs cast doubt on whether all microsatellite markers are identical by descent.
    Journal of Crop Improvement 03/2013; 27(2). DOI:10.1080/15427528.2012.752773
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The aim of this case-control study was to investigate whether the vitamin D receptor (VDR) 1a promoter gene polymorphisms are associated with susceptibility to polycystic ovary syndrome (PCOS). Women with PCOS and a control group, all aged 18-45 years, were enrolled. Genotypes of two functional single nucleotide polymorphisms (SNPs), the 1521 bp (G/C) and 1012 bp (A/G), located on the 1a promoter of the VDR gene were determined by using direct sequencing. Serum 25-hydroxyvitamin D levels were measured by ELISA. Two functional SNPs in the 1a promoter region of the VDR gene were in complete linkage disequilibrium. The genotype distributions of these two polymorphisms in the PCOS group were not significantly different from those of the control group. Further subgroup analyses according to body mass index also revealed no significant differences in the genotype distribution in the PCOS group. Significantly lower serum 25-hydroxyvitamin D levels were observed in the heterozygous 1521CG/1012GA haplotype of both groups. Metformin treatment was only effective to increase serum 25-hydroxyvitamin D levels in PCOS patients carrying the homozygous 1521G/1012A haplotype. These results suggest that the VDR 1a promoter polymorphisms may not be associated with the risk for PCOS, but are associated with serum 25-hydroxyvitamin D levels. Metformin treatment will be beneficial to PCOS patients without the VDR 1a promoter variant in Taiwanese population.
    Taiwanese journal of obstetrics & gynecology 12/2012; 51(4):565-71. DOI:10.1016/j.tjog.2012.09.011 · 1.26 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Differential allelic expression (DAE) is a powerful tool to identify cis-regulatory elements for gene expression. The UDP-glucuronosyltransferase 2 family, polypeptide B15 (UGT2B15), is an important enzyme involved in the metabolism of multiple endobiotics and xenobiotics. In the present study, we measured the relative expression of two alleles at SNP c.1568C>A (rs4148269) in this gene, which causes an amino acid substitution (T523K). An excess of the C over the A allele was consistently observed in both liver (P=0.0021) and breast (P=0.012) samples, suggesting that SNP(s) in strong linkage disequilibrium (LD) with c.1568C>A can regulate UGT2B15 expression in both tissues. By resequencing, one such SNP, c.1761T>C (rs3100) in 3' untranslated region (UTR), was identified. Reporter gene assays showed that the 1761T allele results in a significantly higher gene expression level than the 1761C allele in HepG2, MCF-7, LNCaP, and Caco-2 cell lines (all P<0.001), thus indicating that this variation can regulate UGT2B15 gene expression in liver, breast, colon, and prostate tissues. Considering its location, we postulated that this SNP is within an unknown microRNA binding site and can influence microRNA targeting. Considering the importance of UGT2B15 in metabolism, we proposed that this SNP might contribute to multiple cancer risk and variability in drug response.
    Gene 07/2011; 481(1):24-8. DOI:10.1016/j.gene.2011.04.001 · 2.08 Impact Factor

Preview (2 Sources)

Available from