Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

Department of Computer Science, National Tsing Hua University, Hsin-Chu, Taiwan.
The Scientific World Journal (Impact Factor: 1.22). 06/2012; 2012:365104. DOI: 10.1100/2012/365104
Source: PubMed

ABSTRACT The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: A new species of the chlorophycean genus Desmodesmus, D. baconii, was described based upon analyses of the morphology of the coenobia and DNA sequences from the nuclear ribosomal 18S RNA gene and the internal transcribed spacer region. Desmodesmus baconii was unusual in that it possessed two rows of spines only on the terminal cells. This species was easily distinguished from other Desmodesmus taxa with two rows of spines by the point symmetry of the coenobia and the lack of rows of spines on median cells of four-celled coenobia. Spines tapered from prominent incurved spines near one pole to small spines at the opposite pole. Median cells and the pole of terminal cells opposite the large spines had two or three short spines. Cell sizes were slightly larger for two-celled (4.2-6.7 mu m x 1.6-2.9 mu m) than for four-celled coenobia (3.2-6.5 mu m x 1.2-4.5 mu m). Results of analyses of DNA sequence data indicated that D. baconii, although a member of the monophyletic genus Desmodesmus, was not closely allied with any other Desmodesmus taxon. Desmodesmus baconii was isolated from hypereutrophic Lake Chicot in Arkansas, USA, and is not known from any other location.
    Phycologia 11/2013; 52(6):565-572. DOI:10.2216/12-116.1 · 1.82 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: In this study, we propose an approach aiming at fine-mapping adiposity QTL in chicken, integrating whole genome re-sequencing data. First, two QTL regions for adiposity were identified by performing a classical linkage analysis on 1362 offspring in 11 sire families obtained by crossing two meat-type chicken lines divergently selected for abdominal fat weight. Those regions, located on chromosome 7 and 19, contained a total of 77 and 84 genes, respectively. Then, SNPs and indels in these regions were identified by re-sequencing sires. Considering issues related to polymorphism annotations for regulatory regions, we focused on the 120 and 104 polymorphisms having an impact on protein sequence, and located in coding regions of 35 and 42 genes situated in the two QTL regions. Subsequently, a filter was applied on SNPs considering their potential impact on the protein function based on conservation criteria. For the two regions, we identified 42 and 34 functional polymorphisms carried by 18 and 24 genes, and likely to deeply impact protein, including 3 coding indels and 4 nonsense SNPs. Finally, using gene functional annotation, a short list of 17 and 4 polymorphisms in 6 and 4 functional genes has been defined. Even if we cannot exclude that the causal polymorphisms may be located in regulatory regions, this strategy gives a complete overview of the candidate polymorphisms in coding regions and prioritize them on conservation- and functional-based arguments.
    PLoS ONE 10/2014; 9(10):e111299. DOI:10.1371/journal.pone.0111299 · 3.53 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Aim. The relationship between genetic polymorphisms of the glucagon-like peptide-1 (GLP-1) receptor (GLP1R) gene and unresponsiveness to GLP-1 analogue treatment in patients with poorly controlled type 2 diabetes mellitus (DM) is unclear. Methods. Thirty-six patients with poorly controlled type 2 DM were enrolled and they received six days of continuous subcutaneous insulin infusion for this study. After the normalization of blood glucose in the first 3 days, the patients then received a combination therapy with injections of the GLP-1 analogue, exenatide, for another 3 days. All 13 exons and intron-exon boundaries of the GLP1R gene were amplified to investigate the association. Results. The short tandem repeat at 8GA/7GA (rs5875654) had complete linkage disequilibrium (LD, with ) with single nucleotide polymorphism (SNP) rs761386. Quantitative trait loci analysis of GLP1R gene variation with clinical response of GLP1 analogue showed the missense rs3765467 and rs761386 significantly associated with changes in the standard deviation of plasma glucose () ( and 0.019, resp.). The reported values became insignificant after multiple testing adjustments. Conclusion. The variable response to the GLP-1 analogue was not statistically correlated with polymorphisms of the GLP1R gene in patients with poorly controlled type 2 DM.
    Journal of Diabetes Research 01/2015; 2015:1-10. DOI:10.1155/2015/176949 · 3.54 Impact Factor

Full-text (2 Sources)

Available from
May 20, 2014