Rapid gene-based SNP and haplotype marker development in non-model eukaryotes using 3'UTR sequencing

Department of Horticulture, Washington State University, Pullman, WA, USA.
BMC Genomics (Impact Factor: 4.04). 01/2012; 13:18. DOI: 10.1186/1471-2164-13-18
Source: PubMed

ABSTRACT Sweet cherry (Prunus avium L.), a non-model crop with narrow genetic diversity, is an important member of sub-family Amygdaloideae within Rosaceae. Compared to other important members such as peach and apple, sweet cherry lacks genetic and genomic information, which impedes understanding of important biological processes and the development of efficient breeding approaches. The availability of single nucleotide polymorphism (SNP)-based molecular markers can greatly benefit breeding efforts in such non-model species. RNA-seq approaches employing second generation sequencing platforms offer a unique avenue to rapidly identify gene-based SNPs. Additionally, haplotype markers can be rapidly generated from transcript-based SNPs; such markers have proven highly useful for identifying genetic variants related to health, disease and response to environment, as highlighted by the human HapMap project.
RNA-seq was performed on two sweet cherry cultivars, Bing and Rainier, using a 3' untranslated region (UTR) sequencing method, yielding 43,396 assembled contigs. To test our approach of rapidly identifying SNPs without any reference genome information, over 25% (10,100) of the contigs were screened for SNPs. A total of 207 contigs from this set were found to contain high-quality SNPs. A set of 223 primer pairs was designed to amplify SNP-containing regions from these contigs, and high resolution melting (HRM) analysis was performed with eight important parental sweet cherry cultivars. Six of the parent cultivars were distantly related to Bing and Rainier, the cultivars used for initial SNP discovery. HRM analysis was also performed on 13 seedlings derived from a cross between two of the parents. Our analysis identified 84 (38.7%) primer sets that demonstrated variation among the tested germplasm. Reassembly of the raw 3'UTR sequences using upgraded transcriptome assembly software yielded 34,620 contigs containing 2,243 putative SNPs in 887 contigs after stringent filtering. Contigs with multiple SNPs were visually parsed to identify 685 putative haplotypes at 335 loci in 301 contigs.
This approach, which leverages the advantages of RNA-seq, enabled rapid generation of gene-linked SNP and haplotype markers. The general approach presented in this study can be easily applied to other non-model eukaryotes, irrespective of ploidy level, to identify gene-linked polymorphisms that are expected to facilitate efficient Gene Assisted Breeding (GAB), genotyping and population genetics studies. The identified SNP haplotypes reveal some of the allelic differences between the two sweet cherry cultivars analyzed. These SNP and haplotype markers are expected to significantly improve the genomic resources for sweet cherry and facilitate efficient GAB in this non-model crop.
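
As an illustration of the haplotype step only: the abstract states that contigs with multiple SNPs were parsed visually, so the following Python sketch is not the authors' method. It is a minimal sketch of how allele combinations could be tallied programmatically, assuming gap-free read alignments given as (start, sequence) pairs and 0-based SNP coordinates. The function name, the input layout, the min_reads threshold and the toy reads are all hypothetical.

```python
from collections import Counter


def haplotypes_in_contig(aligned_reads, snp_positions, min_reads=2):
    """Tally allele combinations (putative haplotypes) at SNP sites in one contig.

    aligned_reads: list of (start, sequence) tuples; start is the 0-based contig
                   coordinate of the first base, alignments assumed gap-free.
    snp_positions: sorted list of 0-based contig coordinates of the SNPs.
    min_reads:     hypothetical support threshold; combinations seen in fewer
                   reads are discarded as likely sequencing errors.
    """
    counts = Counter()
    for start, seq in aligned_reads:
        end = start + len(seq)
        # Only a read that spans every SNP site can link the alleles together.
        if start <= snp_positions[0] and snp_positions[-1] < end:
            alleles = "".join(seq[pos - start] for pos in snp_positions)
            counts[alleles] += 1
    return {hap: n for hap, n in counts.items() if n >= min_reads}


# Toy data (invented for illustration): a 12-base contig with SNPs at 3 and 8.
reads = [
    (0, "ACGTACGTACGT"),
    (0, "ACGTACGTACGT"),
    (2, "GCACGTGCGT"),
    (2, "GCACGTGCGT"),
    (1, "CGTACGTTCG"),  # singleton combination, dropped by min_reads
    (5, "CGTACGT"),     # does not cover the first SNP, skipped
]
result = haplotypes_in_contig(reads, [3, 8])
print(result)
```

On real data the reads would come from an alignment file and would need handling for gaps, base quality and strand, none of which is attempted here.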

  • ABSTRACT: High-throughput DNA and RNA sequencing technologies have enabled the successful identification of single nucleotide polymorphisms (SNPs). To develop a large SNP set for wide application in apricot (Prunus armeniaca L.), we carried out high-throughput RNA sequencing (RNA-Seq) in two apricot genotypes, “Rojo Pasión” and “Z506-7.” After trimming and cleaning, 70% of RNA-Seq reads were aligned to the reference peach genome. Sequences uniquely mapped on the peach genome allowed for the discovery of 300 k SNP/INDEL variations, with a density of one SNP per 850 bp. Some 95 SNPs of the 99 tested were analyzed in a set of 37 apricot accessions using SNPlex™ genotyping technology. The results provide accurate values for nucleotide diversity in coding sequences in apricot. The combination of a highly efficient RNA-Seq approach and SNPlex™ high-throughput genotyping technology thus provides a powerful tool for apricot genetic analysis. SNP markers produced a total of 267 allelic combinations in the 37 apricot accessions assayed, with a mean of 2.8 combinations per locus, an observed heterozygosity per marker ranging from 0.06 to 0.65, and a power of discrimination ranging from 0.12 to 0.66. In addition, SNP markers confirmed parentage and determined relationships between the accessions in a manner consistent with their pedigree relationships.
    Tree Genetics & Genomes 02/2015; 11(1). DOI:10.1007/s11295-015-0845-2 · 2.44 Impact Factor
  • ABSTRACT: Camelina sativa, a largely relict crop, has recently attracted renewed interest due to its potential as an industrial oilseed. Molecular markers are key tools that will allow C. sativa to benefit from modern breeding approaches. Two complementary methodologies, capture of 3′ cDNA tags and genomic reduced-representation libraries, both of which exploited second generation sequencing platforms, were used to develop a low-density (768) Illumina GoldenGate single nucleotide polymorphism (SNP) array. The array allowed 533 SNP loci to be genetically mapped in a recombinant inbred population of C. sativa. Alignment of the SNP loci to the C. sativa genome identified the underlying sequenced regions that would delimit potential candidate genes in any mapping project. In addition, the SNP array was used to assess genetic variation among a collection of 175 accessions of C. sativa, identifying two sub-populations but low overall gene diversity. The SNP loci will provide useful tools for future crop improvement of C. sativa.
    Molecular Breeding 01/2015; 35:35. DOI:10.1007/s11032-015-0224-6 · 2.28 Impact Factor
