Development and application of a 6.5 Million feature Affymetrix GeneChip(R) for massively parallel discovery of single position polymorphisms in lettuce (Lactuca spp.)

BMC Genomics (Impact Factor: 3.99). 05/2012; 13(1):185. DOI: 10.1186/1471-2164-13-185
Source: PubMed


High-resolution genetic maps are needed in many crops to help characterize the genetic diversity that determines agriculturally important traits. Hybridization to microarrays to detect single feature polymorphisms is a powerful technique for marker discovery and genotyping because of its highly parallel nature. However, microarrays designed for gene expression analysis rarely provide sufficient gene coverage for optimal detection of nucleotide polymorphisms, which limits utility in species with low rates of polymorphism such as lettuce (Lactuca sativa).
We developed a 6.5 million feature Affymetrix GeneChip® for efficient polymorphism discovery and genotyping, as well as for analysis of gene expression in lettuce. Probes on the microarray were designed from 26,809 unigenes from cultivated lettuce and an additional 8,819 unigenes from four related species (L. serriola, L. saligna, L. virosa and L. perennis). Where possible, probes were tiled with a 2 bp stagger, alternating on each DNA strand; providing an average of 187 probes covering approximately 600 bp for each of over 35,000 unigenes; resulting in up to 13 fold redundancy in coverage per nucleotide. We developed protocols for hybridization of genomic DNA to the GeneChip® and refined custom algorithms that utilized coverage from multiple, high quality probes to detect single position polymorphisms in 2 bp sliding windows across each unigene. This allowed us to detect greater than 18,000 polymorphisms between the parental lines of our core mapping population, as well as numerous polymorphisms between cultivated lettuce and wild species in the lettuce genepool. Using marker data from our diversity panel comprised of 52 accessions from the five species listed above, we were able to separate accessions by species using both phylogenetic and principal component analyses. Additionally, we estimated the diversity between different types of cultivated lettuce and distinguished morphological types.
By hybridizing genomic DNA to a custom oligonucleotide array designed for maximum gene coverage, we were able to identify polymorphisms using two approaches for pair-wise comparisons, as well as a highly parallel method that compared all 52 genotypes simultaneously.

Download full-text


Available from: Hamid Ashrafi
  • Source
    • "The most popular types produced for leaf consumption are iceberg with a large spherical head, romaine with an elongated head, butterhead with a small spherical head and pliable leaves with oily texture, and non-heading leaf-type lettuces with loose leaves. Analyses with molecular markers identified low genetic variability in iceberg and to lesser extent also in romaine and butterhead types12345. Narrow genetic base may mean that these types are impoverished with respect to genes for resistance to pathogens relative to the lettuce genepools6. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Many cultivars of lettuce (Lactuca sativa L.), the most popular leafy vegetable, are susceptible to downy mildew disease caused by Bremia lactucae. Cultivars Iceberg and Grand Rapids that were released in the 18(th) and 19(th) centuries, respectively, have high levels of quantitative resistance to downy mildew. We developed a population of recombinant inbred lines (RILs) originating from a cross between these two legacy cultivars, constructed a linkage map, and identified two QTLs for resistance on linkage groups 2 (qDM2.1) and 5 (qDM5.1) that determined resistance under field conditions in California and the Netherlands. The same QTLs determined delayed sporulation at the seedling stage in laboratory experiments. Alleles conferring elevated resistance at both QTLs originate from cultivar Iceberg. An additional QTL on linkage group 9 (qDM9.1) was detected through simultaneous analysis of all experiments with mixed-model approach. Alleles for elevated resistance at this locus originate from cultivar Grand Rapids.
    Full-text · Article · Oct 2013 · Scientific Reports
  • Source
    • "The reference set value was calculated for all GC bins and all chips so that the data from each chip were comparable with a chip’s own reference set. A background signal for each chip was also calculated based on anti-genomic probes within each GC bin (Stoffel et al. 2012). A probe was excluded from further analysis if its hybridization intensity was less than the 90 percentile of the hybridization intensity of all anti-genomic probes (see Stoffel et al. 2012) in its specific GC bin. "
    [Show abstract] [Hide abstract]
    ABSTRACT: We have generated an ultra-high-density genetic map for lettuce, an economically important member of the Compositae, consisting of 12,842 unigenes (13,943 markers) mapped in 3,696 genetic bins distributed over nine chromosomal linkage groups. Genomic DNA was hybridized to a custom Affymetrix oligonucleotide array containing 6.4 million features representing 35,628 unigenes of Lactuca spp. Segregation of single-position polymorphisms was analyzed using 213 F7:8 recombinant inbred lines (RILs) that had been generated by crossing cultivated Lactuca sativa cv. Salinas and L. serriola acc. US96UC23, the wild progenitor species of L. sativa. The high level of replication of each allele in the recombinant inbred lines was exploited to identify single-position polymorphisms that were assigned to parental haplotypes. Marker information has been made available using GBrowse to facilitate access to the map. This map has been anchored to the previously published integrated map of lettuce providing candidate genes for multiple phenotypes. The high density of markers achieved in this ultra-dense map allowed syntenic studies between lettuce and Vitis vinifera as well as other plant species.
    Full-text · Article · Mar 2013 · G3-Genes Genomes Genetics
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: BACKGROUND:Molecular breeding of pepper (Capsicum spp.) can be accelerated by developing DNA markers associated with transcriptomes in breeding germplasm. Before the advent of next generation sequencing (NGS) technologies, the majority of sequencing data were generated by the Sanger sequencing method. By leveraging Sanger EST data, we have generated a wealth of genetic information for pepper including thousands of SNPs and Single Position Polymorphic (SPP) markers. To complement and enhance these resources, we applied NGS to three pepper genotypes: Maor, Early Jalapeno and Criollo de Morelos-334 (CM334) to identify SNPs and SSRs in the assembly of these three genotypes.RESULTS:Two pepper transcriptome assemblies were developed with different purposes. The first reference sequence, assembled by CAP3 software, comprises 31,196 contigs from >125,000 Sanger-EST sequences that were mainly derived from a Korean F1-hybrid line, Bukang. Overlapping probes were designed for 30,815 unigenes to construct a pepper Affymetrix GeneChip(R) microarray for whole genome analyses. In addition, custom Python scripts were used to identify 4,236 SNPs in contigs of the assembly. A total of 2,489 simple sequence repeats (SSRs) were identified from the assembly, and primers were designed for the SSRs. Annotation of contigs using Blast2GO software resulted in information for 60% of the unigenes in the assembly. The second transcriptome assembly was constructed from more than 200 million Illumina Genome Analyzer II reads (80--120 nt) using a combination of Velvet, CLC workbench and CAP3 software packages. BWA, SAMtools and in-house Perl scripts were used to identify SNPs among three pepper genotypes. The SNPs were filtered to be at least 50 bp from any intron-exon junctions as well as flanking SNPs. More than 22,000 high-quality putative SNPs were identified. Using the MISA software, 10,398 SSR markers were also identified within the Illumina transcriptome assembly and primers were designed for the identified markers. The assembly was annotated by Blast2GO and 14,740 (12%) of annotated contigs were associated with functional proteins.CONCLUSIONS:Before availability of pepper genome sequence, assembling transcriptomes of this economically important crop was required to generate thousands of high-quality molecular markers that could be used in breeding programs. In order to have a better understanding of the assembled sequences and to identify candidate genes underlying QTLs, we annotated the contigs of Sanger-EST and Illumina transcriptome assemblies. These and other information have been curated in a database that we have dedicated for pepper project.
    Full-text · Article · Jan 2012
Show more