Article

A high-resolution map of segmental DNA copy number variation in the mouse genome.

Department of Medicine, Division of Oncology, Stem Cell Biology Section, Washington University, St. Louis, Missouri, United States of America.
PLoS Genetics (impact factor: 8.69). 02/2007; 3(1):e3. DOI:10.1371/journal.pgen.0030003
Source: PubMed

ABSTRACT Submicroscopic (less than 2 Mb) segmental DNA copy number changes are a recently recognized source of genetic variability between individuals. The biological consequences of copy number variants (CNVs) are largely undefined. In some cases, CNVs that cause gene dosage effects have been implicated in phenotypic variation. CNVs have been detected in diverse species, including mice and humans. Published studies in mice have been limited by resolution and strain selection. We chose to study 21 well-characterized inbred mouse strains that are the focus of an international effort to measure, catalog, and disseminate phenotype data. We performed comparative genomic hybridization using long oligomer arrays to characterize CNVs in these strains. This technique increased the resolution of CNV detection by more than an order of magnitude over previous methodologies. The CNVs range in size from 21 to 2,002 kb. Clustering strains by CNV profile recapitulates aspects of the known ancestry of these strains. Most of the CNVs (77.5%) contain annotated genes, and many (47.5%) colocalize with previously mapped segmental duplications in the mouse genome. We demonstrate that this technique can identify copy number differences associated with known polymorphic traits. The phenotype of previously uncharacterized strains can be predicted based on their copy number at these loci. Annotation of CNVs in the mouse genome combined with sequence-based analysis provides an important resource that will help define the genetic basis of complex traits.

0 0
 · 
0 Bookmarks
 · 
56 Views
  • Source
    Article: Unexpected complexity in the haplotypes of commonly used inbred strains of laboratory mice.
    [show abstract] [hide abstract]
    ABSTRACT: Investigation of sequence variation in common inbred mouse strains has revealed a segmented pattern in which regions of high and low variant density are intermixed. Furthermore, it has been suggested that allelic strain distribution patterns also occur in well defined blocks and consequently could be used to map quantitative trait loci (QTL) in comparisons between inbred strains. We report a detailed analysis of polymorphism distribution in multiple inbred mouse strains over a 4.8-megabase region containing a QTL influencing anxiety. Our analysis indicates that it is only partly true that the genomes of inbred strains exist as a patchwork of segments of sequence identity and difference. We show that the definition of haplotype blocks is not robust and that methods for QTL mapping may fail if they assume a simple block-like structure.
    Proceedings of the National Academy of Sciences 07/2004; 101(26):9734-9. · 9.68 Impact Factor
  • Source
    Article: Detection of large-scale variation in the human genome.
    [show abstract] [hide abstract]
    ABSTRACT: We identified 255 loci across the human genome that contain genomic imbalances among unrelated individuals. Twenty-four variants are present in > 10% of the individuals that we examined. Half of these regions overlap with genes, and many coincide with segmental duplications or gaps in the human genome assembly. This previously unappreciated heterogeneity may underlie certain human phenotypic variation and susceptibility to disease and argues for a more dynamic human genome structure.
    Nature Genetics 10/2004; 36(9):949-51. · 35.53 Impact Factor
  • Source
    Article: BLAT--the BLAST-like alignment tool.
    [show abstract] [hide abstract]
    ABSTRACT: Analyzing vertebrate genomes requires rapid mRNA/DNA and cross-species protein alignments. A new tool, BLAT, is more accurate and 500 times faster than popular existing tools for mRNA/DNA alignments and 50 times faster for protein alignments at sensitivity settings typically used when comparing vertebrate sequences. BLAT's speed stems from an index of all nonoverlapping K-mers in the genome. This index fits inside the RAM of inexpensive computers, and need only be computed once for each genome assembly. BLAT has several major stages. It uses the index to find regions in the genome likely to be homologous to the query sequence. It performs an alignment between homologous regions. It stitches together these aligned regions (often exons) into larger alignments (typically genes). Finally, BLAT revisits small internal exons possibly missed at the first stage and adjusts large gap boundaries that have canonical splice sites where feasible. This paper describes how BLAT was optimized. Effects on speed and sensitivity are explored for various K-mer sizes, mismatch schemes, and number of required index matches. BLAT is compared with other alignment programs on various test sets and then used in several genome-wide applications. http://genome.ucsc.edu hosts a web-based BLAT server for the human genome.
    Genome Research 05/2002; 12(4):656-64. · 13.61 Impact Factor

Full-text (2 Sources)

View
9 Downloads
Available from
24 Jan 2013

Keywords

annotated genes
 
cause gene dosage effects
 
Clustering strains
 
CNV profile recapitulates aspects
 
CNVs range
 
comparative genomic hybridization
 
copy number differences
 
copy number variants
 
disseminate phenotype data
 
diverse species
 
international effort
 
mapped segmental duplications
 
mouse genome
 
phenotypic variation
 
Published studies
 
recognized source
 
strain selection
 
strains
 
study 21 well-characterized inbred mouse strains
 
uncharacterized strains