Massively parallel sequencing of the mouse exome to accurately identify rare, induced mutations: an immediate source for thousands of new mouse models

Immunogenomics Laboratory, Australian National University, GPO Box 334, Canberra City, Australian Capital Territory, 2601, Australia.
Open Biology (Impact Factor: 4.56). 05/2012; 2(5):120061. DOI: 10.1098/rsob.120061
Source: PubMed

ABSTRACT: Accurate identification of sparse heterozygous single-nucleotide variants (SNVs) is a critical challenge for identifying the causative mutations in mouse genetic screens, human genetic diseases and cancer. Causal DNA variants that occur at such low rates are overwhelmed by false-positive calls that arise from a range of technical and biological sources. We describe a strategy using whole-exome capture, massively parallel DNA sequencing and computational analysis, which identifies with a low false-positive rate the majority of heterozygous and homozygous SNVs arising de novo with a frequency of one nucleotide substitution per megabase in progeny of N-ethyl-N-nitrosourea (ENU)-mutated C57BL/6j mice. We found that by applying a strategy of filtering raw SNV calls against known and platform-specific variants we could call true SNVs with a false-positive rate of 19.4 per cent and an estimated false-negative rate of 21.3 per cent. These error rates are small enough to enable calling a causative mutation from both homozygous and heterozygous candidate mutation lists with little or no further experimental validation. The efficacy of this approach is demonstrated by identifying the causative mutation in the Ptprc gene in a lymphocyte-deficient strain and in 11 other strains with immune disorders or obesity, without the need for meiotic mapping. Exome sequencing of first-generation mutant mice revealed hundreds of unphenotyped protein-changing mutations, 52 per cent of which are predicted to be deleterious, which now become available for breeding and experimental analysis. We show that exome sequencing data alone are sufficient to identify induced mutations. This approach transforms genetic screens in mice, establishes a general strategy for analysing rare DNA variants and opens up a large new source for experimental models of human disease.
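
The filtering step named in the abstract (removing raw SNV calls that match known or platform-specific variants) can be pictured as a set subtraction. The sketch below is only an illustration of that idea, not the authors' pipeline: the `SNV` record, the `filter_candidates` function and all coordinates are hypothetical, and a real analysis would read calls from variant files rather than from in-memory lists.

```python
from typing import Iterable, List, NamedTuple


class SNV(NamedTuple):
    """A single-nucleotide variant call (hypothetical minimal record)."""
    chrom: str
    pos: int
    ref: str
    alt: str


def filter_candidates(
    raw_calls: Iterable[SNV],
    known_variants: Iterable[SNV],
    platform_artifacts: Iterable[SNV],
) -> List[SNV]:
    """Keep only raw calls absent from both exclusion lists."""
    excluded = set(known_variants) | set(platform_artifacts)
    return [snv for snv in raw_calls if snv not in excluded]


# Toy data (assumed, not from the paper).
raw_calls = [
    SNV("chr1", 1000, "A", "G"),  # matches a known variant
    SNV("chr1", 2000, "C", "T"),  # survives filtering
    SNV("chr2", 3000, "G", "A"),  # matches a platform-specific recurrent call
]
known_variants = {SNV("chr1", 1000, "A", "G")}
platform_artifacts = {SNV("chr2", 3000, "G", "A")}

candidates = filter_candidates(raw_calls, known_variants, platform_artifacts)
print(candidates)
```

Matching on the full (chromosome, position, reference, alternate) tuple is a design choice of this sketch; matching on position alone would be stricter and would also discard different substitutions at a known variant site.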

  • ABSTRACT: The 22q11.2 deletion syndrome (22q11DS) is thought to be a contiguous gene syndrome caused by haploinsufficiency for a variable number of genes with overlapping function during the development of the craniofacial, pharyngeal and cardiac structures. The complexity of genetic and developmental anomalies resulting in 22q11DS has made attributing causation to specific genes difficult. The CRKL gene resides within the common 3-Mb region, most frequently affected in 22q11DS, and has been shown to play an essential role in the development of tissues affected in 22q11DS. Here, we report the characterisation of a mouse strain we named 'snoopy', harbouring a novel Crkl splice-site mutation that results in a loss of Crkl expression. The snoopy strain exhibits a variable phenotype that includes micrognathia, pharyngeal occlusion, aglossia and holoprosencephaly, and altered retinoic acid and endothelin signalling. Together, these features are reminiscent of malformations occurring in auriculocondylar syndrome and agnathia-otocephaly complex, two conditions not previously associated with CRKL function. Comparison of the features of a cohort of patients harbouring small 22q11.2 deletions centred over the CRKL gene, but sparing TBX1, highlights the role of CRKL in contributing to the craniofacial features of 22q11DS. These analyses demonstrate the central role of Crkl in regulating signalling events in the developing oropharyngeal complex and its potential to contribute to dysmorphology.
    Molecular syndromology 12/2014; 5(6):276-86. DOI:10.1159/000368865
  • ABSTRACT: Genome-wide saturation mutagenesis and subsequent phenotype-driven screening have been central to a comprehensive understanding of complex biological processes in classical model organisms such as flies, nematodes, and plants. The degree of "saturation" (i.e., the fraction of possible target genes identified) has been shown to be a critical parameter in determining all relevant genes involved in a biological function, without prior knowledge of their products. In mammalian model systems, however, the relatively large scale and labor intensity of experiments have hampered the achievement of actual saturation mutagenesis, especially for recessive traits that require biallelic mutations to manifest detectable phenotypes. By exploiting the recently established haploid mouse embryonic stem cells (ESCs), we present an implementation of almost complete saturation mutagenesis in a mammalian system. The haploid ESCs were mutagenized with the chemical mutagen N-ethyl-N-nitrosourea (ENU) and processed for the screening of mutants defective in various steps of the glycosylphosphatidylinositol-anchor biosynthetic pathway. The resulting 114 independent mutant clones were characterized by a functional complementation assay, and were shown to be defective in any of 20 genes among all 22 known genes essential for this well-characterized pathway. Ten mutants were further validated by whole-exome sequencing. The predominant generation of single-nucleotide substitutions by ENU resulted in a gene mutation rate proportional to the length of the coding sequence, which facilitated the experimental design of saturation mutagenesis screening with the aid of computational simulation. Our study enables mammalian saturation mutagenesis to become a realistic proposition. Computational simulation, combined with a pilot mutagenesis experiment, could serve as a tool for the estimation of the number of genes essential for biological processes such as drug target pathways when a positive selection of mutants is available.
    BMC Genomics 11/2014; 15(1):1016. DOI:10.1186/1471-2164-15-1016 · 4.04 Impact Factor
  • ABSTRACT: The generation of naïve T lymphocytes is critical for immune function, yet the mechanisms governing their maturation remain incompletely understood. We have identified a mouse mutant, bloto, that harbors a hypomorphic mutation in the zinc finger protein Zfp335. Zfp335(bloto/bloto) mice exhibit a naïve T cell deficiency due to an intrinsic developmental defect that begins to manifest in the thymus and continues into the periphery, affecting T cells that have recently undergone thymic egress. The effects of Zfp335(bloto) are multigenic and cannot be attributed to altered thymic selection, proliferation or Bcl2-dependent survival. Zfp335 binds to promoter regions via a consensus motif, and its target genes are enriched in categories related to protein metabolism, mitochondrial function, and transcriptional regulation. Restoring the expression of one target, Ankle2, partially rescues T cell maturation. These findings identify Zfp335 as a transcription factor and essential regulator of late-stage intrathymic and post-thymic T cell maturation.
    eLife Sciences 10/2014; 3. DOI:10.7554/eLife.03549 · 8.52 Impact Factor
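
The saturation-mutagenesis abstract in the list above states that ENU produced a per-gene mutation rate proportional to coding-sequence length and that computational simulation helped design the screen. A minimal Monte Carlo sketch of that kind of simulation follows. It is an assumption-laden illustration, not the published method: the function name `simulate_genes_hit`, the trial count and the coding-sequence lengths are all made up, and only the figures of 22 genes and 114 clones come from the abstract.

```python
import random
from typing import Sequence


def simulate_genes_hit(
    cds_lengths: Sequence[int],
    n_clones: int,
    n_trials: int = 1000,
    seed: int = 0,
) -> float:
    """Mean number of distinct genes hit when each mutant clone disrupts
    one gene, chosen with probability proportional to its CDS length."""
    rng = random.Random(seed)
    genes = range(len(cds_lengths))
    total_distinct = 0
    for _ in range(n_trials):
        hits = rng.choices(genes, weights=cds_lengths, k=n_clones)
        total_distinct += len(set(hits))
    return total_distinct / n_trials


# Hypothetical coding-sequence lengths (bp) for 22 pathway genes.
cds_lengths = [600 + 150 * i for i in range(22)]

mean_hit = simulate_genes_hit(cds_lengths, n_clones=114)
print(f"Mean distinct genes hit by 114 clones: {mean_hit:.2f} of {len(cds_lengths)}")
```

Rerunning the function over a range of `n_clones` values gives a rough saturation curve, which is one way a pilot experiment could be scaled up under the stated proportionality assumption.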
