
Jeremy F McRae
- PhD
- Bioinformatics Scientist at Illumina
About
83 Publications
26,344 Reads
9,819 Citations
Publications (83)
Polygenic scores (PGSs), increasingly used in clinical settings, frequently include many genetic variants, with performance typically peaking at thousands of variants. Such highly parameterized PGSs often include variants that do not pass a genome-wide significance threshold. We propose a mathematical perspective that renders the effects of many of...
Accurately predicting the impact of genetic variants is essential for interpreting genomic data, yet no consensus exists on how to measure classifier performance. We prepared the most comprehensive set of benchmarks to date and applied them to the recently published models PrimateAI-3D and AlphaMissense. PrimateAI-3D outperforms AlphaMissense on ra...
Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole-genome sequencing data for 809 individuals from 233 primate species and identified 4.3 mill...
We examined 454,712 exomes for genes associated with a wide spectrum of complex traits and common diseases and observed that rare, penetrant mutations in genes implicated by genome-wide association studies confer ~10-fold larger effects than common variants in the same genes. Consequently, an individual at the phenotypic extreme and at the greatest...
Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole genome sequencing data for 809 individuals from 233 primate species, and identified 4.3 mil...
Mutation in the germline generates all evolutionary genetic variation and is a cause of genetic disease. Parental age is the primary determinant of the number of new germline mutations in an individual’s genome [1,2]. Here we analysed the genome-wide sequences of 21,879 families with rare genetic diseases and identified 12 individuals with a hypermut...
Mutation in the germline is the source of all evolutionary genetic variation and a cause of genetic disease. Previous studies have shown parental age to be the primary determinant of the number of new germline mutations seen in an individual's genome. Here we analysed the genome-wide sequences of 21,879 families with rare genetic diseases and ident...
Over 130 X-linked genes have been robustly associated with developmental disorders, and X-linked causes have been hypothesised to underlie the higher developmental disorder rates in males. Here, we evaluate the burden of X-linked coding variation in 11,044 developmental disorder patients, and find a similar rate of X-linked causes in males and fema...
De novo mutations in protein-coding genes are a well-established cause of developmental disorders. However, genes known to be associated with developmental disorders account for only a minority of the observed excess of such de novo mutations. Here, to identify previously undescribed genes associated with developmental disorders, we integrate healt...
Over 130 X-linked genes have been robustly associated with developmental disorders (DDs), and X-linked causes have been hypothesised to underlie the higher DD rates in males. We evaluated the burden of X-linked coding variation in 11,046 DD patients, and found a similar rate of X-linked causes in males and females (6.0% and 6.9%, respectively), ind...
De novo mutations (DNMs) in protein-coding genes are a well-established cause of developmental disorders (DD). However, known DD-associated genes only account for a minority of the observed excess of such DNMs. To identify novel DD-associated genes, we integrated healthcare and research exome sequences on 31,058 DD parent-offspring trios, and devel...
Trio-based whole-exome sequence (WES) data have established confident genetic diagnoses in ∼40% of previously undiagnosed individuals recruited to the Deciphering Developmental Disorders (DDD) study. Here we aim to use the breadth of phenotypic information recorded in DDD to augment diagnosis and disease variant discovery in probands. Median Euclid...
Mosaic genetic variants can have major clinical impact. We systematically analyse trio exome sequence data from 4,293 probands from the DDD Study with severe developmental disorders for pathogenic postzygotic mosaicism (PZM) in the child or a clinically-unaffected parent, and use ultrahigh-depth sequencing to validate candidate mosaic variants. We...
Approximately 2% of de novo single-nucleotide variants (SNVs) appear as part of clustered mutations that create multinucleotide variants (MNVs). MNVs are an important source of genomic variability as they are more likely to alter an encoded protein than a SNV, which has important implications in disease as well as evolution. Previous studies of MNV...
We delineate a KMT2E-related neurodevelopmental disorder on the basis of 38 individuals in 36 families. This study includes 31 distinct heterozygous variants in KMT2E (28 ascertained from Matchmaker Exchange and three previously reported), and four individuals with chromosome 7q22.2-22.23 microdeletions encompassing KMT2E (one previously reported)....
The occurrence of non-epileptic hyperkinetic movements in the context of developmental epileptic encephalopathies is an increasingly recognized phenomenon. Identification of causative mutations provides an important insight into common pathogenic mechanisms that cause both seizures and abnormal motor control. We report bi-allelic loss-of-function C...
In the version of this article originally published, the name of author Serafim Batzoglou was misspelled. The error has been corrected in the HTML and PDF versions of the article.
The splicing of pre-mRNAs into mature transcripts is remarkable for its precision, but the mechanisms by which the cellular machinery achieves such specificity are incompletely understood. Here, we describe a deep neural network that accurately predicts splice junctions from an arbitrary pre-mRNA transcript sequence, enabling precise prediction of...
Mutations which perturb normal pre-mRNA splicing are significant contributors to human disease. We used exome sequencing data from 7,833 probands with developmental disorders (DD) and their unaffected parents, as well as >60,000 aggregated exomes from the Exome Aggregation Consortium, to investigate selection around the splice site, and quantify th...
Genetic architecture of developmental disorders
The genetics of developmental disorders (DDs) is complex. Martin et al. wanted to determine the degree of recessive inheritance of DDs in protein-coding genes. They examined the exomes of more than 6000 families in populations with high and low proportions of consanguineous marriages. They found that...
There are thousands of rare human disorders that are caused by single deleterious, protein-coding genetic variants [1]. However, patients with the same genetic defect can have different clinical presentations [2-4], and some individuals who carry known disease-causing variants can appear unaffected [5]. Here, to understand what explains these differences, w...
Millions of human genomes and exomes have been sequenced, but their clinical applications remain limited due to the difficulty of distinguishing disease-causing mutations from benign genetic variation. Here we demonstrate that common missense variants in other primate species are largely clinically benign in human, enabling pathogenic mutations to...
There are thousands of rare human disorders caused by a single deleterious, protein-coding genetic variant. However, patients with the same genetic defect can have different clinical presentations, and some individuals carrying known disease-causing variants can appear unaffected. What explains these differences? Here, we show in a cohort of 6,987 c...
We previously estimated that 42% of patients with severe developmental disorders carry pathogenic de novo mutations in coding sequences. The role of de novo mutations in regulatory elements affecting genes associated with developmental disorders, or other genes, has been essentially unexplored. We identified de novo mutations in three classes of pu...
Purpose: Given the rapid pace of discovery in rare disease genomics, it is likely that improvements in diagnostic yield can be made by systematically reanalyzing previously generated genomic sequence data in light of new knowledge. Methods: We tested this hypothesis in the United Kingdom–wide Deciphering Developmental Disorders study, where in 2014...
Histone lysine methyltransferases (KMTs) and demethylases (KDMs) underpin gene regulation. Here we demonstrate that variants causing haploinsufficiency of KMTs and KDMs are frequently encountered in individuals with developmental disorders. Using a combination of human variation databases and existing animal models, we determine 22 KMTs and KDMs as...
Large exome-sequencing datasets offer an unprecedented opportunity to understand the genetic architecture of rare diseases, informing clinical genetics counseling and optimal study designs for disease gene identification. We analyzed 7,448 exome-sequenced families from the Deciphering Developmental Disorders study, and, for the first time, estimate...
De novo mutations in hundreds of different genes collectively cause 25-42% of severe developmental disorders (DD). The cause in the remaining cases is largely unknown. The role of de novo mutations in regulatory elements affecting known DD-associated genes or other genes is essentially unexplored. We identified de novo mutations in three classes of...
Purpose: To characterize features associated with de novo mutations affecting SATB2 function in individuals ascertained on the basis of intellectual disability. Methods: Twenty previously unreported individuals with 19 different SATB2 mutations (11 loss-of-function and 8 missense variants) were studied. Fibroblasts were used to measure mutant prote...
The genomes of individuals with severe, undiagnosed developmental disorders are enriched in damaging de novo mutations (DNMs) in developmentally important genes. Here we have sequenced the exomes of 4,293 families containing individuals with developmental disorders, and meta-analysed these data with data from another 3,287 individuals with similar...
The ubiquitin fold modifier 1 (UFM1) cascade is a recently identified evolutionarily conserved ubiquitin-like modification system whose function and link to human disease have remained largely uncharacterized. By using exome sequencing in Finnish individuals with severe epileptic syndromes, we identified pathogenic compound heterozygous variants in...
Congenital heart defects (CHDs) have a neonatal incidence of 0.8–1% (refs. 1,2). Despite abundant examples of monogenic CHD in humans and mice, CHD has a low absolute sibling recurrence risk (~2.7%) (ref. 3), suggesting a considerable role for de novo mutations (DNMs) and/or incomplete penetrance (refs. 4,5). De novo protein-truncating variants (PTVs) have been sho...
Intellectual disability (ID) is a common condition with considerable genetic heterogeneity. Next-generation sequencing of large cohorts has identified an increasing number of genes implicated in ID, but their roles in neurodevelopment remain largely unexplored. Here we report an ID syndrome caused by de novo heterozygous missense, nonsense, and fra...
Individuals with severe, undiagnosed developmental disorders (DDs) are enriched for damaging de novo mutations (DNMs) in developmentally important genes. We exome sequenced 4,293 families with individuals with DDs, and meta-analysed these data with published data on 3,287 individuals with similar disorders. We show that the most significant factors...
By analyzing the whole-exome sequences of 4,264 schizophrenia cases, 9,343 controls and 1,077 trios, we identified a genome-wide significant association between rare loss-of-function (LoF) variants in SETD1A and risk for schizophrenia (P = 3.3 × 10⁻⁹). We found only two heterozygous LoF variants in 45,376 exomes from individuals without a neurops...
Schizophrenia is a common, debilitating psychiatric disorder with a substantial genetic component. By analysing the whole-exome sequences of 4,264 schizophrenia cases, 9,343 controls, and 1,077 parent-proband trios, we identified a genome-wide significant association between rare loss-of-function (LoF) variants in KMT2F and risk for schizophrenia....
Discovery of most autosomal recessive disease-associated genes has involved analysis of large, often consanguineous multiplex families or small cohorts of unrelated individuals with a well-defined clinical condition. Discovery of new dominant causes of rare, genetically heterogeneous developmental disorders has been revolutionized by exome analysis...
Despite three decades of successful, predominantly phenotype-driven discovery of the genetic causes of monogenic disorders, up to half of children with severe developmental disorders of probable genetic origin remain without a genetic diagnosis. Particularly challenging are those disorders rare enough to have eluded recognition as a discrete clinic...
Background: Human genome sequencing has transformed our understanding of genomic variation and its relevance to health and disease, and is now starting to enter clinical practice for the diagnosis of rare diseases. The question of whether and how some categories of genomic findings should be shared with individual research participants is currently...
Humans vary in acuity to many odors [1-4], with variation within olfactory receptor (OR) genes contributing to these differences [5-9]. How such variation also affects odor experience and food selection remains uncertain [10], given that such effects occur for taste [11-15]. Here we investigate β-ionone, which shows extreme sensitivity differences...
Humans vary in their ability to smell numerous odors [1-3], including those associated with food [4-6]. Odor sensitivity is heritable [7-11], with examples linking genetic variation for sensitivity to specific odors typically located near olfactory receptor (OR) genes [12-16]. However, with thousands of aromas and few deorphaned ORs [17, 18], there...
The ability to detect many odors varies among individuals; however, the contribution of genotype to this variation has been assessed for relatively few compounds. We have identified a genetic basis for the ability to detect the flavor compound cis-3-hexen-1-ol. This compound is typically described as "green grassy" or the smell of "cut grass," with...
The human genetic revolution is upon us, and while at this stage, it is restricted to providing fundamental insights on human traits, the notion underpinning the present chapter is that an understanding of human genetic variability, in the not too distant future, will be turned into insights that can systematically inform innovation of desirable pr...
Odour perception is controlled by environmental and genetic influences. Most people can discriminate over 10,000 different odours, but the molecular basis of this ability is poorly understood. Little is known about which odorant receptors (ORs) detect what odour compounds, and additional research is required to gain knowledge of why detection thres...