[Show abstract][Hide abstract] ABSTRACT: Mutations create variation in the population, fuel evolution and cause genetic diseases. Current knowledge about de novo mutations is incomplete and mostly indirect. Here we analyze 11,020 de novo mutations from the whole genomes of 250 families. We show that de novo mutations in the offspring of older fathers are not only more numerous but also occur more frequently in early-replicating, genic regions. Functional regions exhibit higher mutation rates due to CpG dinucleotides and show signatures of transcription-coupled repair, whereas mutation clusters with a unique signature point to a new mutational mechanism. Mutation and recombination rates independently associate with nucleotide diversity, and regional variation in human-chimpanzee divergence is only partly explained by heterogeneity in mutation rate. Finally, we provide a genome-wide mutation rate map for medical and population genetics applications. Our results provide new insights and refine long-standing hypotheses about human mutagenesis.
[Show abstract][Hide abstract] ABSTRACT: Using a genome-wide screen of 9.6 million genetic variants achieved through 1000 Genomes Project imputation in 62,166 samples, we identify association to lipid traits in 93 loci, including 79 previously identified loci with new lead SNPs and 10 new loci, 15 loci with a low-frequency lead SNP and 10 loci with a missense lead SNP, and 2 loci with an accumulation of rare variants. In six loci, SNPs with established function in lipid genetics (CELSR2, GCKR, LIPC and APOE) or candidate missense mutations with predicted damaging function (CD300LG and TM6SF2) explained the locus associations. The low-frequency variants increased the proportion of variance explained, particularly for low-density lipoprotein cholesterol and total cholesterol. Altogether, our results highlight the impact of low-frequency variants in complex traits and show that imputation offers a cost-effective alternative to resequencing.
[Show abstract][Hide abstract] ABSTRACT: Small insertions and deletions (indels) and large structural variations (SVs) are major contributors to human genetic diversity and disease. However, mutation rates and characteristics of de novo indels and SVs in the general population have remained largely unexplored. We report 332 validated de novo structural changes identified in whole genomes of 250 families, including complex indels, retrotransposon insertions and interchromosomal events. These data indicate a mutation rate of 2.94 indels (1-20bp) and 0.16 SVs (>20bp) per generation. De novo structural changes affect on average 4.1kbp of genomic sequence and 29 coding bases per generation, which is 91 and 52 times more nucleotides than de novo substitutions, respectively. This contrasts with the equal genomic footprint of inherited SVs and substitutions. An excess of structural changes originated on paternal haplotypes. Additionally, we observed a non-uniform distribution of de novo SVs across offspring. These results reveal the importance of different mutational mechanisms to changes in human genome structure across generations.
Published by Cold Spring Harbor Laboratory Press.
Genome Research 04/2015; DOI:10.1101/gr.185041.114 · 13.85 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Wine is the most popular alcoholic beverage around the world and because of its importance in society has been widely studied. Understanding what drives its flavor has been a quest for decades but much is still unknown and will be determined at least in part by individual taste preferences. Recently studies in the genetics of taste have uncovered the role of different genes in the determination of food preferences giving new insight on its physiology. In this context we have performed a genome-wide association study on red and white wine liking using three isolated populations collected in Italy, and replicated our results on two additional populations coming from the Netherland and Central Asia for a total of 3885 samples. We have found a significant association (P=2.1 × 10(-8)) between white wine liking and rs9276975:C>T a polymorphism in the HLA-DOA gene encoding a non-canonical MHC II molecule, which regulates other MHC II molecules. The same association was also found with red wine liking (P=8.3 × 10(-6)). Sex-separated analysis have also revealed that the effect of HLA-DOA is twice as large in women as compared to men suggesting an interaction between this polymorphism and gender. Our results are one of the first examples of genome-wide association between liking of a commonly consumed food and gene variants. Moreover, our results suggest a role of the MHC system in the determination of food preferences opening new insight in this field in general.European Journal of Human Genetics advance online publication, 11 March 2015; doi:10.1038/ejhg.2015.34.
European journal of human genetics: EJHG 03/2015; DOI:10.1038/ejhg.2015.34 · 4.23 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Metabolite quantitative traits carry great promise for epidemiological studies, and their genetic background has been addressed using Genome-Wide Association Studies (GWAS). Thus far, the role of less common variants has not been exhaustively studied. Here, we set out a GWAS for metabolite quantitative traits in serum, followed by exome sequence analysis to zoom in on putative causal variants in the associated genes. 1H Nuclear Magnetic Resonance (1H-NMR) spectroscopy experiments yielded successful quantification of 42 unique metabolites in 2,482 individuals from The Erasmus Rucphen Family (ERF) study. Heritability of metabolites were estimated by SOLAR. GWAS was performed by linear mixed models, using HapMap imputations. Based on physical vicinity and pathway analyses, candidate genes were screened for coding region variation using exome sequence data. Heritability estimates for metabolites ranged between 10% and 52%. GWAS replicated three known loci in the metabolome wide significance: CPS1 with glycine (P-value = 1.27×10-32), PRODH with proline (P-value = 1.11×10-19), SLC16A9 with carnitine level (P-value = 4.81×10-14) and uncovered a novel association between DMGDH and dimethyl-glycine (P-value = 1.65×10-19) level. In addition, we found three novel, suggestively significant loci: TNP1 with pyruvate (P-value = 1.26×10-8), KCNJ16 with 3-hydroxybutyrate (P-value = 1.65×10-8) and 2p12 locus with valine (P-value = 3.49×10-8). Exome sequence analysis identified potentially causal coding and regulatory variants located in the genes CPS1, KCNJ2 and PRODH, and revealed allelic heterogeneity for CPS1 and PRODH. Combined GWAS and exome analyses of metabolites detected by high-resolution 1H-NMR is a robust approach to uncover metabolite quantitative trait loci (mQTL), and the likely causative variants in these loci. It is anticipated that insight in the genetics of intermediate phenotypes will provide additional insight into the genetics of complex traits.
[Show abstract][Hide abstract] ABSTRACT: To identify genetic variants associated with refractive astigmatism in the general population, meta-analyses of genome-wide association studies were performed for: White Europeans aged at least 25 years (20 cohorts, N = 31,968); Asian subjects aged at least 25 years (7 cohorts, N = 9,295); White Europeans aged <25 years (4 cohorts, N = 5,640); and all independent individuals from the above three samples combined with a sample of Chinese subjects aged <25 years (N = 45,931). Participants were classified as cases with refractive astigmatism if the average cylinder power in their two eyes was at least 1.00 diopter and as controls otherwise. Genome-wide association analysis was carried out for each cohort separately using logistic regression. Meta-analysis was conducted using a fixed effects model. In the older European group the most strongly associated marker was downstream of the neurexin-1 (NRXN1) gene (rs1401327, P = 3.92E-8). No other region reached genome-wide significance, and association signals were lower for the younger European group and Asian group. In the meta-analysis of all cohorts, no marker reached genome-wide significance: The most strongly associated regions were, NRXN1 (rs1401327, P = 2.93E-07), TOX (rs7823467, P = 3.47E-07) and LINC00340 (rs12212674, P = 1.49E-06). For 34 markers identified in prior GWAS for spherical equivalent refractive error, the beta coefficients for genotype versus spherical equivalent, and genotype versus refractive astigmatism, were highly correlated (r = -0.59, P = 2.10E-04). This work revealed no consistent or strong genetic signals for refractive astigmatism; however, the TOX gene region previously identified in GWAS for spherical equivalent refractive error was the second most strongly associated region. Analysis of additional markers provided evidence supporting widespread genetic co-susceptibility for spherical and astigmatic refractive errors.
[Show abstract][Hide abstract] ABSTRACT: Genome-wide association studies (GWAS) have revealed 74 single nucleotide polymorphisms (SNPs) associated with high-density lipoprotein cholesterol (HDL) blood levels. This study is, to our knowledge, the first genome-wide interaction study (GWIS) to identify SNP×SNP interactions associated with HDL levels. We performed a GWIS in the Rotterdam Study (RS) cohort I (RS-I) using the GLIDE tool which leverages the massively parallel computing power of Graphics Processing Units (GPUs) to perform linear regression on all genome-wide pairs of SNPs. By performing a meta-analysis together with Rotterdam Study cohorts II and III (RS-II and RS-III), we were able to filter 181 interaction terms with a p-value<1 · 10-8 that replicated in the two independent cohorts. We were not able to replicate any of these interaction term in the AGES, ARIC, CHS, ERF, FHS and NFBC-66 cohorts (Ntotal = 30,011) when adjusting for multiple testing. Our GWIS resulted in the consistent finding of a possible interaction between rs774801 in ARMC8 (ENSG00000114098) and rs12442098 in SPATA8 (ENSG00000185594) being associated with HDL levels. However, p-values do not reach the preset Bonferroni correction of the p-values. Our study suggest that even for highly genetically determined traits such as HDL the sample sizes needed to detect SNP×SNP interactions are large and the 2-step filtering approaches do not yield a solution. Here we present our analysis plan and our reservations concerning GWIS.
PLoS ONE 10/2014; 9(10):e109290. DOI:10.1371/journal.pone.0109290 · 3.53 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Refractive error (RE) is a complex, multifactorial disorder characterized by a mismatch between the optical power of the eye and its axial length that causes object images to be focused off the retina. The two major subtypes of RE are myopia (nearsightedness) and hyperopia (farsightedness), which represent opposite ends of the distribution of the quantitative measure of spherical refraction. We performed a fixed effects meta-analysis of genome-wide association results of myopia and hyperopia from 9 studies of European-derived populations: AREDS, KORA, FES, OGP-Talana, MESA, RSI, RSII, RSIII and ERF. One genome-wide significant region was observed for myopia, corresponding to a previously identified myopia locus on 8q12 (p = 1.25610 28), which has been reported by Kiefer et al. as significantly associated with myopia age at onset and Verhoeven et al. as significantly associated to mean spherical-equivalent (MSE) refractive error. We observed two genome-wide significant associations with hyperopia. These regions overlapped with loci on 15q14 (minimum p value = 9.11610 211) and 8q12 (minimum p value 1.82610 211) previously reported for MSE and myopia age at onset. We also used an intermarker linkage-disequilibrium-based method for calculating the effective number of tests in targeted regional replication analyses. We analyzed myopia (which represents the closest phenotype in our data to the one used by Kiefer et al.) and showed replication of 10 additional loci associated with myopia previously reported by Kiefer et al. This is the first replication of these loci using myopia as the trait under analysis. ''Replication-level'' association was also seen between hyperopia and 12 of Kiefer et al.'s published loci. For the loci that show evidence of association to both myopia and hyperopia, the estimated effect of the risk alleles were in opposite directions for the two traits. This suggests that these loci are important contributors to variation of refractive error across the distribution.
PLoS ONE 09/2014; 9(9). DOI:10.1371/journal.pone.0107110 · 3.53 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Elevated intraocular pressure (IOP) is an important risk factor in developing glaucoma, and variability in IOP might herald glaucomatous development or progression. We report the results of a genome-wide association study meta-analysis of 18 population cohorts from the International Glaucoma Genetics Consortium (IGGC), comprising 35,296 multi-ancestry participants for IOP. We confirm genetic association of known loci for IOP and primary open-angle glaucoma (POAG) and identify four new IOP-associated loci located on chromosome 3q25.31 within the FNDC3B gene (P = 4.19 × 10(-8) for rs6445055), two on chromosome 9 (P = 2.80 × 10(-11) for rs2472493 near ABCA1 and P = 6.39 × 10(-11) for rs8176693 within ABO) and one on chromosome 11p11.2 (best P = 1.04 × 10(-11) for rs747782). Separate meta-analyses of 4 independent POAG cohorts, totaling 4,284 cases and 95,560 controls, showed that 3 of these loci for IOP were also associated with POAG.
[Show abstract][Hide abstract] ABSTRACT: Whole-genome sequencing enables complete characterization of genetic variation, but geographic clustering of rare alleles demands many diverse populations be studied. Here we describe the Genome of the Netherlands (GoNL) Project, in which we sequenced the whole genomes of 250 Dutch parent-offspring families and constructed a haplotype map of 20.4 million single-nucleotide variants and 1.2 million insertions and deletions. The intermediate coverage (~13×) and trio design enabled extensive characterization of structural variation, including midsize events (30–500 bp) previously poorly catalogued and de novo mutations. We demonstrate that the quality of the haplotypes boosts imputation accuracy in independent samples, especially for lower frequency alleles. Population genetic analyses demonstrate fine-scale structure across the country and support multiple ancient migrations, consistent with historical changes in sea level and flooding. The GoNL Project illustrates how single-population whole-genome sequencing can provide detailed characterization of genetic variation and may guide the design of future population studies.
[Show abstract][Hide abstract] ABSTRACT: Although genome-wide association studies (GWAS) have identified many common variants associated with complex traits, low-frequency and rare variants have not been interrogated in a comprehensive manner. Imputation from dense reference panels, such as the 1000 Genomes Project (1000G), enables testing of ungenotyped variants for association. Here we present the results of imputation using a large, new population-specific panel: the Genome of The Netherlands (GoNL). We benchmarked the performance of the 1000G and GoNL reference sets by comparing imputation genotypes with 'true' genotypes typed on ImmunoChip in three European populations (Dutch, British, and Italian). GoNL showed significant improvement in the imputation quality for rare variants (MAF 0.05–0.5%) compared with 1000G. In Dutch samples, the mean observed Pearson correlation, r 2 , increased from 0.61 to 0.71. We also saw improved imputation accuracy for other European populations (in the British samples, r 2 improved from 0.58 to 0.65, and in the Italians from 0.43 to 0.47). A combined reference set comprising 1000G and GoNL improved the imputation of rare variants even further. The Italian samples benefitted the most from this combined reference (the mean r 2 increased from 0.47 to 0.50). We conclude that the creation of a large population-specific reference is advantageous for imputing rare variants and that a combined reference panel across multiple populations yields the best imputation results. European Journal of Human Genetics advance online publication, 4 June 2014; doi:10.1038/ejhg.2014.19
European journal of human genetics: EJHG 06/2014; DOI:10.1038/ejhg.2014.19 · 4.23 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Coffee, one of the most popular beverages in the world, contains many different physiologically active compounds with a potential impact on people’s health. Despite the recent attention given to the genetic basis of its consumption, very little has been done in understanding genes influencing coffee preference among different individuals. Given its markedly bitter taste, we decided to verify if bitter receptor genes (TAS2Rs) variants affect coffee liking. In this light, 4066 people from different parts of Europe and Central Asia filled in a field questionnaire on coffee liking. They have been consequently recruited and included in the study. Eighty-eight SNPs covering the 25 TAS2R genes were selected from the available imputed ones and used to run association analysis for coffee liking. A significant association was detected with three SNP: one synonymous and two functional variants (W35S and H212R) on the TAS2R43 gene. Both variants have been shown to greatly reduce in vitro protein activity. Surprisingly the wild type allele, which corresponds to the functional form of the protein, is associated to higher liking of coffee. Since the hTAS2R43 receptor is sensible to caffeine, we verified if the detected variants produced differences in caffeine bitter perception on a subsample of people coming from the FVG cohort. We found a significant association between differences in caffeine perception and the H212R variant but not with the W35S, which suggests that the effect of the TAS2R43 gene on coffee liking is mediated by caffeine and in particular by the H212R variant.
No other significant association was found with other TAS2R genes. In conclusion, the present study opens new perspectives in the understanding of coffee liking. Further studies are needed to clarify the role of the TAS2R43 gene in coffee hedonics and to identify which other genes and pathways are involved in its genetics.
PLoS ONE 03/2014; 9(3):e92065. DOI:10.1371/journal.pone.0092065 · 3.53 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Glaucoma is characterized by irreversible optic nerve degeneration and is the most frequent cause of irreversible blindness worldwide. Here, the International Glaucoma Genetics Consortium conducts a meta-analysis of genome-wide association studies of vertical cup-disc ratio (VCDR), an important disease-related optic nerve parameter. In 21,094 individuals of European ancestry and 6,784 individuals of Asian ancestry, we identify 10 new loci associated with variation in VCDR. In a separate risk-score analysis of five case-control studies, Caucasians in the highest quintile have a 2.5-fold increased risk of primary open-angle glaucoma as compared with those in the lowest quintile. This study has more than doubled the known loci associated with optic disc cupping and will allow greater understanding of mechanisms involved in this common blinding condition.
[Show abstract][Hide abstract] ABSTRACT: The heritability of borderline personality (BP) features has been established in multiple twin and family studies. Using data from the borderline subscale of the Personality Assessment Inventory Borderline Features Scale (PAI-BOR) collected in two Dutch cohorts (N=7125), the Netherlands Twin Register and The Netherlands Study of Depression and Anxiety, we show that heritability of the PAI-BOR total score using genome-wide single-nucleotide polymorphism (SNPs) is estimated at 23%, and that the genetic variance is substantially higher in affect instability items compared with the other three subscales of the PAI-BOR (42.7% vs non-significant estimates for self-harm, negative relations and identity problems). We present results from a first genome-wide association study of BP features, which shows a promising signal on chromosome 5 corresponding to SERINC5, a protein involved in myelination. Reduced myelination has been suggested as possibly having a role in the development of psychiatric disorders characterized by lack of social interaction. The signal was confirmed in a third independent Dutch cohort drawn from the Erasmus Rucphen Family study (N=1301). Our analyses were complemented by investigating the heterogeneity that was implied by the differences in genetic variance components in the four subscales of the PAI-BOR. These analyses show that the association of SNPs tagging SERINC5 differs substantially across the 24 items of the PAI-BOR. Further, using reverse regression we showed that the effects were present only in subjects with higher scores on the PAI-BOR. Taken together, these results suggest that future genome-wide analyses can benefit substantially by taking into account the phenotypic and genetic heterogeneity of BP features.Molecular Psychiatry advance online publication, 27 August 2013; doi:10.1038/mp.2013.109.
[Show abstract][Hide abstract] ABSTRACT: Within the Netherlands a national network of biobanks has been established (Biobanking and Biomolecular Research Infrastructure-Netherlands (BBMRI-NL)) as a national node of the European BBMRI. One of the aims of BBMRI-NL is to enrich biobanks with different types of molecular and phenotype data. Here, we describe the Genome of the Netherlands (GoNL), one of the projects within BBMRI-NL. GoNL is a whole-genome-sequencing project in a representative sample consisting of 250 trio-families from all provinces in the Netherlands, which aims to characterize DNA sequence variation in the Dutch population. The parent-offspring trios include adult individuals ranging in age from 19 to 87 years (mean=53 years; SD=16 years) from birth cohorts 1910-1994. Sequencing was done on blood-derived DNA from uncultured cells and accomplished coverage was 14-15x. The family-based design represents a unique resource to assess the frequency of regional variants, accurately reconstruct haplotypes by family-based phasing, characterize short indels and complex structural variants, and establish the rate of de novo mutational events. GoNL will also serve as a reference panel for imputation in the available genome-wide association studies in Dutch and other cohorts to refine association signals and uncover population-specific variants. GoNL will create a catalog of human genetic variation in this sample that is uniquely characterized with respect to micro-geographic location and a wide range of phenotypes. The resource will be made available to the research and medical community to guide the interpretation of sequencing projects. The present paper summarizes the global characteristics of the project.European Journal of Human Genetics advance online publication, 29 May 2013; doi:10.1038/ejhg.2013.118.
European journal of human genetics: EJHG 05/2013; 22(2). DOI:10.1038/ejhg.2013.118 · 4.23 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Interindividual variation in mean leukocyte telomere length (LTL) is associated with cancer and several age-associated diseases. We report here a genome-wide meta-analysis of 37,684 individuals with replication of selected variants in an additional 10,739 individuals. We identified seven loci, including five new loci, associated with mean LTL (P < 5 × 10(-8)). Five of the loci contain candidate genes (TERC, TERT, NAF1, OBFC1 and RTEL1) that are known to be involved in telomere biology. Lead SNPs at two loci (TERC and TERT) associate with several cancers and other diseases, including idiopathic pulmonary fibrosis. Moreover, a genetic risk score analysis combining lead variants at all 7 loci in 22,233 coronary artery disease cases and 64,762 controls showed an association of the alleles associated with shorter LTL with increased risk of coronary artery disease (21% (95% confidence interval, 5-35%) per standard deviation in LTL, P = 0.014). Our findings support a causal role of telomere-length variation in some age-related diseases.
[Show abstract][Hide abstract] ABSTRACT: Visual refractive errors are complex genetic traits with a largely unknown etiology. To date, genome-wide association studies (GWAS) of moderate size have identified several novel risk markers for refractive error, measured here as mean spherical equivalent. We performed a GWAS using a total of 7,280 samples from 5 cohorts: the Age-Related Eye Disease Study (AREDS); the KORA study ("Cooperative Health Research in the Region of Augsburg"); the Framingham Eye Study (FES); the Ogliastra Genetic Park-Talana (OGP-Talana) Study; and the Multiethnic Study of Atherosclerosis (MESA). Genotyping was performed on Illumina and Affymetrix platforms with additional markers imputed to the HapMap II reference panel. We identified a new genome-wide significant locus on chromosome 16 (rs10500355, p=3.9 x 10-9) in a combined discovery and replication set (26,953 samples). This SNP is located within the RBFOX1 gene which is a neuron-specific splicing factor regulating a wide range of alternative splicing events implicated in neuronal development and maturation, including transcription factors, other splicing factors and synaptic proteins.
Human Molecular Genetics 03/2013; DOI:10.1093/hmg/ddt116 · 6.68 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Refractive error is the most common eye disorder worldwide and is a prominent cause of blindness. Myopia affects over 30% of Western populations and up to 80% of Asians. The CREAM consortium conducted genome-wide meta-analyses, including 37,382 individuals from 27 studies of European ancestry and 8,376 from 5 Asian cohorts. We identified 16 new loci for refractive error in individuals of European ancestry, of which 8 were shared with Asians. Combined analysis identified 8 additional associated loci. The new loci include candidate genes with functions in neurotransmission (GRIA4), ion transport (KCNQ5), retinoic acid metabolism (RDH5), extracellular matrix remodeling (LAMA2 and BMP2) and eye development (SIX6 and PRSS56). We also confirmed previously reported associations with GJD2 and RASGRF1. Risk score analysis using associated SNPs showed a tenfold increased risk of myopia for individuals carrying the highest genetic load. Our results, based on a large meta-analysis across independent multiancestry studies, considerably advance understanding of the mechanisms involved in refractive error and myopia.
[Show abstract][Hide abstract] ABSTRACT: Background
The 9p21.3 locus is strongly associated with the risk of coronary artery disease (CAD) and with type 2 diabetes (T2D). We investigated the association of 9p21.3 variants with severity of CAD (defined by the number of vessel diseased [VD]) in the presence and absence of T2D.
We tested 11 9p21.3-variants for association in a white Italian study (N = 2,908), and carried out replication in 2 independent white populations, a German study (N = 2,028) and a Canadian Study (N=950). SNP association and permutation analyses were conducted.
We identified two 9p21.3-variants, rs4977574 (P < 4×10-4) and rs2383207 (P < 1.5×10-3) that were associated with severity of CAD in subjects without T2D. Association of rs4977574 with severity of CAD was confirmed in the Canadian Study. Results from subgroup analysis among patients with T2D showed an interaction between rs10738610 and T2D with P = 4.82×10-2. Further investigation showed that rs10738610 (P < 1.99×10-2) was found to be significantly associated with severity of CAD in subjects with T2D.
The 9p21.3 locus is significantly associated with severity of CAD. The number of associations of 9p21.3 variants with severity of CAD is variable to the presence and absence of T2D. In a CAD-susceptible region of 115 kb, there is only one variant associated with the severity of coronary vessel disease in the presence of type 2 diabetes.
BMC Medical Genetics 01/2013; 14(1):11. DOI:10.1186/1471-2350-14-11 · 2.45 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: In this study, Prokopenko and colleagues provide novel evidence for causal relationship between adiposity and heart failure and increased liver enzymes using a Mendelian randomization study design.
Please see later in the article for the Editors' Summary