[Show abstract][Hide abstract] ABSTRACT: Using variants from the 1000 Genomes Project pilot European CEU dataset and data from additional resequencing studies, we densely genotyped 183 non-HLA risk loci previously associated with immune-mediated diseases in 12,041 individuals with celiac disease (cases) and 12,228 controls. We identified 13 new celiac disease risk loci reaching genome-wide significance, bringing the number of known loci (including the HLA locus) to 40. We found multiple independent association signals at over one-third of these loci, a finding that is attributable to a combination of common, low-frequency and rare genetic variants. Compared to previously available data such as those from HapMap3, our dense genotyping in a large sample collection provided a higher resolution of the pattern of linkage disequilibrium and suggested localization of many signals to finer scale regions. In particular, 29 of the 54 fine-mapped signals seemed to be localized to single genes and, in some instances, to gene regulatory elements. Altogether, we define the complex genetic architecture of the risk regions of and refine the risk signals for celiac disease, providing the next step toward uncovering the causal mechanisms of the disease.
[Show abstract][Hide abstract] ABSTRACT: In invertebrates that harbor functional DNA methylation enzymatic machinery, gene-bodies are the primary targets for CpG methylation. However, virtually all other aspects of invertebrate DNA methylation have remained a mystery until now. Here, using a comparative methylomics approach, we demonstrate that Nematostella vectensis, Ciona intestinalis, Apis mellifera, and Bombyx mori show two distinct populations of genes differentiated by gene-body CpG density. Genome-scale DNA methylation profiles for A. mellifera spermatozoa reveal CpG-poor genes are methylated in the germline, as predicted by the depletion of CpGs. We find an evolutionarily conserved distinction between CpG-poor and GpC-rich genes: The former are associated with basic biological processes, the latter with more specialized functions. This distinction is strikingly similar to that recently observed between euchromatin-associated genes in Drosophila that contain intragenic histone 3 lysine 36 trimethylation (H3K36me3) and those that do not, even though Drosophila does not display CpG density bimodality or methylation. We confirm that a significant number of CpG-poor genes in N. vectensis, C. intestinalis, A. mellifera, and B. mori are orthologs of H3K36me3-rich genes in Drosophila. We propose that over evolutionary time, gene-body H3K36me3 has influenced gene-body DNA methylation levels and, consequently, the gene-body CpG density bimodality characteristic of invertebrates that harbor CpG methylation.
Genome Research 09/2011; 21(11):1841-50. · 14.40 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: The prognosis of acute severe ulcerative colitis (ASC) influences therapeutic decisions, but data on prevalence or long-term outcome are few.
A systematic review of all patients with UC diagnosed in Oxford was performed to assess the prevalence of ASC defined by Truelove and Witts' (TW) criteria and determine whether outcome is related to disease activity on admission, likelihood of recurrence and long-term prognosis.
750 patients (median follow up 12.7 yr, range 0-648 mo) met inclusion criteria out of a total cohort of 1853 patients. 24.8% (186/750) had at least one admission for ASC (294 admissions in 186 patients). Overall, 12% (93/750) had a colectomy, compared to 39.8% (74/186) of patients with one or more episodes of ASC (p<0.0001) and 3.4% (19/564) in those with no admission. The colectomy rate on first admission (37/186, 19.9%) was lower than on the second or subsequent admissions (OR 2.35, 95% CI 1.33-4.14, p=0.003), being 29.0%, 36.6%, 38.2% after two, three, or subsequent episodes respectively. It was 8.5% (11/129) if patients had one TW criterion in addition to ≥6 bloody bowel motions/day, compared to 31% (29/94) if two additional criteria were present and 48% (34/71) if three or more additional criteria were present (p=1.4 × 10⁻⁵; OR 4.35, 95% CI 2.20-8.56 one criterion vs two or more).
A quarter of all patients with ulcerative colitis experience at least one episode of ASC; 20% come to colectomy on first admission, but 40% after two admissions. The likelihood of colectomy is related to biological severity on admission.
Journal of Crohn s and Colitis 10/2010; 4(4):431-7. · 3.39 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: We performed a second-generation genome-wide association study of 4,533 individuals with celiac disease (cases) and 10,750 control subjects. We genotyped 113 selected SNPs with P(GWAS) < 10(-4) and 18 SNPs from 14 known loci in a further 4,918 cases and 5,684 controls. Variants from 13 new regions reached genome-wide significance (P(combined) < 5 x 10(-8)); most contain genes with immune functions (BACH2, CCR4, CD80, CIITA-SOCS1-CLEC16A, ICOSLG and ZMIZ1), with ETS1, RUNX3, THEMIS and TNFRSF14 having key roles in thymic T-cell selection. There was evidence to suggest associations for a further 13 regions. In an expression quantitative trait meta-analysis of 1,469 whole blood samples, 20 of 38 (52.6%) tested loci had celiac risk variants correlated (P < 0.0028, FDR 5%) with cis gene expression.
[Show abstract][Hide abstract] ABSTRACT: Many disease-associated variants identified by genome-wide association (GWA) studies are expected to regulate gene expression. Allele-specific expression (ASE) quantifies transcription from both haplotypes using individuals heterozygous at tested SNPs. We performed deep human transcriptome-wide resequencing (RNA-seq) for ASE analysis and expression quantitative trait locus discovery. We resequenced double poly(A)-selected RNA from primary CD4(+) T cells (n = 4 individuals, both activated and untreated conditions) and developed tools for paired-end RNA-seq alignment and ASE analysis. We generated an average of 20 million uniquely mapping 45 base reads per sample. We obtained sufficient read depth to test 1371 unique transcripts for ASE. Multiple biases inflate the false discovery rate which we estimate to be approximately 50% for random SNPs. However, after controlling for these biases and considering the subset of SNPs that pass HapMap QC, 4.6% of heterozygous SNP-sample pairs show evidence of imbalance (P < 0.001). We validated four findings by both bacterial cloning and Sanger sequencing assays. We also found convincing evidence for allelic imbalance at multiple reporter exonic SNPs in CD6 for two samples heterozygous at the multiple sclerosis-associated variant rs17824933, linking GWA findings with variation in gene expression. Finally, we show in CD4(+) T cells from a further individual that high-throughput sequencing of genomic DNA and RNA-seq following enrichment for targeted gene sequences by sequence capture methods offers an unbiased means to increase the read depth for transcripts of interest, and therefore a method to investigate the regulatory role of many disease-associated genetic variants.
Human Molecular Genetics 10/2009; 19(1):122-34. · 7.69 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Chronic inflammatory diseases have been at the forefront of the new genome-wide association study era. Conditions such as coeliac disease, type 1 diabetes, Crohn's disease and ulcerative colitis have all benefited with multiple loci identified and replicated for each condition. As cohort sample numbers increase and researchers collaborate and share cohorts, common susceptibility loci are beginning to emerge between several diseases. Crohn's disease and coeliac disease both demonstrate considerable overlap in their common genetic susceptibility with other related conditions. These shared loci offer an insight into the biology of the conditions but still present researchers with the problem of attempting to identify the true causal variants.
Human Molecular Genetics 05/2009; 18(R1):R101-6. · 7.69 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Our previous coeliac disease genome-wide association study (GWAS) implicated risk variants in the human leucocyte antigen (HLA) region and eight novel risk regions. To identify more coeliac disease loci, we selected 458 single nucleotide polymorphisms (SNPs) that showed more modest association in the GWAS for genotyping and analysis in four independent cohorts.
458 SNPs were assayed in 1682 cases and 3258 controls from three populations (UK, Irish and Dutch). We combined the results with the original GWAS cohort (767 UK cases and 1422 controls); six SNPs showed association with p<1 x 10(-04) and were then genotyped in an independent Italian coeliac cohort (538 cases and 593 controls).
We identified two novel coeliac disease risk regions: 6q23.3 (OLIG3-TNFAIP3) and 2p16.1 (REL), both of which reached genome-wide significance in the combined analysis of all 2987 cases and 5273 controls (rs2327832 p = 1.3 x 10(-08), and rs842647 p = 5.2 x 10(-07)). We investigated the expression of these genes in the RNA isolated from biopsies and from whole blood RNA. We did not observe any changes in gene expression, nor in the correlation of genotype with gene expression.
Both TNFAIP3 (A20, at the protein level) and REL are key mediators in the nuclear factor kappa B (NF-kappaB) inflammatory signalling pathway. For the first time, a role for primary heritable variation in this important biological pathway predisposing to coeliac disease has been identified. Currently, the HLA risk factors and the 10 established non-HLA risk factors explain approximately 40% of the heritability of coeliac disease.
[Show abstract][Hide abstract] ABSTRACT: Genome wide association studies have been hugely successful in identifying disease risk variants, yet most variants do not lead to coding changes and how variants influence biological function is usually unknown.
We correlated gene expression and genetic variation in untouched primary leucocytes (n = 110) from individuals with celiac disease - a common condition with multiple risk variants identified. We compared our observations with an EBV-transformed HapMap B cell line dataset (n = 90), and performed a meta-analysis to increase power to detect non-tissue specific effects.
In celiac peripheral blood, 2,315 SNP variants influenced gene expression at 765 different transcripts (< 250 kb from SNP, at FDR = 0.05, cis expression quantitative trait loci, eQTLs). 135 of the detected SNP-probe effects (reflecting 51 unique probes) were also detected in a HapMap B cell line published dataset, all with effects in the same allelic direction. Overall gene expression differences within the two datasets predominantly explain the limited overlap in observed cis-eQTLs. Celiac associated risk variants from two regions, containing genes IL18RAP and CCR3, showed significant cis genotype-expression correlations in the peripheral blood but not in the B cell line datasets. We identified 14 genes where a SNP affected the expression of different probes within the same gene, but in opposite allelic directions. By incorporating genetic variation in co-expression analyses, functional relationships between genes can be more significantly detected.
In conclusion, the complex nature of genotypic effects in human populations makes the use of a relevant tissue, large datasets, and analysis of different exons essential to enable the identification of the function for many genetic risk variants in common diseases.
BMC Medical Genomics 01/2009; 2:1. · 3.47 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Coeliac disease is caused by dietary gluten, triggering a chronic inflammation of the small intestine in genetically predisposed individuals. Recently, a risk locus on chromosome 2q11-q12, harbouring interleukin 18 receptor accessory protein (IL18RAP) and three other genes, was suggested for coeliac disease. IL18 has been shown to play an important role in T helper type 1 activity in coeliac disease, making this locus a highly interesting candidate. In this study, two previously indicated risk variants at the IL18RAP locus (rs13015714 and rs917997) were tested for genetic association in 1638 cases with coeliac disease and 1385 control individuals from the Finnish, Hungarian and Italian populations. The protein expression level of IL18RAP was also compared between risk allele carriers and non-carriers by Western blotting. Furthermore, immunohistochemical analysis was performed to study IL18RAP protein expression in small intestinal biopsies of untreated and treated coeliac patients and controls. We confirmed genetic association and dose effects of variants at the 2q12.1 locus with coeliac disease in the Hungarian population. The GA haplotype of the markers rs13015714 and rs917997 showed the strongest association (P = 0.0001, odds ratio = 1.475, 95% confidence interval 1.21-1.80). Two putative isoforms of IL18RAP were detected and the ratios and total levels of these isoforms may contribute to the aetiology of coeliac disease. Our study supports IL18RAP as a novel predisposing gene for coeliac disease and highlights the need for further functional studies on this relatively unknown gene in coeliac disease pathogenesis.
Human Molecular Genetics 01/2009; 18(6):1148-55. · 7.69 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Coeliac disease is a common complex disease caused by a dietary intolerance to wheat gluten. Susceptibility is determined by both environmental and genetic factors. Coeliac disease results from complex interactions between the innate immune system, an adaptive T and B cell response and the mucosal barrier where inflammation is ultimately manifested. Genetic variants within the HLA region are well established, while variants outside of the HLA region have recently been identified. These variants are beginning to enhance our understanding of the immunology of the condition. This review focuses on the immunological pathogenesis of coeliac disease with special reference to the influence of genetic susceptibility on disease development.
Seminars in Immunology 01/2009; · 5.93 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Two inflammatory disorders, type 1 diabetes and celiac disease, cosegregate in populations, suggesting a common genetic origin. Since both diseases are associated with the HLA class II genes on chromosome 6p21, we tested whether non-HLA loci are shared.
We evaluated the association between type 1 diabetes and eight loci related to the risk of celiac disease by genotyping and statistical analyses of DNA samples from 8064 patients with type 1 diabetes, 9339 control subjects, and 2828 families providing 3064 parent-child trios (consisting of an affected child and both biologic parents). We also investigated 18 loci associated with type 1 diabetes in 2560 patients with celiac disease and 9339 control subjects.
Three celiac disease loci--RGS1 on chromosome 1q31, IL18RAP on chromosome 2q12, and TAGAP on chromosome 6q25--were associated with type 1 diabetes (P<1.00x10(-4)). The 32-bp insertion-deletion variant on chromosome 3p21 was newly identified as a type 1 diabetes locus (P=1.81x10(-8)) and was also associated with celiac disease, along with PTPN2 on chromosome 18p11 and CTLA4 on chromosome 2q33, bringing the total number of loci with evidence of a shared association to seven, including SH2B3 on chromosome 12q24. The effects of the IL18RAP and TAGAP alleles confer protection in type 1 diabetes and susceptibility in celiac disease. Loci with distinct effects in the two diseases included INS on chromosome 11p15, IL2RA on chromosome 10p15, and PTPN22 on chromosome 1p13 in type 1 diabetes and IL12A on 3q25 and LPP on 3q28 in celiac disease.
A genetic susceptibility to both type 1 diabetes and celiac disease shares common alleles. These data suggest that common biologic mechanisms, such as autoimmunity-related tissue damage and intolerance to dietary antigens, may be etiologic features of both diseases.
New England Journal of Medicine 12/2008; 359(26):2767-77. · 54.42 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Our genome-wide association study of celiac disease previously identified risk variants in the IL2-IL21 region. To identify additional risk variants, we genotyped 1,020 of the most strongly associated non-HLA markers in an additional 1,643 cases and 3,406 controls. Through joint analysis including the genome-wide association study data (767 cases, 1,422 controls), we identified seven previously unknown risk regions (P < 5 x 10(-7)). Six regions harbor genes controlling immune responses, including CCR3, IL12A, IL18RAP, RGS1, SH2B3 (nsSNP rs3184504) and TAGAP. Whole-blood IL18RAP mRNA expression correlated with IL18RAP genotype. Type 1 diabetes and celiac disease share HLA-DQ, IL2-IL21, CCR3 and SH2B3 risk regions. Thus, this extensive genome-wide association follow-up study has identified additional celiac disease risk variants in relevant biological pathways.