Bartha Maria Knoppers’s research while affiliated with University of Helsinki and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (93)


Fig. 2 PCA projection of HostSeq genomes against reference superpopulations. HostSeq genomes were merged with the 1000 Genomes reference set. The first two principal components of this merged data are shown here with HostSeq genomes in black and 1000 Genomes samples colored by their superpopulation: AFR = African, AMR = Admixed American, EAS = East Asian, SAS = South Asian, EUR = European
Hospitalization and patient outcomes in HostSeq
HostSeq : A Canadian Whole Genome Sequencing and Clinical Data Resource
  • Article
  • Full-text available

May 2023

·

208 Reads

·

6 Citations

BMC Genomic Data

S. Yoo

·

·

Lt Elliott

·

[...]

·

HostSeq was launched in April 2020 as a national initiative to integrate whole genome sequencing data from 10,000 Canadians infected with SARS-CoV-2 with clinical information related to their disease experience. The mandate of HostSeq is to support the Canadian and international research communities in their efforts to understand the risk factors for disease and associated health outcomes and support the development of interventions such as vaccines and therapeutics. HostSeq is a collaboration among 13 independent epidemiological studies of SARS-CoV-2 across five provinces in Canada. Aggregated data collected by HostSeq are made available to the public through two data portals: a phenotype portal showing summaries of major variables and their distributions, and a variant search portal enabling queries in a genomic region. Individual-level data is available to the global research community for health research through a Data Access Agreement and Data Access Compliance Office approval. Here we provide an overview of the collective project design along with summary level information for HostSeq. We highlight several statistical considerations for researchers using the HostSeq platform regarding data aggregation, sampling mechanism, covariate adjustment, and X chromosome analysis. In addition to serving as a rich data source, the diversity of study designs, sample sizes, and research objectives among the participating studies provides unique opportunities for the research community. Supplementary Information The online version contains supplementary material available at 10.1186/s12863-023-01128-3.

Download






Author Correction: The repertoire of mutational signatures in human cancer

January 2023

·

413 Reads

·

33 Citations

Nature

Somatic mutations in cancer genomes are caused by multiple mutational processes, each of which generates a characteristic mutational signature1. Here, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium2 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. We identified 49 single-base-substitution, 11 doublet-base-substitution, 4 clustered-base-substitution and 17 small insertion-and-deletion signatures. The substantial size of our dataset, compared with previous analyses3–15, enabled the discovery of new signatures, the separation of overlapping signatures and the decomposition of signatures into components that may represent associated—but distinct—DNA damage, repair and/or replication mechanisms. By estimating the contribution of each signature to the mutational catalogues of individual cancer genomes, we revealed associations of signatures to exogenous or endogenous exposures, as well as to defective DNA-maintenance processes. However, many signatures are of unknown cause. This analysis provides a systematic perspective on the repertoire of mutational processes that contribute to the development of human cancer.



A second update on mapping the human genetic architecture of COVID-19

December 2022

·

1,396 Reads

·

4 Citations

Investigating the role of host genetic factors in COVID-19 severity and susceptibility can inform our understanding of the underlying biological mechanisms that influence adverse outcomes and drug development. Here we present a second updated genome-wide association study (GWAS) on COVID-19 severity and infection susceptibility to SARS-CoV-2 from the COVID-19 Host Genetic Initiative (data release 7). We performed a meta-analysis of up to 219,692 cases and over 3 million controls, identifying 51 distinct genome-wide significant loci—adding 28 loci from the previous data release. The increased number of candidate genes at the identified loci helped to map three major biological pathways involved in susceptibility and severity: viral entry, airway defense in mucus, and type I interferon.


Author Correction: Divergent mutational processes distinguish hypoxic and normoxic tumours

December 2022

·

192 Reads

·

9 Citations


Citations (73)


... We ran the SIGNAL webtool (https://signal.mutationalsignatures.com/) (Degasperi et al. 2020) under default parameters to identify the single base substitution (SBS) signatures active in each patient (Alexandrov et al. 2020). As recommended by the authors, SBS fitting was performed using candidate SBS signatures from CRC. ...

Reference:

Genomic diversity and BCL9L mutational status in CTC pools predict overall survival in metastatic colorectal cancer
Author Correction: The repertoire of mutational signatures in human cancer

Nature

... In clinical practice, differentiating between multiple primary lung cancers and intrapulmonary metastasis is challenging using pathology or morphologic appearance alone, and NGS is now recommended to help definitively differentiate these entities [5,[8][9][10][11][12][13][14][15]. When using genomic results to define multiple primaries versus intrapulmonary metastasis, we observed significantly improved survival in those with genome-defined multiple primaries versus intrapulmonary metastasis. ...

Author Correction: The evolutionary history of 2,658 cancers

Nature

... Characterizing the full spectrum of SVs in human genomes makes it possible to understand the distribution of SVs in different populations [8]. Additionally, SVs have been found to play a critical role in the development of certain diseases, such as cancer [22] and Alzheimer's [36]. SVs are typically referred to any genome sequence altering event, except for deletions or insertions (indels) shorter than 50 bp [32]. ...

Author Correction: Patterns of somatic structural variation in human cancer genomes

Nature

... RPL39L is a recently evolved ( 1 ) and non-redundant paralog of RPL39L that has just been implicated in the translation of long-lived, sperm cell-specific proteins ( 11 ). However, the RPL39L mRNA was also observed outside of the germ cell lineage, particularly in ovarian ( 12 ) and breast cancer tissues ( 2 ), as well as in lung cancer ( 13 ) and neuroblastoma ( 11 ) cell lines, where the expression appears to be driven by gene amplifications ( 14 ) and CpG island hypomethylation ( 13 ). These observations suggest that RPL39L's function extends beyond the translation of long-lived sperm cell proteins. ...

Author Correction: Genomic basis for RNA alterations in cancer

Nature

... 16 Second, we highlight an association between higher risk of COVID-19 and an ultra-rare missense variant in ZC3HAV1 (rs769102632:A, MAF ¼ 0.002%; p ¼ 3EÀ8; OR ¼ 26.7; 95% CI 8.37-85.38; Figure S1), a gene that encodes a zinc finger antiviral protein 19,20 that inhibits SARS-CoV-2 replication, 21 potentially by upregulating type I interferon responses. 22 Given the potential significance of this finding, we attempted to replicate the ZC3HAV1 rs769102632:A association in an additional 6,223 individuals with COVID-19 with exome or whole-genome sequence data generated as part of the GenOMICC (n ¼ 4,851), 11 Columbia University COVID-19 Biobank (n ¼ 1,152), and Biobanque Quebec (n ¼ 220) 23 studies. We found no carriers for this variant in these additional COVID-19 cases (Table S5) when we expected about four given the observed allele frequency in cases in our study (three and one carriers expected in individuals of African and European ancestry, respectively). ...

Failure to replicate the association of rare loss-of-function variants in type I IFN immunity genes with severe COVID-19

... We also re-estimated and leveraged the genetic correlation between the traits to amplify statistical power to discover IPF associations by performing multi-trait meta-analysis (MTAG) 24 . For these analyses we used the IPF meta-analysis statistics produced in this study and the latest meta-analysis summaries (Freeze 7) from the COVID-19 Host Genetics Initiative (HGI), for the Hospitalised covid vs. population phenotype (B2_ALL_leave_23andme), as this has produced the best balance between a carefully curated phenotype for severity and statistical power 15,25 (Supplementary Table 15). ...

A second update on mapping the human genetic architecture of COVID-19

... Genomic alterations have been ranked based on their recurrence and on their functional consequences, finally developing a clustering methodology to discriminate between potential driver events [46]. The extension of the sequencing to intergenic regions allowed evaluation of the burden of putative driver mutations in noncoding regions: on the pan-cancer database, 13% of all mutations were represented by driver point-mutation events in an intergenic region, with 25% of all PCAWG cancers analysed bearing at least one, one-third of which occurred in the TERT promoter, confirming its role in cancer [73][74][75][76][77][78][79]. On the counterpart, 91% of all cancers harboured a somatic driver event in a coding region of a gene (Fig. 3). ...

Author Correction: High-coverage whole-genome analysis of 1220 cancers reveals hundreds of genes deregulated by rearrangement-mediated cis-regulatory alterations

... Tumor hypoxia is a common feature of the microenvironment in solid tumors [67,68], primarily resulting from an imbalance between poor vascularization and high oxygen consumption by tumor cells [69]. Hypoxia induces a tumor-adapted phenotype, altering signaling, gene expression, and metabolism [70]. ...

Author Correction: Divergent mutational processes distinguish hypoxic and normoxic tumours

... 39 For pathway enrichments, we aggregated consistently curated FlyBase-associated genesets using the modEnrichr project. 40 We then used the ActivePathways gene set enrichment tool 41 Processing single-cell RNA sequencing data and cell-type-specific differential gene expression analysis ...

Author Correction: Integrative pathway enrichment analysis of multivariate omics data

... LncRNAs are involved in multiple developmental processes, including the regulation of Hox genes (Rinn et al. 2007), the regulation of chromatin accessibility in dosage compensation mechanisms (Loda and Heard 2019), and the development of organs such as the brain (Bernard et al. 2010) and heart (Klattenhoff et al. 2013). Alterations in the expression of lncRNAs have also been described for several diseases, including cancer (Huarte 2015;Carlevaro-Fita et al. 2020), cardiovascular diseases (Poller et al. 2018) and neurological disorders (Sunwoo et al. 2017). In fact, the public database LncRNADisease v2.0 currently describes more than 1,700 experimentally validated lncRNA-disease associations (Bao et al. 2019), highlighting lncRNAs as potential tools to understand the mechanisms and prognosis of multiple diseases. ...

Author Correction: Cancer LncRNA Census reveals evidence for deep functional conservation of long noncoding RNAs in tumorigenesis

Communications Biology