Alfonso Valencia

Alfonso Valencia
Centro Nacional de Investigaciones Oncológicas | CNIO · Structural Biology and Biocomputing Programme

About

714
Publications
108,848
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
63,534
Citations
Citations since 2017
141 Research Items
24515 Citations
201720182019202020212022202301,0002,0003,0004,000
201720182019202020212022202301,0002,0003,0004,000
201720182019202020212022202301,0002,0003,0004,000
201720182019202020212022202301,0002,0003,0004,000

Publications

Publications (714)
Preprint
Full-text available
Safe and effective severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) vaccines have been crucial to fight against the coronavirus disease 2019 pandemic. Most vaccines are based on a mutated version of the Spike glycoprotein [K986P/V987P (S-2P)] with improved stability, yield and immunogenicity. However, S-2P is still produced at low level...
Preprint
Full-text available
The co-administration of drugs known to interact has a high impact on morbidity, mortality, and health economics. We study the drug-drug interaction (DDI) phenomenon by analyzing drug administrations from population-wide Electronic Health Records (EHR) in Blumenau (Brazil), Catalonia (Spain), and Indianapolis (USA). Despite very different health ca...
Preprint
Full-text available
Precision medicine aims at tailoring treatments to individual patient needs. In this context, artificial intelligence (AI)-based technologies are viewed as revolutionary since they have the capacity to identify key features that link genomic and phenotypic traits at the individual level. AI techniques therefore depend on the quantity and quality of...
Preprint
Full-text available
Energetic local frustration offers a biophysical perspective to interpret the effects of sequence variability on protein families. Here we present a methodology to analyze local frustration patterns within protein families that allows us to uncover constraints related to stability and function, and identify differential frustration patterns in fami...
Preprint
Full-text available
Exploring the molecular basis of disease severity in rare disease scenarios is a challenging task provided the limitations on data availability. Causative genes have been described for Congenital Myasthenic Syndromes (CMS), a group of diverse minority neuromuscular junction (NMJ) disorders; yet a molecular explanation for the phenotypic severity di...
Article
Full-text available
Long-range interactions between regulatory elements and promoters are key in gene transcriptional control; however, their study requires large amounts of starting material, which is not compatible with clinical scenarios nor the study of rare cell populations. Here we introduce low input capture Hi-C (liCHi-C) as a cost-effective, flexible method t...
Article
Full-text available
“¿Las máquinas pueden pensar?”. Esta pregunta se la formuló Alan Turing1, considerado como el padre de la computación, ya en el año 1950 (Turing, 1950). Al mismo tiempo, formuló un pequeño juego al que acuñó el nombre de “juego de las imitaciones”. El juego consiste en que una persona A interactúa con una máquina B y una persona C, e intentará adiv...
Preprint
Full-text available
The COVID-19 Disease Map project is a large-scale community effort uniting 277 scientists from 130 Institutions around the globe. We use high-quality, mechanistic content describing SARS-CoV-2-host interactions and develop interoperable bioinformatic pipelines for novel target identification and drug repurposing. Community-driven and highly interdi...
Article
Full-text available
In mammalian cells, chromosomal replication starts at thousands of origins at which replisomes are assembled. Replicative stress triggers additional initiation events from 'dormant' origins whose genomic distribution and regulation are not well understood. In this study, we have analyzed origin activity in mouse embryonic stem cells in the absence...
Article
Full-text available
Background In December 2020, the COVID-19 disease was confirmed in 1,665,775 patients and caused 45,784 deaths in Spain. At that time, health decision support systems were identified as crucial against the pandemic. Methods This study applies Deep Learning techniques for mortality prediction of COVID-19 patients. Two datasets with clinical informa...
Article
Full-text available
Most proteins fold into 3D structures that determine how they function and orchestrate the biological processes of the cell. Recent developments in computational methods for protein structure predictions have reached the accuracy of experimentally determined models. Although this has been independently verified, the implementation of these methods...
Article
Full-text available
The human face is one of the most visible features of our unique identity as individuals. Interestingly, monozygotic twins share almost identical facial traits and the same DNA sequence but could exhibit differences in other biometrical parameters. The expansion of the world wide web and the possibility to exchange pictures of humans across the pla...
Article
Full-text available
The world has gone through unprecedented changes since the global pandemic hit. During the early phase of the pandemic, the absence of known drugs or pharmaceutical treatments forced governments to introduce different policies in order to help reduce contagion rates and manage the economic consequences of the pandemic. This paper analyses the causa...
Article
Full-text available
Digital biomarkers are defined as objective, quantifiable physiological and behavioral data that are collected and measured by means of digital devices. Their use has revolutionized clinical research by enabling high-frequency, longitudinal, and sensitive measurements. In the field of neurodegenerative diseases, an example of a digital biomarker-ba...
Article
Full-text available
Motivation The analysis of cancer genomes provides fundamental information about its aetiology, the processes driving cell transformation or potential treatments. While researchers and clinicians are often only interested in the identification of oncogenic mutations, actionable variants or mutational signatures, the first crucial step in the analys...
Article
Full-text available
The emerging SARS-CoV-2 variants of concern (VOCs) may display enhanced transmissibility, more severity and/or immune evasion; however, the pathogenesis of these new VOCs in experimental SARS-CoV-2 models or the potential infection of other animal species is not completely understood. Here we infected K18-hACE2 transgenic mice with B.1, B.1.351/Bet...
Article
Full-text available
Introduction: Health services generate large amounts of routine health data (eg, administrative databases, disease registries and electronic health records), which have important secondary uses for research. Increases in the availability and the ability to access and analyse large amounts of data represent a major opportunity for conducting studie...
Article
Full-text available
The emergence of cell resistance in cancer treatment is a complex phenomenon that emerges from the interplay of processes that occur at different scales. For instance, molecular mechanisms and population-level dynamics such as competition and cell–cell variability have been described as playing a key role in the emergence and evolution of cell resi...
Article
Computational systems and methods are often being used in biological research, including the understanding of cancer and the development of treatments. Simulations of tumor growth and its response to different drugs are of particular importance, but also challenging complexity. The main challenges are first to calibrate the simulators so as to repr...
Article
Full-text available
Prostate cancer is the second most occurring cancer in men worldwide. To better understand the mechanisms of tumorigenesis and possible treatment responses, we developed a mathematical model of prostate cancer which considers the major signalling pathways known to be deregulated. We personalised this Boolean model to molecular data to reflect the h...
Article
Full-text available
Importance: Autism spectrum disorder (ASD) and attention-deficit/hyperactivity disorder (ADHD) are childhood-onset disorders that may persist into adulthood. Several studies have suggested that they may be associated with an increased risk of mortality; however, the results are inconsistent. Objective: To assess the risk of mortality among perso...
Article
Full-text available
The protein structure field is experiencing a revolution. From the increased throughput of techniques to determine experimental structures, to developments such as cryo-EM that allow us to find the structures of large protein complexes or, more recently, the development of artificial intelligence tools, such as AlphaFold, that can predict with high...
Preprint
Full-text available
Motivation Cancer progression is a complex phenomenon that spans multiple scales from molecular to cellular and intercellular. Simulations can be used to perturb the underlying mechanisms of those systems and to generate hypotheses on novel therapies. We present a new version of PhysiBoSS, a multiscale modelling framework designed to cover multiple...
Chapter
Natural language processing (NLP) is increasingly applied to a broad range of sensitive tasks, such as human resources, biomedicine, and healthcare. Accordingly, a growing body of research is investigating the impact of sex and gender bias in the models and the data on which such models are trained. As NLP systems become more pervasive in our socie...
Preprint
The emergence of cell resistance in cancer treatment is a complex phenomenon that emerges from the interplay of processes that occur at different scales. For instance, molecular mechanisms and population-level dynamics such as competition and cell-cell variability have been described as playing a key role in the emergence and evolution of cell resi...
Preprint
The world has gone through unprecedented changes since the global pandemic hit. During the early phase of the pandemic, the absence of known drugs or pharmaceutical treatments forced governments to introduce different policies in order to help reduce contagion rates and manage the economic consequences of the pandemic. This paper analyses the causa...
Article
Full-text available
The regulation of gene transcription by transcription factors is a fundamental biological process, yet the relations between transcription factors (TF) and their target genes (TG) are still only sparsely covered in databases. Text-mining tools can offer broad and complementary solutions to help locate and extract mentions of these biological relati...
Article
Full-text available
COVID-19 is an infectious disease caused by the SARS-CoV-2 virus, which has spread all over the world leading to a global pandemic. The fast progression of COVID-19 has been mainly related to the high contagion rate of the virus and the worldwide mobility of humans. In the absence of pharmacological therapies, governments from different countries h...
Article
Full-text available
Cohesin exists in two variants containing STAG1 or STAG2. STAG2 is one of the most mutated genes in cancer and a major bladder tumor suppressor. Little is known about how its inactivation contributes to tumorigenesis. Here, we analyze the genomic distribution of STAG1 and STAG2 and perform STAG2 loss-of-function experiments using RT112 bladder canc...
Article
Full-text available
We present the results of the assessment of the intramolecular residue-residue contact and distance predictions from groups participating in the 14th round of the CASP experiment. The performance of contact prediction methods was evaluated with the measures used in previous CASPs, while distance predictions were assessed based on a new protocol, wh...
Article
Full-text available
The COST Action Gene Regulation Ensemble Effort for the Knowledge Commons (GREEKC, CA15205, www.greekc.org) organized nine workshops in a four-year period, starting September 2016. The workshops brought together a wide range of experts from all over the world working on various parts of the knowledge cycle that is central to understanding gene regu...
Preprint
Full-text available
Most proteins fold into 3D structures that determine how they function and orchestrate the biological processes of the cell. Recent developments in computational methods have led to protein structure predictions that have reached the accuracy of experimentally determined models. While this has been independently verified, the implementation of thes...
Chapter
Full-text available
Multi-scale simulations require parallelization to address large-scale problems, such as real-sized tumor simulations. BioFVM is a software package that solves diffusive transport Partial Differential Equations for 3-D biological simulations successfully applied to tissue and cancer biology problems. Currently, BioFVM is only shared-memory parallel...
Article
Full-text available
Agent-based modelling has proven its usefulness in several biomedical projects by explaining and uncovering mechanisms in diseases. Nevertheless, the scenarios addressed in these models usually consider a small number of cells, lack cell-specific characterisation and dynamic interactions and have a simplistic environment description. Tools that ena...
Preprint
Full-text available
The protein structure field is experiencing a revolution. From the increased throughput of techniques to determine experimental structures, to developments such as cryo-EM that allow us to find the structures of large protein complexes or, more recently, the development of artificial intelligence tools, such as AlphaFold, that can predict with high...
Preprint
Epidemiological evidence shows that some diseases tend to co-occur; more exactly, certain groups of patients with a given disease are at a higher risk of developing a specific secondary condition. Despite the considerable interest, only a small number of connections between comorbidities and molecular processes have been identified. Here we develop...
Preprint
Full-text available
COVID-19 is an infectious disease caused by the SARS-CoV-2 virus, which has spread all over the world leading to a global pandemic. The fast progression of COVID-19 has been mainly related to the high contagion rate of the virus and the worldwide mobility of humans. In the absence of pharmacological therapies, governments from different countries h...
Article
Full-text available
B cells have the unique property to somatically alter their immunoglobulin (IG) genes by V(D)J recombination, somatic hypermutation (SHM) and class-switch recombination (CSR). Aberrant targeting of these mechanisms is implicated in lymphomagenesis, but the mutational processes are poorly understood. By performing whole genome and transcriptome sequ...
Article
DOME is a set of community-wide recommendations for reporting supervised machine learning–based analyses applied to biological studies. Broad adoption of these recommendations will help improve machine learning assessment and reproducibility.
Article
Full-text available
Many human genes are transcribed from both strands and produce sense-antisense gene pairs. Sense-antisense (SAS) chimeric transcripts are produced upon the coalescing of exons/introns from both sense and antisense transcripts of the same gene. SAS chimera was first reported in prostate cancer cells. Subsequently, numerous SAS chimeras have been rep...
Article
Full-text available
Alzheimer’s (AD) and Parkinson’s diseases (PD) are the two most prevalent neurodegenerative disorders in human populations. Epidemiological studies have shown that patients suffering from either condition present a reduced overall risk of cancer than controls (i.e., inverse comorbidity), suggesting that neurodegeneration provides a protective effec...
Article
Full-text available
eTRANSAFE is a research project funded within the Innovative Medicines Initiative (IMI), which aims at developing integrated databases and computational tools (the eTRANSAFE ToxHub) that support the translational safety assessment of new drugs by using legacy data provided by the pharmaceutical companies that participate in the project. The project...
Article
Full-text available
Rett syndrome (RTT) is a rare neurological disorder mostly caused by a genetic variation in MECP2 . Making new MECP2 variants and the related phenotypes available provides data for better understanding of disease mechanisms and faster identification of variants for diagnosis. This is, however, currently hampered by the lack of interoperability betw...
Preprint
Full-text available
A bstract Background The propagation of COVID-19 in Spain prompted the declaration of the state of alarm on March 14, 2020. On 2 December 2020, the infection had been confirmed in 1,665,775 patients and caused 45,784 deaths. This unprecedented health crisis challenged the ingenuity of all professionals involved. Decision support systems in clinica...
Article
Full-text available
Copy number variations (CNVs) are major causative contributors both in the genesis of genetic diseases and human neoplasias. While “High-Throughput” sequencing technologies are increasingly becoming the primary choice for genomic screening analysis, their ability to efficiently detect CNVs is still heterogeneous and remains to be developed. The aim...
Preprint
Full-text available
Over the last 15 years we have identified hundreds of inherited variants that increase the risk of developing cancer. Polygenic risk scores (PRS) summarize the genetic risk of each individual by accounting for the unique combination of risk alleles in their genome. So far, most studies of PRS in cancer have focused on their predictive value: i.e. t...
Article
Full-text available
The Pan-Cancer Analysis of Whole Genomes (PCAWG) project generated a vast amount of whole-genome cancer sequencing resource data. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancers across 38 tumor types, we provide a user’s guide to the five publicl...
Article
Full-text available
Motivation: The structure of chromatin impacts gene expression. Its alteration has been shown to coincide with the occurrence of cancer. A key challenge is in understanding the role of chromatin structure (CS) in cellular processes and its implications in diseases. Results: We propose a comparative pipeline to analyze CSs and apply it to study c...
Article
Full-text available
Comorbidity is a medical condition attracting increasing attention in healthcare and biomedical research. Little is known about the involvement of potential molecular factors leading to the emergence of a specific disease in patients affected by other conditions. We present here a disease interaction network inferred from similarities between patie...
Article
Full-text available
Diseases involve complex modifications to the cellular machinery. The gene expression profile of the affected cells contains characteristic patterns linked to a disease. Hence, new biological knowledge about a disease can be extracted from these profiles, improving our ability to diagnose and assess disease risks. This knowledge can be used for dru...
Article
Full-text available
One of the key challenges of cancer biology is to catalogue and understand the somatic genomic alterations leading to cancer. Although alternative definitions and search methods have been developed to identify cancer driver genes and mutations, analyses of thousands of cancer genomes return a remarkably similar catalogue of around 300 genes that ar...
Article
Full-text available
The catalog of cancer driver mutations in protein-coding genes has greatly expanded in the past decade. However, non-coding cancer driver mutations are less well-characterized and only a handful of recurrent non-coding mutations, most notably TERT promoter mutations, have been reported. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Ge...
Preprint
Full-text available
The Pan-Cancer Analysis of Whole Genomes (PCAWG) project has generated, to our knowledge, the largest whole-genome cancer sequencing resource to date. Here we provide a user’s guide to the five publicly available online data exploration and visualization tools introduced in the PCAWG marker paper: The ICGC Data Portal, UCSC Xena, Expression Atlas,...