Raphael Mourad

Raphael Mourad
Paul Sabatier University - Toulouse III | UPS Toulouse · Centre de Biologie Intégrative

PhD
Associate Professor in Artificial Intelligence for Genomics and Medicine. University Paul Sabatier

About

40
Publications
7,029
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
656
Citations
Introduction
I work on the development of new computational methods and models that combine artificial intelligence, machine learning, statistics, with genomics, genetics and medicine. https://raphaelmourad.wixsite.com/raphaelmourad
Additional affiliations
December 2013 - August 2014
Laboratoire d'Informatique, de Robotique et de Microélectronique de Montpellier (LIRMM)
Position
  • PostDoc Position
Description
  • HIV and Phylogenetics
November 2012 - October 2013
University of Chicago
Position
  • PostDoc Position
Description
  • Statistical Genetics and Whole Genome Sequencing
October 2011 - November 2012
Indiana University School of Medicine
Position
  • PostDoc Position
Description
  • Chromatin and Bioinformatics
Education
October 2008 - October 2011
University of Nantes
Field of study
  • Statistical Genetics and Machine Learning
September 2005 - August 2007
September 2004 - September 2005
Université Paris-Sud 11
Field of study
  • Genomics

Publications

Publications (40)
Preprint
Full-text available
The 3D genome is essential to numerous key processes such as the regulation of gene expression and the replication-timing program. In vertebrates, chromatin looping is often mediated by CTCF, and marked by CTCF motif pairs in convergent orientation. Comparative Hi-C recently revealed that chromatin looping evolves across species. However, Hi-C expe...
Article
Full-text available
The repair of DNA double-strand breaks (DSBs) is essential for safeguarding genome integrity. When a DSB forms, the PI3K-related ATM kinase rapidly triggers the establishment of megabase-sized, chromatin domains decorated with phosphorylated histone H2AX (γH2AX), which act as seeds for the formation of DNA-damage response foci¹. It is unclear how t...
Article
Full-text available
DNA is a complex molecule carrying the instructions an organism needs to develop, live and reproduce. In 1953, Watson and Crick discovered that DNA is composed of two chains forming a double-helix. Later on, other structures of DNA were discovered and shown to play important roles in the cell, in particular G-quadruplex (G4). Following genome seque...
Article
Full-text available
Background/Aim In higher eukaryotes, the three-dimensional (3D) organization of the genome is intimately related to numerous key biological functions including gene expression, DNA repair and DNA replication regulations. Alteration of 3D organization, in particular topologically associating domains (TADs), is detrimental to the organism and can giv...
Preprint
Full-text available
Genome-wide association studies have systematically identified thousands of single nucleotide polymorphisms (SNPs) associated with complex genetic diseases. However, the majority of those SNPs were found in non-coding genomic regions, preventing the understanding of the underlying causal mechanism. Predicting molecular processes based on the DNA se...
Article
Full-text available
The DNA damage response is essential to safeguard genome integrity. Although the contribution of chromatin in DNA repair has been investigated1,2, the contribution of chromosome folding to these processes remains unclear³. Here we report that, after the production of double-stranded breaks (DSBs) in mammalian cells, ATM drives the formation of a ne...
Article
Full-text available
Motivation Genome-wide association studies have systematically identified thousands of single nucleotide polymorphisms (SNPs) associated with complex genetic diseases. However, the majority of those SNPs were found in non-coding genomic regions, preventing the understanding of the underlying causal mechanism. Predicting molecular processes based on...
Article
Full-text available
Study design: Medical vignettes. Objectives: Lumbar spinal stenosis (LSS) is a degenerative condition with a high prevalence in the elderly population, that is associated with a significant economic burden and often requires spinal surgery. Prior authorization of surgical candidates is required before patients can be covered by a health plan and...
Preprint
Full-text available
Motivation: Genome-wide association studies have systematically identified thousands of single nucleotide polymor-phisms (SNPs) associated with complex genetic diseases. However, the majority of those SNPs were found in non-coding genomic regions, preventing the understanding of the underlying causal mechanism. Predicting molecular processes based...
Article
Full-text available
Purpose Lumbar spinal stenosis (LSS) is a condition affecting several hundreds of thousands of adults in the United States each year and is associated with significant economic burden. The current decision-making practice to determine surgical candidacy for LSS is often subjective and clinician specific. In this study, we hypothesize that the perfo...
Article
Full-text available
Transcriptionally active loci are particularly prone to breakage and mounting evidence suggests that DNA Double-Strand Breaks arising in active genes are handled by a dedicated repair pathway, Transcription-Coupled DSB Repair (TC-DSBR), that entails R-loop accumulation and dissolution. Here, we uncover a function for the Bloom RecQ DNA helicase (BL...
Conference Paper
Full-text available
Performance of artificial intelligence in determining candidacy for lumbar stenosis surgery Raphael Mourad 1, Serhii Kolisnyk 2, Jack Kim 1, Andrej Rusakov 1, Darren Lebl 3 1Remedy Logic, NY, USA, 2Vinnitsa National Medical University, Vinnytsia, Ukraine, 3Hospital for Special Surgery, NY, USA E-mail: r.mourad@remedylogic.com Background and Aims: L...
Preprint
Full-text available
DNA Double-Strand Breaks (DSBs) repair is essential to safeguard genome integrity but the contribution of chromosome folding into this process remains elusive. Here we unveiled basic principles of chromosome dynamics upon DSBs in mammalian cells, controlled by key kinases from the DNA Damage Response. We report that ATM is responsible for the reinf...
Preprint
Full-text available
DNA is a complex molecule carrying the instructions an organism needs to develop, live and reproduce. In 1953, Watson and Crick discovered that DNA is composed of two chains forming a double-helix. Later on, other structures of DNA were discovered and shown to play important roles in the cell, in particular G-quadruplex (G4). Following genome seque...
Preprint
Full-text available
Transcriptionally active loci are particularly prone to breakage and mounting evidence suggest that DNA Double-Strand Breaks arising in genes are handled by a dedicated repair pathway, Transcription-Coupled DSB Repair (TC-DSBR), that entails R-loops accumulation and dissolution. Here, we uncovered a critical function of the Bloom RecQ DNA helicase...
Preprint
Full-text available
DNA Double-Strand Breaks (DSBs) repair is essential to safeguard genome integrity. Upon DSBs, the ATM PI3K kinase rapidly triggers the establishment of megabase-sized, γH2AX-decorated chromatin domains which further act as seeds for the formation of DNA Damage Response (DDR) foci. How these foci are rapidly assembled in order to establish a "repair...
Article
Full-text available
Motivation: The 3D genome is essential to numerous key processes such as the regulation of gene expression and the replication-timing program. In vertebrates, chromatin looping is often mediated by CTCF, and marked by CTCF motif pairs in convergent orientation. Comparative Hi-C recently revealed that chromatin looping evolves across species. Howev...
Article
In higher eukaryotes, the three-dimensional (3D) organization of the genome is intimately related to numerous key biological functions including gene expression, DNA repair and DNA replication regulations. Alteration of this 3D organization is detrimental to the organism and can give rise to a broad range of diseases such as cancers. Here, we revie...
Article
Full-text available
Double-strand breaks (DSBs) result from the attack of both DNA strands by multiple sources, including radiation and chemicals. DSBs can cause the abnormal chromosomal rearrangements associated with cancer. Recent techniques allow the genome-wide mapping of DSBs at high resolution, enabling the comprehensive study of their origins. However, these te...
Article
Full-text available
The three-dimensional (3D) organization of the genome is intimately related to numerous key biological functions including gene expression and DNA replication regulations. The mechanisms by which molecular drivers functionally organize the 3D genome, such as topologically associating domains (TADs), remain to be explored. Current approaches consist...
Preprint
Full-text available
Double-strand breaks (DSBs) result from the attack of both DNA strands by multiple sources, including exposure to ionizing radiation or reactive oxygen species. DSBs can cause abnormal chromosomal rearrangements which are linked to cancer development, and hence represent an important issue. Recent techniques allow the genome-wide mapping of DSBs at...
Article
Full-text available
Chromosomal organization in 3D plays a central role in regulating cell-type specific transcriptional and DNA replication timing programs. Yet it remains unclear to what extent the resulting long-range contacts depend on specific molecular drivers. Here we propose a model that comprehensively assesses the influence on contacts of DNA-binding protein...
Article
Full-text available
Recent advances in long-range Hi-C contact mapping have revealed the importance of the 3D structure of chromosomes in gene expression. A current challenge is to identify the key molecular drivers of this 3D structure. Several genomic features, such as architectural proteins and functional elements, were shown to be enriched at topological domain bo...
Article
Objective: Antiretroviral-naive HIV-positive individuals contribute to the transmission of drug-resistant viruses, compromising first-line therapy. Using phylogenetic inference, we quantified the proportion of transmitted drug-resistance originating from a treatment-naive source. Methods: Using a novel phylotype-based approach, 24 550 HIV-1 subt...
Article
Full-text available
Chromosome folding can reinforce the demarcation between euchromatin and heterochromatin. Two new studies show how epigenetic data, including DNA methylation, can accurately predict chromosome folding in three dimensions. Such computational approaches reinforce the idea of a linkage between epigenetically marked chromatin domains and their segregat...
Article
Full-text available
Common variants at many loci have been robustly associated with asthma but explain little of the overall genetic risk. Here we investigate the role of rare (<1%) and low-frequency (1-5%) variants using the Illumina HumanExome BeadChip array in 4,794 asthma cases, 4,707 non-asthmatic controls and 590 case-parent trios representing European Americans...
Article
Full-text available
In the cell nucleus, each chromosome is confined to a chromosome territory. This spatial organization of chromosomes plays a crucial role in gene regulation and genome stability. An additional level of organization has been discovered at the chromosome scale: the spatial segregation into open and closed chromatins to form two genome-wide compartmen...
Book
At the crossroads between statistics and machine learning, probabilistic graphical models provide a powerful formal framework to model complex data. Probabilistic graphical models are probabilistic models whose graphical components denote conditional independence structures between random variables. The probabilistic framework makes it possible to...
Article
Full-text available
In data analysis, latent variables play a central role because they help provide powerful insights into a wide variety of phenomena, ranging from biological to human sciences. The latent tree model, a particular type of probabilistic graphical models, deserves attention. Its simple structure - a tree - allows simple and efficient inference, while i...
Article
Full-text available
Background Typical analysis of time-series gene expression data such as clustering or graphical models cannot distinguish between early and later drug responsive gene targets in cancer cells. However, these genes would represent good candidate biomarkers. Results We propose a new model - the dynamic time order network - to distinguish and connect...
Conference Paper
Full-text available
Together with the population aging concern, increasing health care costs require understanding the causal basis for common genetic diseases. The high dimensionality and complexity of genetic data hamper the detection of genetic associations. To alleviate the core risks (missing of the causal factor, spurious discoveries), machine learning offers an...
Article
Full-text available
Objective: Assessing the statistical power to detect susceptibility variants plays a critical role in genome-wide association (GWA) studies both from the prospective and retrospective point of view. Power is empirically estimated by simulating phenotypes under a disease model H1. For this purpose, the gold standard consists in simulating genotypes...
Article
Full-text available
Linkage disequilibrium study represents a major issue in statistical genetics as it plays a fundamental role in gene mapping and helps us to learn more about human history. The linkage disequilibrium complex structure makes its exploratory data analysis essential yet challenging. Visualization methods, such as the triangular heat map implemented in...
Article
Full-text available
Discovering the genetic basis of common genetic diseases in the human genome represents a public health issue. However, the dimensionality of the genetic data (up to 1 million genetic markers) and its complexity make the statistical analysis a challenging task. We present an accurate modeling of dependences between genetic markers, based on a fores...
Conference Paper
We describe a novel probabilistic graphical model customized to represent the statistical dependencies between genetic markers, in the Human genome. Our proposal relies on a forest of hierarchical latent class models. The motivation is to reduce the dimension of the data to be further submitted to statistical association tests with respect to disea...

Network

Cited By