ArticlePDF Available

Towards an integrated interactive database for the search of stratification biomarkers in Alkaptonuria

Authors:

Abstract

MOTIVATION Alkaptonuria (AKU) is a rare and genetic disease which causes discoloration of bone (a process called ‘ochronosis’) and induces early-onset osteoarthritis. AKU data have not been organized yet and the disease has no approved biomarkers. The ability to collect, integrate and analyze relevant data streams is the core for developing a “Precision Medicine Ecosystem” AKU-dedicated in which biological resources are shared between researchers, clinicians and patients. Computational modeling can be a useful guide to generate an exhaustive and dynamic picture of the individual and to identify the molecular interactions between biomarkers on which progressive diseases are based. METHODS It has been built a new integrated interactive database thanks to MySQL, the most frequently chosen for use in web applications. In addition, data are statistically analyzed by R software (www.r-project.org) based on Pearson’s correlation coefficient andPvalue. For a biological interpretation of statistic results, Stitch (Kuhn M. et al, 2007) and KEGG (Kanehisa M. et al, 2000) are used. RESULTS For Precision Medicine (PM) application to AKU, the collection of as much data as possible Alkaptonuria affected patients has been needed in order to organize them in an interactive and integrated database and to find a data-processing system for AKU biomarkers discovery. The database is an effective tool for registered researchers, clinicians and patients who could both easily access all the current information, as well as being able to insert new data, refreshing or replacing previous entries. Data are divided into different sections: genetic, proteic, biochemical, histopathologic and clinical. As far as data analysis is concerned, it has been developed an algorithm to build up a refreshable correlation matrix based on Pearson’s correlation coefficient andPvalue which allows the monitoring of renewable correlations between the most recent data inserted. Together with the mathematical and statistical interpretation, a biological explanation of the results is needed in order to investigate on AKU biomarkers. This dynamic tool could be useful for biomarkers investigation also in other osteoarticular diseases and it is a good starting point for the creation of data management and analysis model appropriate for PM. Through the use of AKU-dedicated database and this innovative analytic approach, it has been possible to become aware of the failure of biomarkers clinically used and to improve the detenction of more exploitable prognostic biomarkers for a more reliable AKU patients clinical monitoring.
TITLE
Towards an integrated interactive database for the search of stratification biomarkes in Alkaptonuria
AUTHORS
Cicaloni V¹, Rossi A², Zazzeri M², Zugarini A², Santucci A¹, Bernini A¹, Spiga O¹.
¹ Dipartimento di Biotecnologie, Chimica e Farmacia. Università degli Studi di Siena
² Dipartimento di Ingegneria dell’Informazione e Scienze matematiche. Università degli Studi di
Siena
ABSTRACT
MOTIVATION Alkaptonuria (AKU) is a rare and genetic disease which causes discoloration of
bone (a process called ‘ochronosis’) and induces early-onset osteoarthritis. AKU data have not been
organized yet and the disease has no approved biomarkers. The ability to collect, integrate and
analyze relevant data streams is the core for developing a “Precision Medicine Ecosystem” AKU-
dedicated in which biological resources are shared between researchers, clinicians and patients.
Computational modeling can be a useful guide to generate an exhaustive and dynamic picture of the
individual and to identify the molecular interactions between biomarkers on which progressive
diseases are based.
METHODS It has been built a new integrated interactive database thanks to MySQL, the most
frequently chosen for use in web applications. In addition, data are statistically analyzed by R
software (www.r-project.org) based on Pearson’s correlation coefficient and P value. For a
biological interpretation of statistic results, Stitch (Kuhn M. et al, 2007) and KEGG (Kanehisa M. et
al, 2000) are used.
RESULTS For Precision Medicine (PM) application to AKU, the collection of as much data as
possible Alkaptonuria affected patients has been needed in order to organize them in an interactive
and integrated database and to find a data-processing system for AKU biomarkers discovery. The
database is an effective tool for registered researchers, clinicians and patients who could both easily
access all the current information, as well as being able to insert new data, refreshing or replacing
previous entries. Data are divided into different sections: genetic, proteic, biochemical,
histopathologic and clinical. As far as data analysis is concerned, it has been developed an
algorithm to build up a refreshable correlation matrix based on Pearson’s correlation coefficient
and P value which allows the monitoring of renewable correlations between the most recent data
PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2174v1 | CC BY 4.0 Open Access | rec: 28 Jun 2016, publ: 28 Jun 2016
inserted. Together with the mathematical and statistical interpretation, a biological explanation of
the results is needed in order to investigate on AKU biomarkers. This dynamic tool could be useful
for biomarkers investigation also in other osteoarticular diseases and it is a good starting point for
the creation of data management and analysis model appropriate for PM. Through the use of AKU-
dedicated database and this innovative analytic approach, it has been possible to become aware of
the failure of biomarkers clinically used and to improve the detenction of more exploitable
prognostic biomarkers for a more reliable AKU patients clinical monitoring.
PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2174v1 | CC BY 4.0 Open Access | rec: 28 Jun 2016, publ: 28 Jun 2016
... By taking advantage of the dataset hereby present, containing the highest number of AKU patients ever considered, it is possible to explore the phenotype-genotype distribution study in AKU. In the present study, machine learning algorithms were firstly implemented with the aim to perform a stratification of AKU patients based on clinical, biochemical and QoL data deposited in the ApreciseKUre database [25][26][27]. To achieve this purpose, we have considered biochemical markers of chronic inflammation and amyloidosis (i.e., SAA and CHIT1, for more details see Supplementary Materials) and markers linked with oxidative stress status (i.e., PTI and S-thiolated proteins, for more details see Supplementary Materials). ...
Article
Alkaptonuria (AKU, OMIM: 203500) is an autosomal recessive disorder caused by mutations in the Homogentisate 1,2-dioxygenase (HGD) gene. A lack of standardized data, information and methodologies to assess disease severity and progression represents a common complication in ultra-rare disorders like AKU. This is the reason why we developed a comprehensive tool, called ApreciseKUre, able to collect AKU patients deriving data, to analyse the complex network among genotypic and phenotypic information and to get new insight in such multi-systemic disease. By taking advantage of the dataset, containing the highest number of AKU patient ever considered, it is possible to apply more sophisticated computational methods (such as machine learning) to achieve a first AKU patient stratification based on phenotypic and genotypic data in a typical precision medicine perspective. Thanks to our sufficiently populated and organized dataset, it is possible, for the first time, to extensively explore the phenotype-genotype relationships unknown so far. This proof of principle study for rare diseases confirms the importance of a dedicated database, allowing data management and analysis and can be used to tailor treatments for every patient in a more effective way.
ResearchGate has not been able to resolve any references for this publication.