
Model verification tools: a computational framework for verification assessment of mechanistic agent-based models

May 2022
BMC Bioinformatics 22(S14)

DOI:10.1186/s12859-022-04684-0

License
CC BY 4.0

Authors:

Giulia Russo

University of Catania

Marzio Pennisi

Amedeo Avogadro University of Eastern Piedmont

Francesco Pappalardo

University of Catania




Giulia Russo1†, Giuseppe Alessandro Parasiliti Palumbo2†, Marzio Pennisi3 and Francesco Pappalardo1*

From 4th International Workshop on Computational Methods for the Immune System Function (CMISF 2020)

Virtual. 16-19 December 2020

Abstract

Background: Nowadays, the use of computer modeling and simulation in the life sciences is a matter of fact. This is one of the reasons why regulatory authorities are open to considering evidence from in silico trials for the assessment of the safety and efficacy of medicinal products. In this context, mechanistic Agent-Based Models are increasingly used. Unfortunately, there is still a lack of consensus on the verification assessment of Agent-Based Models for regulatory approval needs. VV&UQ is an ASME standard specifically suited for the verification, validation, and uncertainty quantification of medical devices. However, it can also be adapted for the verification assessment of in silico trials for medicinal products.

Results: Here, we propose a set of automatic tools for the verification assessment of mechanistic Agent-Based Models. As a working example, we applied the verification framework to an Agent-Based Model in silico trial used in the COVID-19 context.

Conclusions: The described computational verification workflow allows researchers and practitioners to easily perform the verification steps needed to demonstrate the robustness and correctness of Agent-Based Models, providing strong evidence for further regulatory requirements.

Keywords: Agent-based models, Verification assessment, In silico trials, Medicinal product, Regulatory context, COVID-19

SOFTWARE

Russo et al. BMC Bioinformatics 2021, 22(Suppl 14):626
https://doi.org/10.1186/s12859-022-04684-0

†Giulia Russo and Giuseppe Alessandro Parasiliti Palumbo were considered as joint first authors
*Correspondence: francesco.pappalardo@unict.
1 Department of Drug and Health Sciences, University of Catania, 95125 Catania, Italy
Full list of author information is available at the end of the article

Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Content courtesy of Springer Nature, terms of use apply. Rights reserved.

Background

The recent openness of both European and US regulatory agencies [1, 2] to the possibility of using computer modeling and simulation to provide some of the regulatory evidence needed for the assessment of the safety (i.e., the product does not worsen the health of the recipient) and efficacy (i.e., the product improves the recipient’s health) of novel medical compounds has paved the way for the application of so-called in silico trials. Computational simulations can be used to strengthen, or possibly to substitute, the results coming from experiments involving cell cultures and animals (i.e., pre-clinical trials) first, and human volunteers (i.e., phase I, II, III, and IV clinical trials) afterwards.

While the regulatory protocol for the assessment of safety and efficacy (i.e., qualification) of a medical product is well established when classical clinical trials are considered [3], there is still no common consensus [4–7] on how to assess the “credibility” of a computational model. Even if verification and validation techniques can in general be borrowed from other research fields (e.g., statistics, engineering, mathematics, and physics), it has become necessary to establish which steps must be carried out, and which methodologies must be used in each of them, to qualify any computational model to be used for In Silico Trials (ISTs) through verification, validation, and uncertainty quantification (VV&UQ) procedures.

To date, few model credibility standards and approaches have been discussed [8–10]. In the field of In Silico Trials, Viceconti et al. [11] proposed a theoretical framing for the problem of assessing the credibility of a predictive model for ISTs that considers the epistemic specificity of the research field and is general enough to be used for different types of models, including simulators based on Agent-Based Models (ABMs), which have become increasingly popular in this scenario.

Thanks to the ability of ABMs to readily describe complicated biological behaviors, laws, and interactions involving cells and molecules without the need for complex mathematical formulas, these models are increasingly applied to simulate human pathophysiology. Specifically, ABMs are useful to predict disease progression and the related response to various treatment administrations, including in specific conditions where immune system involvement is considered, and to assist in discovering and developing novel vaccines.

In ABMs, entities (also called agents) are tracked individually, and interactions are recorded one by one, allowing the global emergent behavior of the system to be inferred as the sum of the individual behaviors of the agents (bottom-up approach). As ABMs lack a strong mathematical formalization and a standard verification process, some verification steps must be refined and customized to better meet their characteristics [12, 13].

In this scenario, Curreli et al. [14] adapted the theoretical framework mentioned above to assess the credibility of an ABM simulator of the immune system in the presence of tuberculosis disease. However, the numerical and statistical procedures involved in the verification were executed manually, using different tools and software.

To facilitate the work of researchers developing computational models for ISTs and to speed up the verification procedure, we developed “Model Verification Tools” (MVT), a suite of tools based on the same theoretical framework described above, with a user-friendly interface for the deterministic verification of discrete-time models and a particular focus on agent-based approaches. The toolkit makes it simple for researchers to check many parts of their models for possible flaws and inconsistencies that could influence their outcomes.

Implementation

The verification workflow and its application to ABMs

Curreli et al. proposed a theoretical framework that aims at defining the steps for assessing the credibility of mechanistic models used in the context of in silico trials for medicinal products [14]. Here, we recall this verification workflow, as it represents the starting point for MVT development. The workflow considers two verification procedures that can be carried out independently, i.e., deterministic and stochastic model verification. ABMs usually make use of pseudo-random number generators initialized with different random seeds to reproduce different stochastic behaviors. By keeping the random seed constant or varying it over a set of simulations, it is possible to analyze the model behavior from a deterministic or a stochastic point of view, respectively. Hence, stand-alone procedures for deterministic and stochastic verification can be provided.

For the deterministic model verification, the workflow takes into consideration the following steps:

1. Existence and uniqueness analysis

2. Time Step Convergence Analysis

3. Smoothness Analysis

4. Parameter sweep analysis

Moreover, for stochastic model verification, the following steps are also considered:

5. Consistency

6. Sample Size

At present, MVT includes the analysis tools for steps 1–4, as these represent the most important ones. Steps 5 and 6 will be implemented in the next release of the tool.

The existence procedure checks that a solution exists within the acceptable range of the input parameters. Uniqueness focuses on checking for possible numerical and discretization errors (e.g., round-off errors), due to the limited numerical precision of computing platforms, that may influence solution results over different runs executed with the same seed. While existence can be checked by ensuring that the computational model returns an output value for a given reasonable input range, uniqueness can be verified by checking that identical input sets always produce the same outputs with, at most, a minimal tolerated variation determined by the numerical rounding algorithm used.

Time step convergence analysis aims at ensuring that the time approximation introduced by the Fixed Increment Time Advance (FITA) approach used by most ABM frameworks and tools does not excessively influence the quality of the solution. The same model is run with different time-step lengths to calculate the percentage discretization error according to the following equation:

$$e_{q_i} = \frac{\left| q_{i^*} - q_i \right|}{\left| q_{i^*} \right|} \times 100$$

where $q_{i^*}$ is an output reference quantity (e.g., the peak value, final value, or mean value of the simulation) obtained from the simulation executed at the smallest reference time-step that still keeps the execution of the model computationally tractable ($i^*$); $q_i$ represents the same output reference quantity obtained with a time-step $i$ (with $i > i^*$); and $e_{q_i}$ is the percentage discretization error. In their work, Curreli et al. proposed to assume that the model converges if the error $e_{q_i} < 5\%$.
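The convergence criterion can be illustrated with a minimal Python sketch. The function names and the peak values below are hypothetical and chosen only for illustration; they are not taken from MVT or from the results of this work.

```python
def percentage_discretization_error(q_ref: float, q_i: float) -> float:
    """Percentage error of q_i relative to the reference quantity q_ref."""
    if q_ref == 0:
        raise ValueError("the reference quantity must be non-zero")
    return abs(q_ref - q_i) / abs(q_ref) * 100.0


def has_converged(q_ref: float, q_i: float, threshold_percent: float = 5.0) -> bool:
    """Apply the 5% rule proposed by Curreli et al. (the threshold is adjustable)."""
    return percentage_discretization_error(q_ref, q_i) < threshold_percent


# Hypothetical peak values, for illustration only (not results from the paper).
reference_peak = 200.0  # obtained with the smallest tractable time-step
peaks = {"15 min": 196.0, "1 h": 188.0, "8 h": 150.0}
for label, peak in peaks.items():
    error = percentage_discretization_error(reference_peak, peak)
    verdict = has_converged(reference_peak, peak)
    print(f"{label}: error = {error:.1f}%, converged = {verdict}")
```

The same function applies to any scalar reference quantity (peak value, final value, or mean value), as long as the reference quantity is non-zero.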


Smoothness analysis was proposed to evaluate the smoothness of the solution, bearing in mind that possible errors in the numerical solution may lead to singularities, discontinuities, and buckling. To evaluate the smoothness, the coefficient of variation D is computed, for all the output time series, as the standard deviation of the first differences of the time series scaled by the absolute value of their mean. To this end, a moving window is used: for each time observation $y_t$ in the output time series, the k nearest neighbors are considered in the window $y_t^k = \{y_{t-k}, y_{t-k+1}, \ldots, y_t, y_{t+1}, \ldots, y_{t+k}\}$. Curreli et al. used k = 3. The higher D is, the higher the risk of stiffness, singularities, and discontinuities.
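One possible reading of this definition is sketched below in Python. It assumes that D is evaluated per window, that the population standard deviation is used, and that only observations with a full window of 2k + 1 points are scored; these choices, and the function name, are assumptions and may differ from the MVT implementation.

```python
import math
import statistics


def smoothness_coefficients(series, k=3):
    """Return one coefficient D per observation that has a full window.

    For each t, the window y[t-k], ..., y[t+k] is differenced once and
    D = SD(differences) / |mean(differences)| is computed.
    """
    coefficients = []
    for t in range(k, len(series) - k):
        window = series[t - k : t + k + 1]
        diffs = [b - a for a, b in zip(window, window[1:])]
        mean_diff = statistics.fmean(diffs)
        sd_diff = statistics.pstdev(diffs)  # population SD (an assumption)
        if mean_diff == 0:
            # A flat window is perfectly smooth; otherwise D is unbounded.
            coefficients.append(0.0 if sd_diff == 0 else math.inf)
        else:
            coefficients.append(sd_diff / abs(mean_diff))
    return coefficients


# A linear trend has constant first differences, so every D is zero,
# while a series with a sudden jump yields a large D around the jump.
linear = list(range(0, 20, 2))
with_jump = [0, 1, 2, 3, 50, 51, 52, 53, 54]
print(smoothness_coefficients(linear))
print(max(smoothness_coefficients(with_jump)))
```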

Finally, parameter sweep analysis is used to ensure that the computational model is not numerically ill-conditioned. In general, the procedure involves sampling the entire input parameter space to check whether, for particular input sets, the model fails to produce a valid solution or produces a solution that is valid but outside the expected validity range. Furthermore, by introducing slight variations in the input values, the analysis can be used to verify whether such slight variations cause significant variations in the output values, suggesting an abnormal sensitivity to some inputs.

While in their paper Curreli et al. use a two-step procedure, first reducing the input set size and then checking the effects of the most relevant selected inputs on the outputs, we believe that similar results can be obtained by using well-known standard stochastic sensitivity analyses, such as variance-based (Sobol) sensitivity analysis or Latin Hypercube Sampling-Partial Rank Correlation Coefficient (LHS-PRCC), both of which have been introduced in MVT. The latter, in particular, uses a Latin Hypercube Sampling (LHS) over the entire input parameter range to calculate the Partial Rank Correlation Coefficient (PRCC) values between the input values and the selected output value. In this way, it is possible to estimate the influence that any input parameter has on the output value, independently of the variation of the other input parameters, from a stochastic point of view. This procedure can also be carried out at any time point to check the influence of the inputs on the output over time. LHS-PRCC is a robust sensitivity analysis technique for nonlinear but monotonic relationships between inputs and output.
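To make the PRCC step concrete, the following Python sketch rank-transforms inputs and output and then computes partial correlations with the standard recursive formula. It is a plain standard-library illustration with hypothetical names and toy data, not the routine used by MVT; it returns only the coefficients (no p-values), and the recursion is practical only for a handful of parameters.

```python
import math


def ranks(values):
    """1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for position in range(i, j + 1):
            result[order[position]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation (assumes neither sequence is constant)."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_correlation(x, y, controls):
    """Correlation of x and y after removing the linear effect of controls."""
    if not controls:
        return pearson(x, y)
    z, rest = controls[0], controls[1:]
    r_xy = partial_correlation(x, y, rest)
    r_xz = partial_correlation(x, z, rest)
    r_yz = partial_correlation(y, z, rest)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz**2) * (1 - r_yz**2))


def prcc(inputs, output):
    """inputs: dict name -> sampled values; output: model outputs per sample."""
    ranked_inputs = {name: ranks(column) for name, column in inputs.items()}
    ranked_output = ranks(output)
    return {
        name: partial_correlation(
            column,
            ranked_output,
            [c for other, c in ranked_inputs.items() if other != name],
        )
        for name, column in ranked_inputs.items()
    }


# Toy data (hypothetical): the output grows with "a" and decreases with "b".
samples = {"a": [1, 2, 3, 4, 5, 6, 7, 8], "b": [3, 1, 4, 8, 2, 7, 5, 6]}
outputs = [2 * a - b for a, b in zip(samples["a"], samples["b"])]
print(prcc(samples, outputs))
```

In this toy example the coefficient of "a" is strongly positive and that of "b" strongly negative, which is the kind of monotonic influence the PRCC is designed to detect.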

Model verification tools

Model Verification Tools (MVT) is an open-source tool¹ that offers helpful analyses to verify discrete-time stochastic simulation models. Figure 1 shows the architecture, the software, and the libraries used to develop MVT. The tool is fully developed using the Python 3.9 programming language [15], the Django² environment to create the web infrastructure, and Docker.³ Thanks to this last component, we were able to build a stand-alone software platform (a Docker container) that can be used on any operating system. This represents a major step forward with respect to its preliminary web-based implementation [16]. This version brings several improvements, among which are a considerable reduction of the latency related to large file uploads and the possibility of taking full advantage of the system resources for more complex analyses. Among the libraries used for Uncertainty and Sensitivity Analysis, “Pingouin” [17], “Scikit” [18] and “Scipy” [19] were used to perform the LHS-PRCC analysis, while the library “SALib” [20] was chosen to perform the Sobol sensitivity analysis.

¹ https://github.com/COMBINE-Group/docker_verify
² https://www.djangoproject.com/
³ Django (Version 1.5) [Computer Software]. (2013). Retrieved from https://www.djangoproject.com/

Regarding the libraries used for the deterministic model verification techniques, we used “Numpy” [21], the fundamental Python package for scientific computing. The Graphical User Interface (GUI) of MVT (Fig. 2) consists of two main menus: 1) Documentation and 2) Model Verification. The documentation menu gives a brief description of each technique and explains in detail each input parameter.

The second menu consists of five sub-menus: i) Smoothness Analysis, ii) Time step Convergence Analysis, iii) Uniqueness Analysis, iv) Sobol Analysis, and v) LHS-PRCC Analysis.

Fig. 1 Components and libraries of model verification tools. Labels recoverable from the figure: Model Verification Tools; Graphical User Interface; Deterministic Model Verification; Time Step Convergence Analysis; Smoothness Analysis; Uniqueness Analysis; Sobol Analysis; LHS-PRCC Analysis

Fig. 2 The documentation page of MVT

Smoothness analysis

The model may suffer from singularities, discontinuities, and buckling errors. The Smoothness Analysis allows these errors to be detected. To perform this analysis, the following parameters must be set (Fig. 3): i) the “Skip rows” panel allows specific rows to be ignored in the analysis, for example, header lines of the input/output files that have to be removed; ii) the “Column to analyze” panel allows the selection of the output column to be analyzed from the input/output file uploaded by the user; iii) the “Window Size” panel defines the size of the window, i.e., the choice of the k nearest neighbors for the analysis; iv) the “Separator character” panel defines the separator character of the input files (e.g., comma, space, tab); v) the “File to analyze” panel allows the user to upload the CSV or ASCII output file on which to perform the analysis. After clicking on the submit button, the analysis applies the procedure described above to the column selected by the user. Then, by choosing among the results listed in the “Your Analysis” box shown on the right side of Fig. 3, the user can look at the analysis results and the related plots.

Time step convergence analysis

Time step Convergence Analysis allows one to determine whether the model behavior converges as the time-step length becomes shorter. In this version of MVT, the numerical measures considered to evaluate the global convergence of the model are the maximum value achieved throughout the simulation (Peak Value, PV), the time-to-peak-value, the final value (FV), the Pearson Correlation Coefficient (PCC), and the root-mean-square error (RMSE). To study the model convergence, it is essential to set up the parameters from the interface (Fig. 4). This analysis takes into account essentially the same parameters described for the smoothness analysis. After clicking on the submit button, the user can retrieve the plots of the measures mentioned above, produced by the algorithm, from the “Your Analysis” box shown on the right side of Fig. 4.

Fig. 3 The smoothness analysis GUI. The box on the left side represents the list of parameters to perform the analysis; on the right side, the “Your Analysis” box contains the list of the completed results analysis
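The measures listed above can be sketched in a few lines of Python. The sketch assumes that the reference trace and the trace under test have already been aligned on common time points (how MVT aligns traces with different time-step lengths is not described here), and all names and numbers are hypothetical.

```python
import math


def summary_measures(times, values):
    """Peak value, time-to-peak-value, and final value of one output trace."""
    peak_index = max(range(len(values)), key=lambda i: values[i])
    return {
        "peak_value": values[peak_index],
        "time_to_peak": times[peak_index],
        "final_value": values[-1],
    }


def pearson(x, y):
    """Pearson Correlation Coefficient between two aligned traces."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def rmse(x, y):
    """Root-mean-square error between two aligned traces."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))


# Hypothetical traces sampled at the same time points (hours).
times = [0, 8, 16, 24, 32]
reference = [0.0, 4.0, 9.0, 6.0, 3.0]
coarse = [0.0, 3.5, 8.0, 6.5, 3.0]
print(summary_measures(times, reference))
print(pearson(reference, coarse), rmse(reference, coarse))
```

The peak value, time-to-peak-value, and final value returned by `summary_measures` are the scalar quantities to which the percentage discretization error can then be applied.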

Uniqueness analysis

As described for the previous version of the tool [16], the user can set up the analysis parameters in the GUI (Fig. 5). In particular, the analysis needs as input: i) the “Skip rows” parameter, which allows specific rows to be ignored in the analysis; ii) the “Separator Character” panel, which defines the separator character of the input files; and iii) the “Files to analyze” panel, which allows the files to analyze to be selected. After clicking on the submit button, the tool calculates the mean and Standard Deviation (SD) of all the rows among all the files. If the maximum value of the calculated SD is not equal to 0, the files are different. In this case, the tool returns a warning message in a pop-up window, showing the row and the column of the first occurrence where the SD is different from 0 (see Fig. 6); otherwise, MVT returns a success message in a pop-up window.
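The check described above can be approximated with the following Python sketch. It assumes that each output file has already been parsed into a table of numbers with the same shape, and it reports zero-based indices; the function name and the data are hypothetical, and the actual MVT code may be organized differently.

```python
import statistics


def first_nonzero_sd(runs):
    """Locate the first cell whose SD across repeated runs is not zero.

    runs: list of tables (lists of rows of numbers), one per repeated run.
    Returns (row, column) with zero-based indices, or None if all runs match.
    """
    n_rows = len(runs[0])
    n_cols = len(runs[0][0])
    for row in range(n_rows):
        for col in range(n_cols):
            values = [run[row][col] for run in runs]
            if statistics.pstdev(values) != 0:
                return row, col
    return None


# Three hypothetical runs with the same seed; the third differs in one cell.
run_a = [[1.0, 2.0], [3.0, 4.0]]
run_b = [[1.0, 2.0], [3.0, 4.0]]
run_c = [[1.0, 2.0], [3.5, 4.0]]
print(first_nonzero_sd([run_a, run_b]))         # identical runs
print(first_nonzero_sd([run_a, run_b, run_c]))  # a difference is located
```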

Fig. 4 The Time step convergence analysis GUI. The box on the left side represents the list of parameters to perform the analysis; on the right side, the “Your Analysis” box contains the list of the completed results analysis

Fig. 5 The uniqueness analysis GUI


Uncertainty analysis

In this version of MVT, we used the scikit-optimize library to integrate the Latin Hypercube Sampling (LHS) methodology and the SALib Python library to implement the Sobol Sensitivity Analysis methodology [22–24].

The user can set up the analysis parameters for LHS in the GUI presented in Fig. 7. In particular, the box allows the following to be set: i) the “Number of samples” field defines the number of samples to generate; ii) the “Seed” parameter defines the random seed of the pseudo-random generator, in order to create sample input parameter sets that can be reproduced; iii) the “Iteration” field defines the number of iterations for optimizing LHS; iv) the “Separator Character” panel defines the separator character of the input file; v) the “Input parameter file” field allows a CSV file to be uploaded, defining the model inputs on which to perform the simulations. This file must have a header and three columns defined as follows: i) param_name: the first column, which represents the name of the parameter; ii) min: the second column, which represents the minimum value of the parameter; iii) max: the third column, which represents the maximum value of the parameter. In this version of MVT, the LHS tool generates a sample set drawn from a uniform distribution of the parameters. Once the analysis is complete, it produces the LHS matrix with N rows and M columns, where N represents the number of samples and M the number of parameters. After that, the user can download the matrix and run the model on the generated parameter sets.
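As an illustration of the input file layout and of the resulting N × M matrix, the sketch below implements a basic uniform Latin Hypercube Sampling with the Python standard library. It is not the scikit-optimize routine used by MVT and it does not perform the optimization iterations; the parameter names and ranges are hypothetical.

```python
import csv
import io
import random


def read_ranges(csv_text, separator=","):
    """Parse the input parameter file: header plus param_name, min, max."""
    reader = csv.DictReader(io.StringIO(csv_text), delimiter=separator)
    return [(row["param_name"], float(row["min"]), float(row["max"])) for row in reader]


def latin_hypercube(ranges, n_samples, seed=0):
    """Return an N x M matrix with one point per stratum for every parameter."""
    rng = random.Random(seed)  # fixed seed makes the sample set reproducible
    columns = []
    for _name, low, high in ranges:
        # Draw one uniform point inside each of the N equal-width strata ...
        points = [
            low + (high - low) * (i + rng.random()) / n_samples
            for i in range(n_samples)
        ]
        # ... then shuffle so that strata are paired at random across parameters.
        rng.shuffle(points)
        columns.append(points)
    return [list(row) for row in zip(*columns)]


# Hypothetical parameter file with two parameters.
parameter_file = "param_name,min,max\nparam_a,0,10\nparam_b,100,200\n"
ranges = read_ranges(parameter_file)
for row in latin_hypercube(ranges, n_samples=5, seed=42):
    print(row)
```

Each row of the matrix is one input set on which the model can be run; the outputs of those runs are what the PRCC step consumes.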

MVT also allows the user to generate samples with another algorithm, based on the Saltelli methodology [20, 23], using the SALib library. Specifically, the user can use the appropriate GUI (Fig. 8) to choose the parameters for sample generation: i) the "Number of combinations" panel sets the number of combinations N from which the samples are generated; ii) "skip values", according to the SALib library, is the number of points in the Sobol’ sequence to skip in order to get different samples; it is worth mentioning that this value must be a power of 2; iii) "Separator Character" defines the separator character; and iv) "Input Parameter File" allows the user to choose a CSV file with the model parameters, their value ranges, and the type of distribution to be used for the generation of each specific parameter. The CSV file must have a header with the following structure: i) param_name: the name of the parameter; ii) first_value: this value depends on the type of distribution and represents the minimum value if the distribution field is "uniform"; otherwise, it represents the mean; iii) second_value: this field also depends on the type of distribution chosen and represents the maximum if the distribution field is "uniform"; otherwise, it represents the standard deviation; iv) distribution: the type of distribution used to generate the samples for each parameter. The allowed values are "unif", "norm" and "lognorm", which represent the "uniform", "normal" and "lognormal" distributions, respectively. After clicking on the submit button, the algorithm produces a matrix with N * (2D + 2) rows, where D is the number of input parameters and N is the number of combinations. The number of rows in the matrix is equal to the number of samples.

Fig. 6 An example of the output of the uniqueness analysis, where the files are not the same. Information about the row and the column in which SD is not equal to zero is also provided

Fig. 7 The Latin hypercube sampling analysis GUI
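The size of the generated matrix follows directly from the formula above. The short Python sketch below (with hypothetical helper names) computes the number of rows and checks the power-of-2 constraint on the skip value; it does not call SALib itself.

```python
def saltelli_row_count(n_combinations: int, n_parameters: int) -> int:
    """Number of rows (samples) in the generated matrix: N * (2D + 2)."""
    return n_combinations * (2 * n_parameters + 2)


def is_power_of_two(value: int) -> bool:
    """True for 1, 2, 4, 8, ...; used to validate the skip value."""
    return value > 0 and value & (value - 1) == 0


# With N = 16 combinations and D = 2 parameters the matrix has 96 rows.
print(saltelli_row_count(16, 2))
print(is_power_of_two(16), is_power_of_two(12))
```

Because the number of model runs grows linearly with both N and D, this count is useful for estimating the computational cost before launching the simulations.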

Sensitivity analysis: Sobol analysis

This analysis is used to evaluate the sensitivity of the model to the input parameters using the matrix obtained from the Sobol sample generation procedure. The user can perform this analysis using the appropriate GUI (Fig. 9), which allows the following parameters to be set: i) “Separator Character for Input parameter file” and “Separator Character for Output file from the model” define the separator characters for the parameter file and for the output files produced by the model; ii) “Column to analyze” represents the column on which to perform the analysis; iii) “Input parameter file” represents the same file used in Sobol sample generation; iv) the “Output files from the model” panel is used for uploading the model output files, in ASCII or CSV format, without header. To perform the analysis, it is mandatory to rename the files according to a predefined scheme. The output model files should be named "0_1.csv", "0_2.csv", and so on, with the second index identifying the row of the Sample Generation output matrix used as input to obtain that model output. For example, "0_1.csv" represents the model output obtained from the first input row, while "0_2.csv" represents the model output obtained from the second row of the Sample Generation output matrix.

Fig. 8 The SOBOL sample generation analysis GUI
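A small Python helper can generate the expected file names for a given number of sample rows. The helper names are hypothetical, and the leading index is kept fixed at 0 because only the names "0_1.csv" and "0_2.csv" are given above; what that first index encodes is not stated.

```python
def output_file_name(row_number: int, first_index: int = 0, extension: str = "csv") -> str:
    """File name for the model output produced by the given 1-based sample row."""
    return f"{first_index}_{row_number}.{extension}"


def expected_output_files(n_rows: int) -> list:
    """Names of all output files expected for a sample matrix with n_rows rows."""
    return [output_file_name(row) for row in range(1, n_rows + 1)]


# For a sample matrix with 96 rows, 96 output files are expected.
names = expected_output_files(96)
print(names[0], names[1], names[-1])
```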

Sensitivity analysis: PRCC analysis

Once the input parameters have been generated with the LHS procedure, and the simulations have been run on such parameters, the Partial Rank Correlation Coefficient (PRCC) procedure for finalizing the LHS-PRCC analysis [25] can be executed. The user can then check the evolution of the PRCC values over time (PRCC_OT) to understand how the relationship between inputs and outputs evolves over time, and/or visualize the PRCC results at specific time steps (PRCC_STS); this allows a better understanding of the correlation between the input parameters and the output of the model. In the PRCC over-time analysis, the GUI (the box on the left side in Fig. 10) takes into account the same input parameters described for the other tools, along with the following ones: i) “Time points interval” allows the data from the “Column to analyze” to be picked out for a specific time point interval selected by the user; ii) “Threshold p-value” allows the user to set the threshold for the visualization of the level of significance. This analysis produces a pop-up (Fig. 11) that allows the user to download a PDF file containing the temporal correlation plots. Furthermore, a JSON file (Fig. 12) listing, for each parameter under investigation, the time points at which the p-value passes the threshold set by the user (meaning that the correlation is significant) is also made available.

MVT provides the PRCC_STS GUI (the central box of Fig. 10) to analyze the relationship between the input and output parameters at specific time points. This analysis takes the same parameters defined for the PRCC_OT but replaces the “Time point interval” parameter with “Time step.” At the end, PRCC_STS produces a pop-up window that allows users to download and visualize a scatter plot for each parameter under study. Such plots are of significant importance, as they can graphically reveal possible non-monotonic correlations that are not usually detected by the standard PRCC procedure.

Fig. 9 The SOBOL analysis GUI. The box on the left side represents the list of parameters necessary to perform the analysis; on the right side, the “Your Analysis” box contains the list of the completed analysis results

Fig. 10 The partial rank correlation coefficients analysis GUI. The box on the left side represents the list of parameters to perform the PRCC over time analysis; while the box on the center side represents the list of parameters to perform the PRCC to a specific time step. On the right side, the “Your Analysis” box contains the list of the completed analysis results

Fig. 11 A pop-up example to download the PRCC over time plots and the time correlation ﬁles


Results and discussion

We applied the Deterministic Model Verification and the Uncertainty and Sensitivity analysis techniques to UISS for the SARS-CoV-2 scenario (UISS-SARS-CoV-2). UISS-SARS-CoV-2 is the implementation of the COVID-19 disease model in UISS. Hence, it inherits the immune system machinery originally developed inside UISS. UISS-SARS-CoV-2 was further extended to reflect the dynamics of COVID-19 [26]. Within UISS, it is possible to change the time-step length of the simulation. In other words, it is possible to simulate with a time-step length of 8 h, 20 min, or 5 min. Within the context of UISS-SARS-CoV-2, we ran 9 simulations with time-step lengths between 8 h and 5 min, each with a total simulated duration of 4 months.

Fig. 12 A sketch of time correlation JSON files

We assumed that the reference trace has a time-step length of 5 min. Then, we used MVT to run the time step convergence analysis on the active TH-1 cells. Panel A of Fig. 13 shows the trends of the active TH-1 cells at the different time-step lengths. Panel B shows the PCC and the RMSE computed between the reference trace and each of the other traces. It is worth mentioning that, at the end of the plot, the value of PCC is about 0.6 and the value of RMSE remains stable. The last step of the time step convergence analysis uses the formula described above to calculate the convergence of the time-to-peak value and of the final value; the corresponding plot is shown in Panel C. The x-axis of Panels B and C shows the number of iterations, that is, the number of steps needed to reach the end of the simulation, which depends on the time-step length. According to the time step convergence analysis, the results obtained for the time-to-peak value and the final value suggest that convergence is achieved with a time-step length of 15 min, which corresponds to 11,520 iterations.
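The comparison between the reference trace and a coarser trace can be sketched as follows. This is a minimal illustration, not the MVT implementation. It assumes that the reference trace is linearly interpolated at the time points of the coarser trace before PCC and RMSE are computed; the alignment actually used by MVT may differ. The function name `pcc_and_rmse` and the synthetic traces are hypothetical.

```python
import numpy as np


def pcc_and_rmse(t_ref, y_ref, t_other, y_other):
    """Return (PCC, RMSE) between a reference trace and another trace.

    Assumption: the reference is linearly interpolated at the time points
    of the other (coarser) trace so that both have the same length.
    """
    t_ref = np.asarray(t_ref, dtype=float)
    y_ref = np.asarray(y_ref, dtype=float)
    t_other = np.asarray(t_other, dtype=float)
    y_other = np.asarray(y_other, dtype=float)

    y_ref_aligned = np.interp(t_other, t_ref, y_ref)
    pcc = np.corrcoef(y_ref_aligned, y_other)[0, 1]
    rmse = np.sqrt(np.mean((y_ref_aligned - y_other) ** 2))
    return pcc, rmse


def toy_output(t_minutes):
    """Synthetic rise-and-decay curve; it stands in for a simulator output."""
    t = np.asarray(t_minutes, dtype=float)
    return t * np.exp(-t / 20000.0)


total_minutes = 120 * 24 * 60                    # assumption: 4 months = 120 days
t_ref = np.arange(0, total_minutes + 1, 5)       # 5 min reference grid
t_coarse = np.arange(0, total_minutes + 1, 15)   # 15 min grid

# Identical dynamics sampled on a coarser grid.
pcc_same, rmse_same = pcc_and_rmse(t_ref, toy_output(t_ref), t_coarse, toy_output(t_coarse))

# Same shape but shifted by a constant offset of 2.0.
pcc_shift, rmse_shift = pcc_and_rmse(
    t_ref, toy_output(t_ref), t_coarse, toy_output(t_coarse) + 2.0
)
```

The shifted case illustrates why both metrics are reported together: a constant offset leaves the PCC essentially at 1 while the RMSE equals the offset, so the PCC alone would not reveal the discrepancy.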

Fig. 13 Time step convergence of the active TH-1 output. Panel A shows the dynamics of the output over time at different time-step lengths. Panel B shows the Pearson correlation coefficient and the root mean square error at different time-step lengths. Panel C shows the convergence of the time-to-peak value and the final value of the output

For this reason, the subsequent analyses were carried out using the outputs obtained from the simulation with a time-step length of 15 min. The next step was the smoothness analysis of the active TH-1 output (Fig. 14). The data in the first part of the plot show sudden peaks caused by the TH-1 response to specific antigens. Then, we performed the uniqueness analysis to check whether repeated executions with the same input parameter set lead to identical output. To this end, we ran UISS-SARS-CoV-2 three times with the same set of input parameters and obtained identical outputs.

Fig. 14 Smoothness analysis of the active TH-1 output. The data in the first part of the plot show sudden peaks caused by the TH-1 response to specific antigens

Sobol is the first technique applied to carry out the uncertainty and sensitivity analysis of UISS-SARS-CoV-2. We chose to evaluate the sensitivity of the simulator by varying the values of two parameters: (i) Num_Ag, the number of inoculated antigens, and (ii) AbMultifact, the number of antibodies secreted by plasma B cells, each varied within its own range. It is important to note that each parameter was sampled from a uniform distribution. Then, through the GUI for Sobol sample generation, we set the 'number of combinations' parameter to 16 in order to generate 96 samples. After running the simulations, we analyzed the relationship between the input set and, respectively, the active TH-1 cells and IgG. Panel A of Fig. 15 shows the relationship between the two input parameters (Num_Ag and AbMultifact) and the levels of active TH-1 cells, while Panel B shows the sensitivity results obtained for the IgG values. These figures highlight how, in our model, the two parameters affect the selected outputs differently. It is important to remember that the greater the sensitivity indices are, the more critical the parameters are for the model output. Therefore, we can observe how the dynamics of IgG antibodies are more sensitive to the variation of the number of inoculated antigens due to the low y value in Panel B, while AbMultifact mainly affects the dynamics of active TH-1.

Fig. 15 The Sobol analysis output. Panels A and B show the sensitivity of the simulator to the "Num_Ag" and "AbMultifact" parameters with respect to active TH-1 cells and IgG, respectively
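The sample count reported for the Sobol analysis follows from simple arithmetic if one assumes a Saltelli-type cross-sampling design with second-order terms, in which N base points and D parameters give N(2D + 2) model runs; with N = 16 and D = 2 this is 96. Whether MVT builds its design exactly this way is an assumption here. The sketch below only illustrates the structure of such a design: it draws the base matrices with a pseudo-random generator instead of a Sobol sequence, the function name `saltelli_style_design` is hypothetical, and the bounds are placeholders because the actual parameter ranges are not given in the text.

```python
import numpy as np


def saltelli_style_design(n_base, bounds, seed=0):
    """Build a cross-sampling design with n_base * (2 * D + 2) rows.

    Simplification: the base matrices A and B are pseudo-random uniform
    draws here, not points of a Sobol low-discrepancy sequence.
    """
    rng = np.random.default_rng(seed)
    bounds = np.asarray(bounds, dtype=float)   # shape (D, 2): [low, high]
    low, high = bounds[:, 0], bounds[:, 1]
    n_params = len(bounds)

    a = rng.uniform(low, high, size=(n_base, n_params))
    b = rng.uniform(low, high, size=(n_base, n_params))

    blocks = [a, b]
    for i in range(n_params):
        ab = a.copy()
        ab[:, i] = b[:, i]   # A with column i taken from B
        ba = b.copy()
        ba[:, i] = a[:, i]   # B with column i taken from A
        blocks.extend([ab, ba])
    return np.vstack(blocks)


# Hypothetical placeholder bounds: the real ranges are not stated in the text.
PLACEHOLDER_BOUNDS = {"Num_Ag": (0.0, 1.0), "AbMultifact": (0.0, 1.0)}

design = saltelli_style_design(16, list(PLACEHOLDER_BOUNDS.values()), seed=2021)
print(design.shape)  # one row per simulation to run, one column per parameter
```

Each row of `design` is one parameter set to simulate. The blocks that differ from A or B in a single column are what allow variance-based indices to isolate the contribution of each parameter.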

LHS-PRCC is the second technique applied to perform the uncertainty and sensitivity analysis of the model. The parameters and the type of distribution used here are the same as those of the previous analysis. To generate the LHS matrix, the parameters of the GUI were set as follows: (i) 'number of samples' equal to 96; (ii) 'Seed' equal to 2021; (iii) 'Iterations' equal to 1000. The simulations were then run on the LHS matrix. After that, we applied the PRCC_OT and PRCC_STS analyses to study the relationship between the parameter set and the active TH-1 cells and IgG. The PRCC_OT plots are shown in Panels A and C of Fig. 16. Both panels include a dummy curve (red line): it does not affect the model in any way, but it is useful as a baseline for comparing the parameters that do have an effect on the model output. Panel A shows a strong correlation between Num_Ag and active TH-1 at the start of the simulations (highlighted in gray), which turns into a weak correlation between 0.1 × 10^7 s and 0.2 × 10^7 s and then again into a strong correlation until the end of the simulations. Panel C shows a weak correlation between AbMultifact and the levels of active TH-1 cells at the start of the simulations, which then turns into a strong correlation until the end of the simulations. After that, we ran the PRCC_STS analysis at the specific time step of 5,400,000 s, obtaining the plots shown in Panels B and D of Fig. 16. These scatter plots depict the influence of the Num_Ag and AbMultifact variables (input, x-axis), respectively, on the selected output value (y-axis). Both input and output values are represented as ranks, so that monotonic non-linear relationships appear linear. Scatter plots may be useful to visually detect the presence of non-monotonic relationships, which are not usually revealed by the PRCC value alone. In this case, Panel B of Fig. 16 shows a weak positive relationship at time step 5,400,000 s between the Num_Ag variable and the output value (PRCC: 0.3128). This can be seen by observing that the distribution of the points vaguely approximates an increasing straight line. Conversely, Panel D of Fig. 16 shows a weak negative relationship between the AbMultifact variable and the output value (PRCC: 0.3128), with a distribution of points that vaguely approximates a decreasing straight line.

Fig. 16 The output of PRCC_OT and PRCC_STS. Panels A and B show the sensitivity of the simulator to "Num_Ag" over time and at a specific time step, respectively. Panels C and D show the sensitivity of the simulator to "AbMultifact" over time and at a specific time step, respectively. The scatter plots in Panels B and D depict the influence of the Num_Ag and AbMultifact variables (input, x-axis) on the selected output value (y-axis), respectively
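The two ingredients of the LHS-PRCC analysis can be sketched in a few lines. This is a minimal sketch under stated assumptions, not the MVT code. The LHS routine draws one point per equal-probability stratum and shuffles each column, without the optimization that the 'Iterations' setting presumably controls. The PRCC routine rank-transforms inputs and output, removes the linear effect of the other ranked inputs, and correlates the residuals. The function names, the placeholder bounds (0, 1) and the toy output are hypothetical; only the sample count (96) and the seed (2021) come from the text.

```python
import numpy as np


def latin_hypercube(n_samples, bounds, seed=None):
    """Basic LHS: one draw per equal-probability stratum, shuffled per column."""
    rng = np.random.default_rng(seed)
    bounds = np.asarray(bounds, dtype=float)   # shape (D, 2): [low, high]
    n_params = len(bounds)
    # Stratum index plus uniform jitter, scaled to [0, 1).
    unit = (np.arange(n_samples)[:, None] + rng.random((n_samples, n_params))) / n_samples
    for j in range(n_params):
        unit[:, j] = rng.permutation(unit[:, j])
    return bounds[:, 0] + unit * (bounds[:, 1] - bounds[:, 0])


def ranks(values):
    """Ranks 0..n-1 along axis 0 (ties are not averaged in this sketch)."""
    return np.argsort(np.argsort(values, axis=0), axis=0).astype(float)


def prcc(inputs, output):
    """Partial rank correlation coefficient of each input column with the output."""
    x = ranks(np.asarray(inputs))
    y = ranks(np.asarray(output))
    n_samples, n_params = x.shape
    result = np.empty(n_params)
    for j in range(n_params):
        others = np.column_stack([np.ones(n_samples), np.delete(x, j, axis=1)])
        # Residuals after removing the linear effect of the other ranked inputs.
        res_x = x[:, j] - others @ np.linalg.lstsq(others, x[:, j], rcond=None)[0]
        res_y = y - others @ np.linalg.lstsq(others, y, rcond=None)[0]
        result[j] = np.corrcoef(res_x, res_y)[0, 1]
    return result


# Placeholder bounds (0, 1) for both parameters; the real ranges are not in the text.
lhs_samples = latin_hypercube(96, [(0.0, 1.0), (0.0, 1.0)], seed=2021)

# Toy output: monotonically increasing in the first input, decreasing in the second.
toy_values = np.exp(2.0 * lhs_samples[:, 0]) - 3.0 * lhs_samples[:, 1]
toy_prcc = prcc(lhs_samples, toy_values)
```

For a PRCC over time, the same `prcc` call would be repeated on the output values at each time point, while a single call at one chosen time point corresponds to the specific-time-step variant. A dummy parameter can be added as an extra input column that the model ignores, so that its PRCC provides a baseline for the others.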

Conclusions

Mechanistic agent-based models are increasingly employed for developing in silico trial solutions for medicinal products. Consequently, to lower the barriers to their regulatory acceptance, assessing their credibility is mandatory. Formal methodologies for the verification of agent-based models should be developed and widely adopted. The described approach proposes a set of automatic tools that help to formally verify the deterministic part of mechanistic agent-based models. This allows researchers and practitioners to easily perform the verification steps needed to demonstrate ABM robustness and correctness, providing strong evidence for further regulatory requirements. As ABMs usually have a stochastic component, statistical consistency and minimum sample size determination also need to be addressed. We are working on this issue and will extend the current model verification framework in due course.

Availability andrequirements

Project name: Model Veriﬁcation Tools

Project home page: https:// github. com/ COMBI NE- Group/ docker_ verify

Operating system(s): Platform independent

Programming language: Python

Other requirements: Docker

Any restrictions to use by non-academics: not applicable.

Abbreviations

VV&UQ: Verification, validation and uncertainty quantification; ISTs: In silico trials; ABMs: Agent-based models; MVT: Model verification tools; FITA: Fixed increment time advance; LHS-PRCC: Latin hypercube sampling-partial rank correlation coefficient; LHS: Latin hypercube sampling; PRCC: Partial rank correlation coefficient; GUI: Graphical user interface; PV: Peak value; FV: Final value; PCC: Pearson correlation coefficient; RMSE: Root mean square error; SD: Standard deviation; UISS: Universal immune system simulator.

Acknowledgements

Not applicable.


About this supplement

This article has been published as part of BMC Bioinformatics Volume 22 Supplement 14 2021: Selected papers from the 4th International Workshop on Computational Methods for the Immune System Function (CMISF 2020). The full contents of the supplement are available at https://bmcbioinformatics.biomedcentral.com/articles/supplements/volume-22-supplement-14.

Author contributions

GR: provided regulatory context knowledge in modeling and simulation, checked the biological adherence and meaning, evaluated the workflow, drafted the manuscript. GAPP: performed numerical simulations, developed the Python Model Verification Tools, wrote the manuscript. MP: performed numerical verification consistency, analyzed data, and wrote the manuscript. FP: conceived the model, gave computational immunological knowledge, provided regulatory knowledge in in silico trials, supervised the project. All authors have read and approved the final manuscript.

Funding

Authors of this paper acknowledge support from the STriTuVaD project. The STriTuVaD project has been funded by the European Commission and Indian Department of Biotechnology, under the contract H2020-SC1-2017-CNECT-2, No. 777123. The information and views set out in this article are those of the authors and do not necessarily reflect the official opinion of the European Commission. Neither the European Commission institutions and bodies nor any person acting on their behalf may be held responsible for the use which may be made of the information contained therein. Publication costs are funded by the STriTuVaD project.

Availability of data and materials

The main computational framework is fully described in the paper. The UISS-SARS-CoV-2 framework used for this research is available at: https://combine.dmi.unict.it/UISS-COVID19/. The MVT tool is available at: https://github.com/COMBINE-Group/docker_verify.

Declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Author details

1 Department of Drug and Health Sciences, University of Catania, 95125 Catania, Italy. 2 Department of Mathematics and Computer Science, University of Catania, 95125 Catania, Italy. 3 Computer Science Institute, DiSIT, University of Eastern Piedmont, 15121 Alessandria, Italy.

Received: 5 April 2022 Accepted: 11 April 2022

Published: 19 May 2022

References

1. Viceconti M, Cobelli C, Haddad T, Himes A, Kovatchev B, Palmer M. In silico assessment of biomedical products: the conundrum of rare but not so rare events in two case studies. Proc Inst Mech Eng H. 2017;231:455–66. https://doi.org/10.1177/0954411917702931.
2. Pappalardo F, Russo G, Tshinanu FM, Viceconti M. In silico clinical trials: concepts and early adoptions. Brief Bioinform. 2019;20:1699–708. https://doi.org/10.1093/bib/bby043.
3. Schruben LW. Establishing the credibility of simulations. SIMULATION. 1980;34:101–5. https://doi.org/10.1177/003754978003400310.
4. Patterson EA, Whelan MP. A framework to establish credibility of computational models in biology. Prog Biophys Mol Biol. 2017;129:13–9.
5. Walker EG, Baker AF, Sauer JM. Promoting adoption of the 3Rs through regulatory qualification. ILAR J. 2016;57:221–5. https://doi.org/10.1093/ilar/ilw032.
6. Mirams GR, Pathmanathan P, Gray RA, Challenor P, Clayton RH. Uncertainty and variability in computational and mathematical models of cardiac physiology. J Physiol. 2016;594:6833–47. https://doi.org/10.1113/JP271671.
7. Davies MN, Pere H, Bosschem I, Haesebrouck F, Flahou B, Tartour E, et al. In silico adjuvant design and validation. Methods Mol Biol. 2017;1494:107–25. https://doi.org/10.1007/978-1-4939-6445-1_8.
8. IEEE Standard for Software Quality Assurance Processes. IEEE Std 730-2014 (Revision of IEEE Std 730-2002). 2014;1–138.
9. American Society of Mechanical Engineers. Assessing credibility of computational modeling through verification and validation: application to medical devices - V&V40-2018. ASME V&V 40-2018. 2018;60. https://www.asme.org/codes-standards/find-codes-standards/v-v-40-assessing-credibility-computational-modeling-verification-validation-application-medical-devices. https://www.asme.org/products/codes-standards/vv-40-2018-assessing-credibility-computationa. Accessed 27 Jul 2021.
10. Gordon Schulmeyer G. Handbook of Software Quality Assurance Fourth Edition.


11. Viceconti M, Juarez MA, Curreli C, Pennisi M, Russo G, Pappalardo F. Credibility of in silico trial technologies - a theoretical framing. IEEE J Biomed Heal Inf. 2020;24:4–13. https://doi.org/10.1109/JBHI.2019.2949888.
12. Pennisi M, Russo G, Motta S, Pappalardo F. Agent based modeling of the effects of potential treatments over the blood–brain barrier in multiple sclerosis. J Immunol Methods. 2015;427:6–12. https://doi.org/10.1016/j.jim.2015.08.014.
13. Pennisi M, Russo G, Ravalli S, Pappalardo F. Combining agent based-models and virtual screening techniques to predict the best citrus-derived vaccine adjuvants against human papilloma virus. BMC Bioinf. 2017;18:544. https://doi.org/10.1186/s12859-017-1961-9.
14. Curreli C, Pappalardo F, Russo G, Pennisi M, Kiagias D, Juarez M, et al. Verification of an agent-based disease model of human Mycobacterium tuberculosis infection. Int J Numer Methods Biomed Eng. 2021;March:e3470.
15. Lindstrom G. Programming with Python. IT Prof. 2005;7:10–6.
16. Palumbo GAP, Russo G, Sgroi G, Viceconti M, Pennisi M, Curreli C, et al. Verify: a toolbox for deterministic verification of computational models. In: Proceedings - 2020 IEEE international conference on bioinformatics and biomedicine, BIBM 2020; 2020.
17. Vallat R. Pingouin: statistics in Python. J Open Source Softw. 2018;3:1026. https://doi.org/10.21105/joss.01026.
18. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2012;12:2825–30.
19. Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods. 2020;17:261–72. https://doi.org/10.1038/s41592-019-0686-2.
20. Herman J, Usher W. SALib: an open-source python library for sensitivity analysis. J Open Source Softw. 2017;2:97. https://doi.org/10.21105/JOSS.00097.
21. Harris CR, Millman KJ, van der Walt SJ, Gommers R, Virtanen P, Cournapeau D, et al. Array programming with NumPy. Nature. 2020;585:357–62. https://doi.org/10.1038/s41586-020-2649-2.
22. Sobol IM. Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. Math Comput Simul. 2001;55:271–80.
23. Saltelli A. Making best use of model evaluations to compute sensitivity indices. Comput Phys Commun. 2002;145:280–97. https://doi.org/10.1016/S0010-4655(02)00280-1.
24. Saltelli A, Annoni P, Azzini I, Campolongo F, Ratto M, Tarantola S. Variance based sensitivity analysis of model output. Design and estimator for the total sensitivity index. Comput Phys Commun. 2010;181:259–70. https://doi.org/10.1016/j.cpc.2009.09.018.
25. Marino S, Hogue IB, Ray CJ, Kirschner DE. A methodology for performing global uncertainty and sensitivity analysis in systems biology. J Theor Biol. 2008;254:178–96. https://doi.org/10.1016/j.jtbi.2008.04.011.
26. Russo G, Pennisi M, Fichera E, Motta S, Raciti G, Viceconti M, et al. In silico trial to test COVID-19 candidate vaccines: a case study with UISS platform. BMC Bioinformatics. 2020;21:527. https://doi.org/10.1186/s12859-020-03872-0.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

