ArticlePDF Available

Polynomial superlevel set representation of the multistationarity region of chemical reaction networks

Authors:

Abstract and Figures

In this paper we introduce a new representation for the multistationarity region of a reaction network, using polynomial superlevel sets. The advantages of using this polynomial superlevel set representation over the already existing representations (cylindrical algebraic decompositions, numeric sampling, rectangular divisions) is discussed, and algorithms to compute this new representation are provided. The results are given for the general mathematical formalism of a parametric system of equations and so may be applied to other application domains.
A simple example of a closed network consisting of one reaction. a Two molecules of superoxide and two hydron atoms react to each other and produce one molecule of dioxygen and a molecule of hydrogen perixide. The reaction rate here follows the mass-action kinetics with the reaction rate constant k. b The system of ODE equations of the network. The concentrations of O2-\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\hbox {O}_{2}}^{-}$$\end{document}, H+\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {H}^{+}$$\end{document}, O2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {O}_{2}$$\end{document} and H2O2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hbox {H}_{2}\hbox {O}_{2}$$\end{document} are denoted by x1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_1$$\end{document}, x2,x3\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_2, x_3$$\end{document} and x4\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_4$$\end{document} respectively
… 
Three representations of the multistationarity region of the network in Fig. 3a after fixing all parameter values other than k3\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k_3$$\end{document} and k8\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k_8$$\end{document} to the values in (1). a CAD gives the exact boundary of L1(Φf0)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_1(\Phi _f^0)$$\end{document} and L3(Φf0)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_3(\Phi _f^0)$$\end{document}. The first one is colored by sky blue and the later one with yellow. b A sampling representation of the parameter region B=[(0.0005,0),(0.001,2)]\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$B=[(0.0005,0),(0.001,2)]$$\end{document} by 1000 random points sampled from uniform distribution on B. 78 of these points belong to L3(Φf0)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_3(\Phi _f^0)$$\end{document} and are colored yellow. The other 922 points belong to L1(Φf0)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_1(\Phi _f^0)$$\end{document} and are colored by sky blue. c A rectangular representation of B. Each subrectangle is colored with respect to the average number of solutions for 10 random points sampled from the uniform distribution on the subrectangle. The color bar of the figure is in the right side
… 
This content is subject to copyright. Terms and conditions apply.
Polynomial superlevel set representation
ofthemultistationarity region ofchemical
reaction networks
AmirHosein Sadeghimanesh* and Matthew England*
Introduction
Many problems in applied sciences can be modelled by a parametric polynomial system,
and therefore to solve such problems we must be able to explore the properties of these
systems. In particular, we often seek to identify areas of the parameter space where a
property holds. e contribution of this paper is a new methodology for exploring these.
Motivation: multistationarity regions ofCRNs
We are motivated by the problem of understanding the multistationarity behavior of a
Chemical Reaction Network (CRN). In a CRN, variables represent the concentrations of
the species. ese change as time passes and are studied as part of the field of dynami-
cal systems. is is of polynomial type when the kinetics is assumed to follow the mass
action rules. e equilibria of such a dynamical system are therefore the solutions to a
system of polynomial equations. However, the coefficients of the terms on the polynomi-
als may involve some parameters. ese parameters are usually the rates under which a
reaction occurs and the total amounts (thought of as a dependency on the initial concen-
tration of the species).
Both the variables and parameters can only attain non-negative real values. A network
is called multistationary if there exists a choice of parameters for which the network has
Abstract
In this paper we introduce a new representation for the multistationarity region of a
reaction network, using polynomial superlevel sets. The advantages of using this poly-
nomial superlevel set representation over the already existing representations (cylindri-
cal algebraic decompositions, numeric sampling, rectangular divisions) is discussed,
and algorithms to compute this new representation are provided. The results are given
for the general mathematical formalism of a parametric system of equations and so
may be applied to other application domains.
Keywords: Polynomial superlevel set, Steady states, Multistationarity, Parameter
analysis
Open Access
© The Author(s) 2022. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits
use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original
author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third
party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the mate-
rial. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or
exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://
creat iveco mmons. org/ licen ses/ by/4. 0/. The Creative Commons Public Domain Dedication waiver (http:// creat iveco mmons. org/ publi
cdoma in/ zero/1. 0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
RESEARCH
Sadeghimaneshand England
BMC Bioinformatics (2022) 23:391
https://doi.org/10.1186/s12859-022-04921-6
BMC Bioinformatics
*Correspondence:
Amirhossein.
Sadeghimanesh@coventry.ac.uk;
Matthew.England@coventry.
ac.uk
Research Centre
for Computational Sciences
and Mathematical Modelling,
Coventry University, Coventry, UK
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 2 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
more than one equilibrium. ere are already many algorithms developed for answering
the binary question of whether a system can exhibit multistationarity [19]. e input of
these algorithms is a reaction network and the output is the confirmation or rejection of
the possibility of exhibiting multistationary behavior.
In the case where multistationarity can exist it then becomes important to determine
the parameters where the network has this behavior. ere has been less progress in this
direction in the literature to date: the present paper offers a promising new development
for this problem.
Prior work
Reviewing the state of the art in the literature, we see one vein of work focused on spe-
cific reaction networks, with success following heuristic or manual calculations to find a
suitable parameter which may not work in generality [10, 11]. en, in another vein of
work the system of equations for finding equilibria are solved for many random points
from the parameter space to approximate the region where the network is multistation-
ary [1215].
Recently in [16] a new approach to get a description of the multistationarity region is
proposed. In this method one does not need to solve the system of equations to count
the number of equilibriums. Instead one computes an integral to get the expected num-
ber of equilibriums when the parameters are following a random distribution. is
method partitions the parameter region into subsets that are a Cartesian product of
intervals, called hyperrectangles. By choosing the uniform distribution and computing
the average number of equilibriums on these hyperrectangles, one can approximate the
multistationarity region as a union of sub-hyperrectangles. While efficient and widely
applicable, this list of hyperrectangles does not allow the reader much information or
intuition about the geometry of this region, such as connectedness or convexity.
Contribution
In this work, we propose using polynomial superlevel sets to approximate the union of
the hyperrectangles from [16] as a set that can be described by one polynomial. Polyno-
mial superlevel sets are already employed to approximate semi-algebraic sets and have
been used in control and robust filtering contexts, see [17, 18]. e polynomial super-
level set representation we propose is a more compact representation of the region
compared to a list of many hyperrectangles each described as a Cartesian product of
intervals. Further, to check if a point belongs to the region given in this representation
one can easily just evaluate the polynomial in this point. Further benefits of the polyno-
mial superlevel set description of the region will be explored later.
Organization ofthepaper
e organization of this paper is as follows. e mathematical framework of reaction
networks and the definition of the multistationarity region is given first followed by a
section containing the notations regarding parametric functions and definitions of the
sampling and the rectangular representations of the multistationarity region from [19,
Section2.4].
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 3 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
Wethen define polynomial superlevel sets formally and describe how one can algo-
rithmically find a polynomial superlevel set representation of a set using the sampling
and the rectangular representations. We demonstrate how to use it to find the polyno-
mial superlevel set representation of the multistationarity region of a reaction network.
In thefinal section we discuss methods that sometimes can speed up computation of the
polynomial superlevel set representation by the help of bisecting algorithms and where
possible, algorithms for computing the expected number of solutions independently of
solving the system itself.
Notations
e cardinality of a set A is denoted by
#(A)
. Let
xZ
and
nZ\{0}
. In this paper
we define x modulo n to be n instead of 0 whenever x is a multiple of n. For a function
f:A1A2
and a point
uA2
, the level set of f is denoted by
Lu(f)
and is defined as
. For two points
a=(a1,...,an)
and
b=(b1,...,bn)
in
Rn
, the nota-
tion [a,b] is used to show the hyperractangle
n
i=1[
a
i
,b
i]
. For a subset S of a hyperrec-
tangle
BRn
, let
Vol(S)
denote the normalized volume of S with respect to B, i.e.
When a random vector
X=(X1,...,Xn)
is distributed by a uniform distribution on a
set
SRn
, we write
XU(S)
. If X is distributed by a normal distribution with mean
µRn
and variance
σ2R>0
, then we write
XN
,
σ2)
and we mean that
X1,...,Xn
are identically and independently distributed by
Ni
,
σ2)
. e expectation of g(X) when
X is distributed by a probability distribution q is denoted by
E
g(X)
|
X
q
.
Computer information
All computations for the examples of this paper were done on a computer with the
following information. Processor: Intel(R) Core(TM) i7-10850H CPU
@2.70GHz 2.71 GHz. Installed memory (RAM): 64.0 GB (63.6 GB usa-
ble). System type: 64-bit Operating System, x64-based processor.
e software and programming languages used for the computations reported in this
paper had the following version numbers: Maple 2021, Matlab R2021a, YALMIP,
SeDuMi 1.3, Julia 1.6.2, MCKR 1.0.
Multistationarity region ofchemical reaction networks
In this section, we introduce the concepts of reaction network theory that are needed
throughout the rest of the paper, with the help of a simple gene regulatory network
example.
One can think of a gene as a unit encoding information for the synthesis of a prod-
uct such as a protein. First, a group of DNA binding proteins called transcription fac-
tors bind a region of the gene called promoter. Now an enzyme called RNA polymerase
starts reading the gene and produces an RNA until it arrives in the terminator region of
the gene. e process until here is called the transcription step. After transcription is
completed, the resulting RNA leaves the nucleus (in eukaryotes) and reaches ribosomes.
In ribosomes, the second step, called translation, gets started. Ribosomes assemble a
Vol
(S)
=
Vol
(
S
)
Vol
(B)
.
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 4 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
protein from amino acids using the manual guide written in the RNA. A gene encod-
ing of a protein recipe is said to be expressed when it gets transcribed to an RNA, and
the RNA translated to the protein. A gene is not always expressed in a constant rate.
ere might be proteins that bind the transcription factors or the promoter region and,
as a result, inhibit the RNA polymerase starting the transcription process. On the other
hand, there might be other proteins in which their binding to the transcription factors or
the promoter region enhances the transcription.
Consider a simple example from [20, Figure2], depicted here in Fig.1a. ere are three
genes with proteins A,B, and C as their final products. Denote their concentrations at
time t by [A](t),[B](t) and [C](t) respectively. e concentration of these proteins will
not remain constant all the time, and we have an Ordinary Differential Equation (ODE)
system describing the variation of the concentrations as time passes, see Fig.1b. Each
protein is degraded with a first-order kinetics with the reaction rate constants
kA,d,kB,d
and
kC,d
correspondingly. Protein A activates the expression of the second gene with
Michaelis-Menten kinetics with the maximum rate
kB,max
and the Michaelis constant
k1
B,A
. e third gene gets activated by both proteins A and B together with the product
of two Michaelis-Menten kinetics, with maximum rate
kC,max
and Michaelis constants
k1
C,A
and
k1
C,B
. e first gene gets expressed by the rate
kA,max
in the absence of protein C,
and protein C has an inhibitory effect on the expression of the first gene, captured by the
denominator
(1+kA,C[C](t))
in the rate expression.
A solution to the system
d[X
i
](t)
dt =0
(where the
Xi
s are A,B and C) is called an equilib-
rium of the ODE system. Since the concentration of the proteins can only be non-nega-
tive real numbers, the complex or negative real solutions are not relevant. Sometimes we
may only consider the positive solutions, for example, if a total consumption of a protein
is not possible or of no interest. erefore by steady states we mean positive solutions to
the system of equations
d[X
i
](t)
dt =0
. e equations in this system are called the steady
state equations.
Now we are ready to define a reaction network formally. A reaction network, or a net-
work for short, is an ordered pair,
N=(S
,
R)
where
S
and
R
are two finite sets called
the set of species and the set of reactions. In our example,
S={A,B,C}
and
R
contains
six reactions: three gene expressions and three protein degradations. To each network,
an ODE is attached with concentration of the species as its variables and the constants
of the reaction rate expressions as its parameters. In our example, we have 3 suchvari-
ables and 10 parameters. To fix the notation assume
S={X1,...,Xn}
and that there are
r constants involved in the reaction rate expressions. en we use
xi
instead of
[Xi](t)
Gene 1 Gene 2 Gene 3
(a)
d
[
A
](
t
)
dt =kA,max
·
1
1+kA,C ·[C](t)
kA,d ·[A](t)
d[B](t)
dt =kB,max
·
kB,A·[A](t)
1+kB,A·[A](t)
kB,d·[B](t)
d[C](t)
dt =kC,max
·
kC,A·[A](t)
1+kC,A·[A](t)
·
kC,B·[B](t)
1+kC,B·[B](t)
kC,d·[C](t)
(b)
Fig. 1 A regulatory network of 3 genes [20, Figure 2]. a This graph shows the relations between expressions
of the genes. We denote by an inhibitory relation and by
a positive relation. b The system of ordinary
differential equations for the network
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 5 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
and
ki
for the i-th parameter. Denote by
fi
,
k(x)
the i-th steady state equation where
x=(x1,...,xn)
and
k=(k1,...,kr)
.
A network with an inflow (injection) or an outflow (extraction or degradation) for at
least one of its species is called an open network. e network in Fig.1a is an open net-
work because of the presence of the degradation reactions. A network can also be fully
or partially conserved.
Consider the simple single reaction network depicted in Fig.2a. e system of its ODE
equations is given in Fig. 2b. Because
˙
x1+2˙
x3=0
, the linear combination
x1+2x3
should be constant with respect to the time. erefore there exists a positive constant
T1
such that the relation
x1+2x3=T1
holds. Similarly there exist two other positive con-
stants
T2
and
T3
such that
x1+2x4=T2
and
x2+2x3=T3
. e values of
T1
,
T2
and
T3
can be determined by the initial conditions of the ODE system. ese linear invariants
imply that three of the steady state equations are linearly redundant and can be replaced
by these three linear invariants which are called conservation laws in CRN theory. e
linear subspace determined by the conservation laws is called the stoichiometric compat-
ibility class. For a more detailed definition of conservation laws see Definition 1 in [19,
Chapter2]. One should note that the trajectories of the ODE system are confined to
stoichiometric compatibility classes. In this case, one only cares about the steady states
in one stoichiometric compatibility class.
Now we are ready to define the main concept of interest, multistationarity.
Definition 2.1 Consider a network with n species and replace redundant steady state
equations by conservation laws if there exist any. Let k stands for the vector of constants
of both the reaction rates and conservation laws and be of the size r. A network is called
multistationary over
BRr
if there exists a
kB
such that
fk(x)=0
has more than
one solution in
Rn
>0
.
Remark 2.2

i) One may also consider non-linear invariants such as first integrals as defined in [21,
Definition 11].
ii) Note that we are not concerned with the choice of the kinetics such as mass-action,
Michaelis-Menten, Hill function, power-law kinetics and S-systems, or the form of
2O
2+2H+k
O2+H
2O2
(a)
dx1
dt =2kx2
1x2
2,dx3
dt =kx2
1x2
2
dx2
dt =2kx2
1x2
2,dx4
dt =kx2
1x2
2
(b)
Fig. 2 A simple example of a closed network consisting of one reaction. a Two molecules of superoxide and
two hydron atoms react to each other and produce one molecule of dioxygen and a molecule of hydrogen
perixide. The reaction rate here follows the mass-action kinetics with the reaction rate constant k. b The
system of ODE equations of the network. The concentrations of
O2
,
H+
,
O2
and
H2O2
are denoted by
x1
,
x2,x3
and
x4
respectively
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 6 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
the steady state equations such as polynomial or rational functions. erefore the
results of this paper will remain valid and practical for a general reaction network.
From here on the word parameters also includes the constants of the conservation
laws in addition to the reaction rate constants. To answer the question of whether a net-
work is multistationary or not one can use one of many algorithms available in the litera-
ture, see [15, 7, 9] for a few examples. However, to partition the parameter space into
two subsets, one consisting of the choices of parameters for which
fk(x)
has more than
one solution and the other comprising those parameter choices for which
fk(x)
has at
most one solution, is a more laborious task which we tackle in this paper.
Definition 2.3 Consider a reaction network with the setting and notation of Defini-
tion2.1. e set
is called the multistationarity region of the network.
e region B in Definition2.3 represents the regions of scientific interest. It will usually
be a hyperrectangle made by the inequality restrictions of the form
ki,min <ki<ki,max
for the parameters. is is because, for example the rate of expression of a gene can not
be any arbitrary positive number but must be limited; or the constant of conservation
laws may be limited from above due to the limitation of the materials in the lab.
Prior state‑of‑the‑art forparametric systems ofequations
Let
fk:RnRm
be a parametric function with
BRr
as its parameter region and u a
point in
Rm
. For each choice of the parameters
kB
, the system
fk
(x)=u
is a non-
parametric system of equations. One can solve this system and look at the cardinality of
the solution set. For different choices of
k
, this number can be different. erefore we
define a new function
u
f:BZ
0
{∞}
sending
kB
to
#
L
u
(f)
, i.e. the size of the
level set of
fk
(the set of points in
Rn
which
fk
maps to u). Now one can partition B into
the union of level sets of the map
u
f
. For a general form of
fk(x)
, finding
L
i
(�u
f)
is a hard
question.
CAD withrespect todiscriminant variety
In the case where f
k
(x)
R
(k)
[
x
]m
and A and B are semi-algebraic sets there are a
variety of tools which can be employed, see for example [22]. In the literature, the
approach used most commonly (e.g. [10, 23, 24]) is a Cylindrical Algebraic Decom-
position (CAD) computed with respect to the discriminant variety. For a full descrip-
tion of this technique we refer the reader to [25, 26] or the short sketch of the main
idea in [24, section3]. Briefly: the discriminant variety of the system
fk(x)
with the
domain and codomain restrictions on the semi-algebraic sets A and B is the solution
set to a new set of (non-parametric) polynomial equations with k as its indetermi-
nants. This new set of polynomials can be computed algorithmically for example
using Gröbner bases and elimination theory. Then CAD with respect to the discri-
minant variety decomposes B into a finite number of connected semi-algebraic sets
{
k
B
|
#
f
1
k
(0)
R
n
>0
2
}
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 7 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
called cells. Each cell has intersection with only one
L
i(�
u
f)
and therefore
L
i(�
u
f)
can
be expressed as union of a finite number of cells with an exact description of their
boundaries.
As we see later in this section, in many cases one is only interested in open
cells (i.e. only those cells which have full dimension [27]). A Maple package,
RootFinding[Parametric] has implemented an algorithm to compute the
open CAD with respect to the discriminant variety of a system of parametric poly-
nomial equations and inequalities [28]. From here on in this paper by CAD we mean
such an open CAD with respect to the discriminant variety.
Both the computation of the discriminant variety and the subsequent decompo-
sition involve the use of algorithms with doubly exponential complexity which can
cause problems. The number of cells in the decomposition will grow doubly expo-
nentially in the number of parameters of
fk(x)
[29]; and even computation of the
discriminant variety itself before any decomposition can be infeasible for moderate
examples, see e.g. [24]. This makes CAD impractical for studying parametric sys-
tems of polynomial equations with more than a few variables and parameters.
Approximation bysampling
Another approach adopted by scientists is to solve the system
fk
(
x
)
=u
for many
different choices of
kB
[1215].
Mathematically speaking, this means that B is replaced by a finite set. Then each
L
i
(�u
f)
is expressed as a subset of this finite set. This approach hereafter is referred
as the sampling representation approach. In contrast with the CAD approach which
provides an exact description of
L
i
(�u
f)
, the sampling representation approach pro-
vides an approximation. Note that there are different ways to choose the sample
parameter points for the sampling representation. One way is to arrange all points
equally distanced like a grid, and another way is to randomly sample from a distribu-
tion such as the uniform distribution on B, which is the one used in this paper. For
an example of a case where a sampling representation with grid-like parameter sam-
pling is used see [10, Figures7–12].
Since we are motivated from the application, we should note that in a lab, it is usu-
ally not possible to design the experiment so that the parameter values are exactly
the numbers that we decide. Therefore when the experiment is designed to have
k=k
, what happens is that k is a point in a neighborhood of
k
and not necessarily
k
itself. This can happen for example because of errors coming from the measure-
ment tools or the noise from the environment. In such cases picking a point close to
the boundaries of
L
i(�
u
f)
could lead to a different result than what the experimental-
ist expects, if errors or noise push it over the boundary.
Rectangular representation
A different discretization can be done using a rectangular division of B. For example if
B is a hyperrectangle [a,b] then a grid on B is achieved by dividing B along each axis to
equal parts. en for each sub-hyperrectangle of B in this rectangular division we assign
the average of the number of solutions of
fk(x)=u
for several choices of k coming from
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 8 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
the sub-hyperrectangle. is approach hereafter is referred as the rectangular represen-
tation approach. See Fig.4 to compare the three approaches visually.
Example
Consider the gene regulatory network in [30, Figure3B], depicted here in Fig.3a with
the ODE system in Fig.3b. is network has one conservation law,
x1+x4=k8
. ere-
fore we consider the system of equations obtained by the first three steady state equa-
tions in the ODE system and the conservation law to study the multistationarity of this
network. For illustration purpose we fix values of all parameters other than two so we
can plot the multistationarity region in 2 dimensions. In [30, Figure4] the reaction rate
constants other than
k3
were fixed to the values listed below.
We fix the values of all parameters other than
k3
and
k8
to these values also.
(1)
k1
=
2.81, k2
=
1, k4
=
0.98, k5
=
2.76, k6
=
1.55, k7
=
46.9.
Xk1
X+P
Pk2
0
2Pk3
k4
PP
X+PP k5
k6
XPP
XPPk7
XPP+P
(a)
dx1
dt =k5x1x3+k6x4,
dx2
dt =k1x1k2x22k3x2
2+2k4x3+k7x4,
dx3
dt =k3x2
2k4x3k5x1x3+k6x4,
dx4
dt =k5x1x3k6x4
(b)
Fig. 3 A bistable autoregulatory motif presented in [30, Figure 3B]. a X is a gene, P is a protein that can form a
dimer PP and then binding to X. The gene X will get expressed and produce P in both forms X and XPP. Finally
there is a degradation of P. b The ODE system of the gene regulatory network in a. The variables
x1,x2,x3
and
x4
are standing for the concentration of the species X, P, PP and XPP respectively
Fig. 4 Three representations of the multistationarity region of the network in Fig. 3a after fixing all parameter
values other than
k3
and
k8
to the values in (1). a CAD gives the exact boundary of
L
1(�
0
f)
and
L
3(�
0
f)
. The
first one is colored by sky blue and the later one with yellow. b A sampling representation of the parameter
region
B=[(0.0005, 0),(0.001, 2)]
by 1000 random points sampled from uniform distribution on B. 78 of
these points belong to
L
3(�
0
f)
and are colored yellow. The other 922 points belong to
L
1(�
0
f)
and are colored
by sky blue. c A rectangular representation of B. Each subrectangle is colored with respect to the average
number of solutions for 10 random points sampled from the uniform distribution on the subrectangle. The
color bar of the figure is in the right side
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 9 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
Let B be the rectangle made by the constraints
0.0005 <k3<0.001
and
0<k8<2
.
Using the RootFinding[Parametric] package of Maple we get the exact
description of the multistationarity region of the network, in 0.12 s, depicted in
Fig.4a.
A sampling representation of the multistationarity region is found by solving the
system of the equations for 1000 points
(k3,k8)
sampled from the uniform distribu-
tion on [(0.0005,0),(0.001,2)]. We used the vpasolve command from Matlab to
solve the system numerically. e Matlab code to generate this sampling representa-
tion took 154s to run, with the output visualised in Fig.4b.
A rectangular representation is given by dividing [(0.0005, 0), (0.001, 2)] to 100
equal sub-rectangles and then solving the system for 10 points
(k3,k8)
sampled from
the uniform distribution on each sub-rectangles. e sub-rectangles are colored with
respect to the average number of solutions. is computation also was done by Mat-
lab and took 166s, with the output visualised in Fig.4c.
Polynomial superlevel set representation
Superlevel sets
Definition 4.1 Consider
f:RnR
, an arbitrary function. For a given
uR
a super-
level set of f is the set of the form
When
u=1
we drop the index and write only U(f). Naturally, a polynomial superlevel set
is a superlevel set of a polynomial.
Polynomial sublevel sets are defined similarly as in Definition4.1 with the only differ-
ence the direction of the inequality. However, in this paper, we only focus on super-
level sets. For
dZ0
let
Pd
denote the set of polynomials of total degree at most d. A
sum of squares (SOS) polynomial of degree 2d is a polynomial
pP2d
such that there
exist
p1,...,pmPd
so that p
=m
i=1
p
2
i
. We denote the set of SOS polynomials of
degree at most 2d by
2d
.
eorem4.2 ([17, eorem2]) Let
BRn
be a compact set and K a closed subset of B.
For
dN
define
en there exists a polynomial
pdSd
such that
Furthermore
limd→∞ Vol(U(pd)K)=0
.
Given a pair (B,K) where
BRn
is a compact set and
KB
a closed set, and given
dN
; we call the polynomial superlevel set U(p) (with p being the polynomial
pdSd
Uu(f)={xRn|f(x)u}.
Sd={pPd|p0 on B,p1 on K}.
B
pd(x)dx
=
inf
B
p(x)dx
|
p
Sd
.
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 10 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
found in eorem4.2) the PSS representation of
KB
of degree d. When K is a semi-
algebraic set, one can find
pd
numerically using a minimization problem subject to some
positivity constraints [18, Equation13].
Let
B=[aB,bB]
and
Ki=[aKi,bKi],i=1, ...,m
be some hyperrectangles in
Rn
such
that
K:= m
i=1KiB
. By solving a similar optimization problem it is possible to find the
PSS representation of
KB
. Let
dN
. e goal is to find the coefficients of a polyno-
mial of degree d such that
B
p(x)
dx
becomes minimum subject to some conditions.
Before presenting the constraints, let us look at the target function. A polynomial p(x) of
degree d can be written as
α
Nn
d
cαx
α
. Here
Nn
d
is the set of
α=
1,...,αn)
Zn
0
such that
n
i=1
α
id
. Now the integral can be simplified as below.
Since
B
x
αdx
are constant real numbers independent of the coefficients of the polyno-
mial, the target function is a linear function on the coefficients of p(x) which are the
variables of the optimization problem.
Now let us look at the constraints. First of all p(x) has to be nonnegative on B. is can
be enforced by letting
where
r=⌊
d
2
the largest integer less than or equal to
d
2
. Secondly we need
p(x)1
on
K or in other words
p(x)10
on K. is holds if and only if
p(x)10
on each
Ki
.
erefore for every
i=1, ...,m
one more constraint of the shape (2) has to be added:
Recall Definition2.3: the multistationarity region of a network is in fact a superlevel set,
U
2(�
0
f)
. e goal is to find a PSS representation of the set
U
2(�
0
f)
. One way to accom-
plish this goal is to find a rectangular representation of the multistationarity region and
then solve the above mentioned SOS optimization problem. e next example illustrates
this idea. To tackle it we use a Matlab toolbox called YALMIP [31, 32] which can receive
an SOS optimization problem, process it and use other solvers to solve it. For the solver
to be used by YALMIP, we chose SeDuMi [33].
Examples
We continue with the example fromFig.3. Consider the rectangular representation of the
multistationarity region of that example given in Fig.4c. To find the PSS representation of
B
p(x)dx =
B
αNn
d
cαxαdx
=
αNn
d
cαB
xαdx
=
α
Nn
d
B
xαdxcα
.
(2)
p(x)
n
j
=1
sB,j(x)xjaB,jbB,jxj2r,sB,j2r2,j=1, ...,n
,
p(x)1
n
j
=1
sKi,j(x)xjaKi,jbKi,jxj2r,sKi,j2r2,j=1, ...,n
.
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 11 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
this set, we let
B=[(0.0005, 0),(0.0010, 2)]
and K be the union of rectangles
Ki
s such that
their associated number is greater than or equal to 2. From the 100 sub-rectangles of B, 9 of
them satisfy this condition. ese sub-rectangles are colored orange in Fig.5. We use the
YALMIP and SeDuMi packages of Matlab to solve the SOS optimization discussed before
this example. To report the computation time we add the two times reported in the output of
YALMIP: the “yalmiptime” and “solvertime. It takes about 2s to get the coefficients
of the polynomial p of the PSS representation of degree 2. Figure5 shows the plot of U(p).
Unfortunately the same code does not produce a better approximation when we increase
d, the degree of p, from 2 to 4, 8 or 16. e output from Matlab gives similar figures in these
cases as Fig.5.
Consider another gene regulatory example from [19, Chapter2]. To avoid lengthening
the text, we only reproduce the system of equations needed to study the multistationarity of
the network:
We fix all parameters other than
k7
and
k8
to the following values coming from Equation
(2.10) of [19, Chapter2]:
(3)
k
1
x
7
x
5
k
5
x
1
=0k
2
x
8
x
6
k
6
x
2
=0
k
3x1k7x3=0k4x2k8x4=0
k
9x7x4k11x9=0k10 x8x3k12x10 =0
k
13x9x4k15 x11 =0k14 x10x3k16 x12 =
0
x
5=k17 x6=k18
x7+x9+x11 =k19 x8+x10 +x12 =k20
.
(4)
k
1
=k
2
=k
3
=k
4
=1, k
5
=0.0082, k
6
=0.0149,
k
9=k10 =0.01, k11 =k12 =10000, k13 =2,
k
14 =25, k15 =1, k16 =9, k17 =k18 =k19 =1,
k20 =
4.
Fig. 5 The PSS approximation of the multistationarity region for the network in Fig. 3a of degree 2 obtained
by the information of Fig. 4c. The union of orange colored subrectangles is considered as the initial
approximation of the multistationarity region obtained by the rectangular representation and chosen as the
set K. The yellow colored area is the difference of
U(p)K
. Remember that the PSS representation of the
multistationarity region is the yellow region which contains the orange rectangles as well
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 12 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
We reproduced the rectangular representation of the multistationarity region of this
network by Matlab, as shown in Fig.6a. From the 100 sub-rectangles in total, for 28 of
them the average number of steady states is greater than or equal to 2. Using YALMIP
and SeDuMi it took between 1 and 2s to get the polynomials of the PSS representations
of degrees 2, 4 and 8 represented in Fig.6b–d respectively. For this example, the PSS
approximation of degree 4 looks different than that of degree 2, but for degree 8 the plot
looks similar to degree 4.
Advantages ofPSS representation overrectangular representation
It is natural to ask why one should find a PSS representation of the multistationar-
ity region using the rectangular representation given one already has the rectangular
representation? Let
BRr
be the parameter region of the form of a hyperrectangle,
and
KB
be the multistationarity region. In the rectangular representation we have
K≃∪
m
i=1Ki
where
Ki=[aKi,bKi]
are hyperrectangles. In the PSS representation we
have
KU(p)
where p is a polynomial of degree d.
Fig. 6 PSS representations of different degrees for the mulistationarity region of the LacI-TetR gene
regulatory network using the information we got from the rectangular representation. The union of orange
colored subrectangles is considered as the approximation of the multistationarity region obtained by the
rectangular representation and chosen as the set K. The yellow colored area is the difference of
U(p)K
. a
The rectangular representation of multistationarity region of the network with the system of equations given
in (3) and some parameters being fixed by the values in (4). bd PSS representations of the multistationarity
region of degrees 2, 4 and 8 respectively obtained by the information of Fig. 6a
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 13 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
1- When
r4
, plotting K is impossible. In order to save or show the rectangular repre-
sentation one needs to use a matrix of size
(m)×(2r)
, where each row stands for one
Ki
and the first r columns have the coordinates of the point
aKi
and the second r col-
umns correspond to the coordinates of the point
bKi
. However for the PSS represen-
tation one needs to use only a vector of size
r+d
r
=
d
i
=
0
r1+i
r
1
, where
r1+i
r
1
entries are coefficients of the terms of degree i. e terms are ordered
from smaller total degree to larger and for the terms of the same total degree we use
the lexicographic order.
2- To test if a point
kB
belongs to K using the rectangular representation one should
check m conditions of the form
kKi
which means verifying an inequality on each
coordinate of the point, i.e.
a
K
i
,j
k
j
bK
i
,
j
. If one of the conditions
kKi
is posi-
tive, then there is no need to check the rest, otherwise all should fail to conclude that
k K
. However, using the PSS representation one needs to check only one condi-
tion of an evaluation form, i.e.
p(k)1
.
3- Recall from the last paragraph ofthe “Approximation by sampling” Section explain-
ing that parameters near the boundary of the multistationarity region are not suita-
ble choices for an experimentalist. To check the distance of a point
kB\K
to the
boundaries of K using the rectangular representation one should find distance of
k
from boundaries of each
Ki
and then taking the minimum. However, using the PSS
representation, in both cases of
kB\K
or
kK
, one just needs to find the dis-
tance of
k
from the algebraic set defined by
p(k)1=0
, for example by Lagrange
multipliers, as in the next section.
To conclude, if
r+d
r
is considerably smaller than 2mr, then storing the PSS repre-
sentation instead of the initial rectangular representation will save memory without
loosing information about the multistationarity region.
Approximating thedistance ofparameter point fromtheboundary
To illustrate how to approximate distance of a parameter point from the boundaries of
the multistationarity region using a PSS representation we continue with the example
fromFig.3.
Let p be the polynomial of degree 2 in two variables
k7
and
k8
corresponding to U(p)
in Fig.6b. We will approximate distance of the point
k=(0.08, 0.02)
from the bound-
ary of the multistationarity region by the distance of
k
from the algebraic set defined
by
p(k7,k8)1=0
. is question is equivalent to minimizing the Euclidean distance
function of a point
(k7,k8)
from the point
k
subject to the constraint
(k7,k8
)
L1
(
p)
.
e target function is
(k7
0.08
)
2
+(k8
0.02
)
2 which gets minimized if and only
if
(k70.08)2+(k80.02)2
gets minimized. An elementary way to solve this minimi-
zation problem is to use the method of Lagrangian multipliers [34, Chapter7, eo-
rem1.13]. Define
F
(k7,k8,)=(k70.08)
2
+(k80.02)
2
+
p(k7,k8)1
.
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 14 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
Now we must find the critical points of
F(k7,k8,)
. So we should solve the system of
equations obtained by
F
k7
=
F
k8
=
F
=0
. It takes 0.167s to solve this system of equa-
tions by the solve command in Maple. It has 4 solutions, from which 2 belong to the
rectangle
B=[(0, 0),(0.1, 0.1)]
and the minimum of the target function is obtained
at the point (0.04499222669, 0.04161251428). e distance of this point from
k
is
0.04114176669.
Constructing aPSS representation fromasampling representation
It is not necessary to have a rectangular representation to get the PSS representation.
Let
B=[aB,bB]
be a hyperrectangle and
K={a(1)
,
...
,
a(m)}
a finite set. Let
dN
,
and the goal be to find the coefficients of a polynomial of degree d such that
B
p(x)
dx
becomes minimum subject to some conditions. We already saw that the target func-
tion is linear. e constraint
p1
on K can be enforced by
p(a(i))1
for every i,
which are linear constraints. e positivity of p on B can be enforced by Eq.(2) or by
adding a large enough number of random points from B and putting the constraint
p(a)>0
. e later idea makes the problem solvable by any common linear program-
ming tool. However, here we still use Eq.(2).
Let us illustrate this with our ongoing example. Consider the sampling representa-
tion of the multistationarity region of the network of Example3.4 given at Fig.4b.
To find the PSS representation of this set, we let
B=[(0.0005, 0),(0.001, 2)]
and K to
be the set of points for which the system
fk(x)=0
had more than one positive solu-
tion. ere are 1000 points from which 78 of them are parameter choices where the
network has three steady states. Using the YALMIP package of Matlab, it takes less
than a second to get the coefficients of each of the polynomials p of the PSS represen-
tation of degrees 2, 6 and 10. Figure7a–c show the plots of U(p) for degrees 2, 6 and
10 respectively. e plot for degree 6 actually looks worse than the plot for degree 2
(further away from the actual result in Fig.4a), although the one for degree 10 looks
a little better. In all these cases YALMIP finished the computations with a message
‘Numerical problems (SeDuMi)’ indicating that the solver found the prob-
lem to be numerically ill-posed. Rescaling the parameter region of interest, B, to
[(0,0),(1,1)] and then transforming the PSS polynomial back to the original B allows
a better PSS approximations via YALMIP. For degrees 2 and 6 the numerical problem
message is avoided but for degree 10 it remains. e results are shown in Fig.7d–f.
To demonstrate that the number of free parameters need not be 2 to be able to com-
pute the PSS representation, we repeated the above process with all 8 parameters of the
system being free in the following hyperrectangle:
Solving the system at 1000 random points uniformly chosen from B takes the same
amount of time as solving the system at 1000 random points with only 2 of their
B=[(1, 0, 0.0005, 0, 1, 1, 40, 0),(4, 2, 0.001, 2, 4, 3, 50, 2)].
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 15 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
coordinates varying took time in the previous case. It took about 1.5s to compute the
45 coefficients of the polynomial of the PSS representation of degree 2. e polynomial
found is the following.
p=0.0232593410k
2
10.2965863774k
2
2+559.9760177000k
2
3
0.0618678914k2
4+0.0098179292k2
50.0061375489k2
6
0.0036871825k2
70.0716829116k2
80.0110701085k1k2
5.3373104650k1k3+0.0186292240k1k4+12.5539028600k2k
3
+0.0051935911k1k50.0820767495k2k4+0.0250525965k1k6
0.0649618436k2k58.2165668850k3k4+0.0092355627k1k7
0.0310047257k2k64.2926137530k3k50.0527315151k1k8
+0.0017547696k2k72.7647028150k3k6+0.0288291263k4k5
+0.0785940967k2k80.4049844472k3k7+0.0536360086k4k6
+12.6014929300k3k80.0091147734k4k7+0.0198800552k5k
6
0.0567174819k4k8+0.0009448114k5k70.0190068615k5k8
+0.0102920949k6k7+0.0316830973k6k8+0.0022468043k7k8
0.6043381476k1+0.7152344469k2+0.3287484744k4
+36.3811699100k30.0617318336k50.5940226892k6
+
0.2987521874
k7+
0.2558883329
k8
5.0441247030.
Fig. 7 PSS representation of different degrees of the mulistationarity region of the network of Example 3.4
inside the hyperrectangle
B=[(0.0005, 0),(0.001, 2)]
using the information we got from the sampling
representation of the multistationairy region. The orange colored points are the points with three steady
states and their union is considered as approximation of K. The yellow colored area is the difference of
U(p)K
. One expects to see that this difference is getting smaller as the degree increases. However, the
Matlab code that we wrote using YALMIP and SeDuMi does not behave as expected. ac gives the PSS
representation of the original problem of degrees 2, 6 and 10 respectively. df gives the PSS representation of
those degrees for the problem after after rescaling the parameters for better numerical behavior via YALMIP
and SeDuMi
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 16 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
Advantages ofPSS representation overthesampling representation
Let us addresswhy oneshould find a PSS representation of the multistationarity region
using the sampling representation if one already has a sampling representation. Let
BRr
be the parameter region of the form of a hyperrectangle,
KB
be the multista-
tionarity region. In the sampling representation we have
{a(1),...,a(m)}⊆K
. In the PSS
representation we have
KU(p)
where p is a polynomial of degree d.
1- When
r4
, plotting K is impossible. In order to save or show the sampling repre-
sentation one needs to use a matrix of the size
m×r
, where each row stands for one
point
a(i)
and the columns correspond to the coordinates of the points. However, for
the PSS representation one needs to use a vector of the size
r+d
r
, as explained
inthe first item of the “Advantages of PSS representation over rectangular represen-
tation” section1.
2- To test if a point
kB
belongs to K using the sampling representation is not a
straightforward task. However, using the PSS representation one needs to verify only
one condition of the evaluation form,
p(k)1
.
3- To compute the distance of a point
kB
to the boundaries of K, using the sampling
representation, if
k K
, one should compute the distance of
k
from each point in
the sampling representation of K and then take the minimum. Using the PSS repre-
sentation, whether
k K
or not, one just needs to find the distance of
k
from the
algebraic set defined by
p(k)1=0
.
In a typical example from CRN theory, r is usually much higher than 2, and therefore
item 1 is really important. When
r+d
r
is lower than rm, one can use less memory
by saving the PSS representation instead of keeping all the points of the sampling rep-
resentation in the memory. Further, as items 2 and 3 show, this will not cause a loss of
information about the multistationarity region.
More involved example
We showed earlierthat the PSS representation can be generated for examples with
a higher number of parameters than two. ere, we let all 8 parameters of the net-
work in Fig.3a to be free and found the PSS representation of degree two in 8 inde-
terminants. Now we bring another such example which also serves to emphasize
Remark2.2 item (ii): that to have a PSS representation of the multistationarity region,
one does not need to have the right hand side functions of the ODE system to be of
polynomial or even rational functions.
Consider a gene expression system with 4 species
Xi
,
i=1, ...,4
where these species
can be m-RNA or protein molecules or other relevant factors, with the ODE system
as in Fig.8a which was introduced in [35, Figure2]. As one can see the right hand side
functions involve at least a square root, and as a result this system is not polynomial, or
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
Page 17 of 26
Sadeghimaneshand England BMC Bioinformatics (2022) 23:391
even defined by rational functions. Let us fix the parameter values other than the three
degradation rates
βi
,
i=2, 3, 4
to the following values, chosen the same as in [35]:
In [35] it is shown that for
2,β3,β4)=(5,