ArticlePDF Available

Abstract and Figures

In this paper, we compared the efficacy of observation based modeling approach using a genetic algorithm with the regular statistical analysis as an alternative methodology in plant research. Preliminary experimental data on in vitro rooting was taken for this study with an aim to understand the effect of charcoal and naphthalene acetic acid (NAA) on successful rooting and also to optimize the two variables for maximum result. Observation-based modelling, as well as traditional approach, could identify NAA as a critical factor in rooting of the plantlets under the experimental conditions employed. Symbolic regression analysis using the software deployed here optimised the treatments studied and was successful in identifying the complex non-linear interaction among the variables, with minimalistic preliminary data. The presence of charcoal in the culture medium has a significant impact on root generation by reducing basal callus mass formation. Such an approach is advantageous for establishing in vitro culture protocols as these models will have significant potential for saving time and expenditure in plant tissue culture laboratories, and it further reduces the need for specialised background.
This content is subject to copyright.
RESEARCH ARTICLE
Genetic programming based models in plant
tissue culture: An addendum to traditional
statistical approach
Meenu R. Mridula
1
*, Ashalatha S. Nair
1
, K. Satheesh Kumar
2
1Department of Botany, University of Kerala, Kariavattom, Thiruvananthapuram, Kerala, India,
2Department of Future Studies, University of Kerala, Kariavattom, Thiruvananthapuram, Kerala, India
These authors contributed equally to this work.
*m.r.meenakshi@gmail.com
Abstract
In this paper, we compared the efficacy of observation based modeling approach using a
genetic algorithm with the regular statistical analysis as an alternative methodology in plant
research. Preliminary experimental data on in vitro rooting was taken for this study with an aim
to understand the effect of charcoal and naphthalene acetic acid (NAA) on successful rooting
and also to optimize the two variables for maximum result. Observation-based modelling, as
well as traditional approach, could identify NAA as a critical factor in rooting of the plantlets
under the experimental conditions employed. Symbolic regression analysis using the software
deployed here optimised the treatments studied and was successful in identifying the complex
non-linear interaction among the variables, with minimalistic preliminary data. The presence of
charcoal in the culture medium has a significant impact on root generation by reducing basal
callus mass formation. Such an approach is advantageous for establishing in vitro culture pro-
tocols as these models will have significant potential for saving time and expenditure in plant tis-
sue culture laboratories, and it further reduces the need for specialised background.
Author summary
Trials to find out the best combination of factors that contribute to the desired response
takes up the chunk of time and resources in any plant tissue culture laboratory. The out-
put of such experiments is analysed statistically to come to a conclusion. However, with-
out prior statistical modifications, the results could be misleading. Recent reports from
several labs point out the use of artificial neural networks to circumvent this. We have
chosen to use a computational process that can predict the best combination of factors for
the desired response after randomly testing the higher and lower limit of the factors with
experiments. The magnitude of the desired response can be presumed at any concentra-
tion within this range using the models generated by symbolic regression. The procedure
provides both optimum model function as well as the optimum variable values in the
model. The variable sensitivity and percentage response add depth to the information
thus obtained. The study indicated that these models would have significant potential for
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 1 / 13
a1111111111
a1111111111
a1111111111
a1111111111
a1111111111
OPEN ACCESS
Citation: Mridula MR, Nair AS, Kumar KS (2018)
Genetic programming based models in plant tissue
culture: An addendum to traditional statistical
approach. PLoS Comput Biol 14(2): e1005976.
https://doi.org/10.1371/journal.pcbi.1005976
Editor: Jean-Baptiste Durand, Laboratoire Jean
Kuntzmann, FRANCE
Received: February 23, 2017
Accepted: January 15, 2018
Published: February 27, 2018
Copyright: ©2018 Mridula et al. This is an open
access article distributed under the terms of the
Creative Commons Attribution License, which
permits unrestricted use, distribution, and
reproduction in any medium, provided the original
author and source are credited.
Data Availability Statement: All relevant data are
within the paper and its Supporting Information
files. Data has also been uploaded to figshare (DOI:
10.6084/m9.figshare.5844765).
Funding: The first author (MRM) would like to
acknowledge the financial assistance from Council
of Scientific and Industrial Research, New Delhi
through Junior Research Fellowship scheme (No.
09/102/RENEWAL/2014-EMR-1). The funders had
no role in study design, data collection and
analysis, decision to publish, or preparation of the
manuscript.
saving time and expenditure in plant tissue culture laboratories for the commercial estab-
lishment of in vitro protocols.
Introduction
Relatively more straightforward and efficient empirical modeling techniques based on input-
output models are gaining popularity to conventional statistical methods across various disci-
plines [1]. This surge is due to its relative ease of use and understanding. Genetic programming
(GP) is an approach which uses the concept of biological evolution to handle a problem with
many fluctuating variables. Computational optimisation techniques have recently debuted in
plant tissue culture research as studied in neural networks models [2]. Symbolic regression was
one of the earliest applications of GP and continues to be widely considered [3]. A broad array
of scientific fields like Biology, Chemistry, Environmental Science, Neurology and Psychology
reports the use of symbolic regression [49]. However, plant tissue culture data has not yet been
analysed using symbolic regression. The data generated from plant tissue culture experiments
includes continuous, count, binomial or multinomial and predominantly the information is val-
idated using analysis of variance method (ANOVA) [2,10]. ANOVA is adequate for normally
distributed continuous data; but without prior manipulation, it is erroneous to analyze count,
binomial or multinomial data [11]. Neuro-fuzzy logic is the standard practice by which compu-
tational modeling is achieved in plant tissue culture [2,12]. In this context, genetic algorithm
based symbolic regression remains unevaluated. Unlike conventional regression analysis which
optimises parameters for a pre-defined model, symbolic regression avoids imposing any apriori
assumptions. In generalised linear model (GLM) regression, the dependent variable is repre-
sented as linear combination of the given set of basic functions and optimise the coefficients to
fit the data. However, symbolic regression searches for both a set of basic functions and coeffi-
cients. The added value of symbolic regression, compared to GLM, lies in its ability to quickly
and accurately find an optimal set of basic functions [13,14]. The algorithm infers the model
from the data by combining variables and mathematical operators and generates an empirical
formula which is a mathematical equation that predicts observed results derived from con-
ducted experiments. GP combines previous equations and forms new ones. Thus it produces
models with interpretable structure, relating to input and output variables from a data set with-
out pre-processing and identifying critical parameters and hence shed insight into the underly-
ing processes involved in a given system [15]. Symbolic regression can recognise and model
complex non-linear relationships between the inputs and outputs of biological processes even
in the presence of disturbances and potential for parallel processing. The preliminary data gen-
erated from experiments during rooting of in vitro regenerated plantlets in Wrightia tinctoria
was employed to study the utility of symbolic regression to analyze plant tissue culture data.
The effect of two variables - NAA and charcoal on root proliferation was considered. The data-
sets were subjected to usual statistical analysis as well as observation based modeling via sym-
bolic regression. Moreover, we aimed to optimise the process by examining the influencing
factors. We propound the use of symbolic regression-based model prediction as an addendum
to data analysis method for plant tissue culture experiments.
Materials and methods
Culture conditions
The genetic variability was kept minimum by using a single field grown ortet, thus minimising
statistical errors [16]. Nodal regions derived from the fresh flushes of growth from the ortet, two
Symbolic regression to plant tissue culture data: An addendum to routine approach
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 2 / 13
Competing interests: The authors have declared
that no competing interests exist.
weeks after lopping one major branch served as the explants [17]. The nodal explants were condi-
tioned over a period of 4 months (subculture/four weeks) on MS medium (1962) [18], pH 5.8 and
2μM each of BAP and NAA for shoot multiplication. For rooting experiments individual shoots
were transferred on MS medium containing 2 μM BAP with NAA (2, 4 and 6 μM, respectively)
and charcoal (0.01, 0.03, 0.05, 0.07, 0.09 and 0.11%, respectively) in 250 ml culture flasks in 50 ml
of sterilized medium (pH 5.8). The cultures were maintained at 25±2˚C in a culture room with
40 μmolm
2
s
1
irradiances and a photoperiod of 8 hrs with 55±5% of relative humidity.
Experimental design
The plant tissue culture database, containing 21 conditions, followed a factorial design for two
variables- concentration of NAA (2, 4 and 6 μM) and charcoal (0,0.01, 0.03,0.05,0.07,0.09
and 0.11%) in the medium. Each treatment consisted of 5-7 explants in a culture flask with
three replicates. The subculture was done at the end of four weeks and five parameters were
recorded to analyze the effects of the variables on rooting such as basal callus diameter (mm)
(BC), the percentage of shoots rooted (R), the length of the longest root (cm) (RL), the number
of roots (NR) and the number of lateral roots (NLR) (S1).
Statistical analysis
All experiments were conducted using Randomised Block Design (RBD). Continuous data were
analysed using multiple linear regression in R and posthoc comparisons of pairs performed by
Tukey’s test (p>0.05). Count data were analysed using Poisson regression model. Pearson’s Chi-
squared test for count data was employed to access statistical significance of the variables.
Symbolic regression
Each of the observed parameters is modeled as a function of NAA and charcoal concentrations
using symbolic regression and GLM for comparison. To obtain a global optimum, we have also
modelled the combination (R+RL+NR+NRL-BC) by taking rooting factors together after nor-
malisation by employing both GLM and symbolic regression. The optimum model for each
case was generated by genetic programming based symbolic regression using the software pack-
age Eureqa (Version 0.98 beta) with 50% of the data randomly selected as training data, and
3-fold cross-validated with randomly selected 25% of the remaining data [1921]. Correspond-
ing to each symbolic regression model of the data partition, we have also obtained generalised
linear model by including x, y, xy, 1/x, 1/y, sin(x), cos(x), sin(y), cos(y), xy sin(x), xy cos(x), xy
sin(y), xy cos(y) into the set of basic functions and cross-validated similarly. The remaining 25%
the data was used for testing and reporting error [1921]. The Target expression used to gener-
ate the regression model was the minimal equation z = f(x, y) where ‘x’ corresponds to NAA
concentration and ‘y’ corresponds to charcoal concentrations, and ‘z’ represents each of the five
observed parameters and their combination. The models were based on the primary and trigo-
nometric building blocks, with the R
2
goodness of fit as the error metric [22,23]. Root Mean
Squared Error (RMSE) was calculated for the test data sets. Sensitivity represents the relative
impact of the variable on the parameter studied within this model and was calculated by the
local method using the partial derivatives [24]. Given a model equation of the form z = f(x, y),
the influence metrics of x on z was;
Sensitivity ¼@z
@x:sðxÞ
sðzÞ;evaluated at all input data points;
Symbolic regression to plant tissue culture data: An addendum to routine approach
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 3 / 13
The percentage positive was calculated as percentage of data points where sðxÞ
sðzÞ>0and per-
centage negative was calculated as percentage of data points where sðxÞ
sðzÞ<0; where @z
@xwas the
partial derivative of z with respect to x, σ(x) was the standard deviation of x in the input data, σ
(z) was the standard deviation of z, |x| denoted the absolute value of x, and
xdenoted the mean
of x [25]. The ‘fmin’ function in MATLAB (R2012b) was used to obtain the maximum value of
each of these functions.
Results and discussion
The average values obtained for the five growth parameters observed during the study were
given as the basal callus diameter (Table 1), the percentage of shoots rooted (Table 2), the
length of longest roots (Table 3) and the number of roots and the number of lateral roots
(Table 4). The miniscule alphabets within a column indicated the significant influence of char-
coal and majuscule alphabets in the row represented the significant interaction of NAA. The
shoots inoculated on MS medium with 0% charcoal (control) showed maximum basal callus
formation (Fig 1). The shoots inoculated on MS medium supplemented with 4μM NAA and
0.07% charcoal showed the maximum percentage of rooting (Fig 2).
Multiple linear regression demonstrated a significant effect of NAA and its interaction with
charcoal on basal callus (p>0.001), the percentage of shoots rooted (p>0.05) and root length
(p>0.01) (Tables 13). The individualistic effect of NAA for the number of roots and lateral
roots were found to be significant at p>0.05 and p>0.001 respectively (Table 4). The interac-
tion of NAA and charcoal was not significant for the same parameters studied. Mathematical
Table 1. Mean of basal callus diameter after four weeks of culture under the conditions studied. Means followed
by similar letters; a, b and c (within each column) and A and B (in each row) are not significantly different by Tukey’s
multirange test (p>0.05).
Basal callus
Diameter (mm)
Charcoal NAA
2 4 6
0 10.06 c A 11.40 d B 11.33 c B
0.01 2.25 b AB 2.40 c B 2. b A
0.03 1.61 a A 2.38 c B 1.43 a A
0.05 1.57 a A 1.54 b A 1.32 a A
0.07 1.32 a A 1.36 ab A 1.22 a A
0.09 1.34 a A 1.12 ab A 1.27 a A
0.1 1.44 a A 1.28 a A 1.24 a A
https://doi.org/10.1371/journal.pcbi.1005976.t001
Table 2. Mean of the percentage of shoots rooted after four weeks of culture under the conditions studied. Means
followed by similar letters; a, b and c (within each column) and A Band C (in each row) are not significantly different
by Tukey’s multirange test (p>0.05).
Percentage of
shoots rooted
Charcoal NAA
246
0 0 a A 0 a A 0 a A
0.01 0 a A 0 a A 0 a A
0.03 0 a A 10.76 a B 12.96 b B
0.05 43.9 c B 47.6 c B 12.8 b A
0.07 39.93 c B 50 c C 28 c A
0.09 27.43 b B 31.7 b B 9.4 ab A
0.1 28.3 b B 23.23 b B 0 a A
https://doi.org/10.1371/journal.pcbi.1005976.t002
Symbolic regression to plant tissue culture data: An addendum to routine approach
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 4 / 13
functions were successfully developed using symbolic regression to understand the correlation
between the two variables for each of the parameters considered and is contrasted with those
obtained by traditional regression models (Table 5). To analyze the effect of each of the vari-
ables on the parameter studied; variable sensitivity measures were calculated along with its
percentage impact. Its sensitivity denoted the relative impact within this model that a variable
has on the target variable. The individualistic effect of the two input variables on the output
parameter was pointed out as percentage positive or negative of that input variable (Table 6).
For the parameter basal calli diameter, the percentage positive value for variable ‘y’ was zero.
In other words, there was zero percent chance of basal calli mass increasing with increasing
concentration of charcoal; or that basal calli mass decrease with increasing concentration of
charcoal (Fig 3). The model predicted that increase in charcoal concentration had a conse-
quent increase in root length and root number in 50% of all the trials while the same promoted
rooting percentage and lateral root number in 75% of the trials. Root number and root length
decreased with increasing concentration of NAA in 100% of the trials. Rooting percentage
and lateral root numbers increased with increasing NAA concentration in 50% of all the trials.
Table 3. Mean of the length of longest roots after four weeks of culture under the conditions studied. Means fol-
lowed by similar letters; a, b and c (within each column) and A and B (in each row) are not significantly different by
Tukey’s multirange test (p>0.05).
Length of
longest
roots (mm)
Charcoal NAA
2 4 6
0 0 a A 0 a A 0 a A
0.01 0 a A 0 a A 0 a A
0.03 0 a A 7.88 b B 1.72 a A
0.05 26.4 c C 17.94 c B 6.9 ab A
0.07 41.78 d B 41.67 d B 14.4 b A
0.09 18.17 b B 10.1 b A 6.8 a A
0.1 7.38 a B 3.13 ab AB 0 a A
https://doi.org/10.1371/journal.pcbi.1005976.t003
Table 4. Mean of the number of roots and number of lateral roots after four weeks of culture under the conditions
studied. The row and the column variables are statistically significantly associated at p>0.05 (Number of roots) and
p>0.001(Number of lateral roots), by Pearson’s Chi-squared Test for Count Data.
Number of Roots
Charcoal NAA
2 4 6
0 0 0 0
0.01 0 0 0
0.03 0 0.4 0.6
0.05 5 2.4 1.2
0.07 1.5 1.4 0.8
0.09 1 1.13 0.8
0.1 0.8 0.6 0
Number of lateral Roots
0 0 0 0
0.01 0 0 0
0.03 0 0 0
0.05 0.4 0.43 0
0.07 5 0.6 1.1
0.09 0.6 2.7 2.2
0.1 0 1.96 0
https://doi.org/10.1371/journal.pcbi.1005976.t004
Symbolic regression to plant tissue culture data: An addendum to routine approach
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 5 / 13
The function obtained and the 3D plots thus generated could be used to predict the combi-
nations of input variables giving optimum results. The best response for rooting percentage
Fig 1. In vitro plants of W.tinctoria- Shoot inoculated in the absence of charcoal showing profuse basal callusing (left); against shoot inoculated in
the presence of charcoal showing reduced/no basal callus mass (right).
https://doi.org/10.1371/journal.pcbi.1005976.g001
Symbolic regression to plant tissue culture data: An addendum to routine approach
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 6 / 13
was predicted at 3.7 μM NAA and 0.08% charcoal (Fig 4). The root length showed a non-linear
pattern, and the highest value for its function was estimated with 2.8 μM NAA and 0.05%
charcoal (Fig 5). The maximum root number was determined for 1.7 μM NAA and 0.06%
charcoal (Fig 6). The maximum value of the function generated for lateral root number was
with 6.3 μM NAA and 0.08% charcoal (Fig 7). The global optimum modelled upon the com-
bination (R+RL+NR+NRL-BC) indicated the results as 2.44 μM NAA and 0.03% charcoal
(Fig 8).
Fig 2. Maximum rooting observed in the shoots inoculated on MS medium supplemented with 4 μM NAA and 0.07% charcoal.
https://doi.org/10.1371/journal.pcbi.1005976.g002
Symbolic regression to plant tissue culture data: An addendum to routine approach
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 7 / 13
The conclusion obtained by traditional statistics suggested that charcoal had a positive and
stimulatory effect in rooting of shoots by reducing basal callus (Table 1). Percentage of shoots
rooted and root length showed a significant impact with the combination of NAA and char-
coal (Tables 2and 3). In the present study, NAA has a significant effect on rooting as shown by
the number of roots and lateral roots (Table 4). Similar results were reported in Acacia leu-
cophloea and Cinnamomum verum [26,27]. With traditional statistics, we were not able to esti-
mate the combination/s of both variables in producing the best results or able to identify the
relative impact of a particular variable on the output parameter. Modeling of plant tissue cul-
ture data is practised using regression analysis where first an initial function is approximated
and the data fitted to that function to obtain the optimum parameters [11,28,29]. In this proce-
dure even when one gets the optimum parameter values, the model prediction was limited by
the probable wrong selection of the model function. In contrast, symbolic regression
Table 5. Functions and error matrix obtained through symbolic regression and multiple linear regression.
Parameter Symbolic regression Linear regression
Function R
2
RMSE Function R
2
RMSE
Basal callus
diameter (BC)
1.17 + 0.12/(0.01 + y) 0.98 0.09 739.5 x 51540 y + 73390 Sin(y) 0.56 Sin(x) -29450 Cos(y)
1190 xy + 1673 x Sin(y) 14130 y Sin(y) +739.3 x Cos(y) -21970 y
Cos(y) + 0.7174 y Sin(x) -350.2 xy Sin(y) -487.8 xy Cos(y)
+294660
0.98 0.56
Percentage of
shoots rooted (R)
4.43 x + 4738572.57 y
4
+ 88353.22 y
2
-
18.33 1270650.34 y
3
21.85 y x
2
0.73 0.94 70990 x + 619700 y -904900 Sin(y) + 0.294 Sin (x) + 278000 Cos
(y) -157200 xy + 229300 x Sin(y) + 137300 y Sin (y) -70990 x Cos
(y) + 285300 y Cos(y) -27.01 y Sin(x) -34940 x y Sin (y) -72090 x y
Cos(y) -278000
0.60 1.11
Length of longest
roots (mm) (RL)
2.28 -22.58 cos(40.39 y)/(1.25 x + x cos(5.06 y)
+ sin(1.84 - x))
0.75 1.28 46840 x + 530000 y -768300 Sin(y) -1.7 Sin(x) + 263300 Cos (y)
-98130 xy + 142600 x Sin(y) + 129200 y Sin(y) + 129200 y Sin(y)
-46840 + 238500 y Cos(y) -4.134 y Sin(x) -22990 x y Sin (y) -44490
x y Cos(y) -263300
0.85 7.85
Number of roots
(NR)
14.83 y
2
+ 14.78 y
2
sin(10.50 + 6375y) 4.80
y 1.92 x y -19.2 y sin(5.5 + 88.86y) 1.95 x y
sin(6.3-183.03y)
0.82 0.76 5984 x + 56560 y -83160 Sin(y) + 0.09042 Sin(x) + 22980 Cos(y)
-14110 xy + 20670 x Sin (y) + 11420 y Sin(y) -5984 x Cos (y)
+ 26590 y Cos(y) -0.3519 y Sin (x) -2959 x y SIn (y) -6560 x y Cos
(y) -22980
0.63 1.03
Number of lateral
roots (NRL)
0.04 + 2208.78 x y
3
- 3559.64 y
4
0.39 y x
2
17758.21 x y
4
0.96 0.02 7025 x + 56710 y -82060 Sin(y) + 0.467 Sin (x) + 28560 Cos(y)
-14700 x y + 21340 x Sin(y) + 13970 y Sin(y) -7025 x Cos(y)
+ 25370 y Cos(y) -2.28 y Sin(x)-3443 xy Sin(y)-6652 xy Cos (y)
-28560
0.68 1.40
Normalised (R+RL
+NR+NRL-BC)
1.24 sin(3.24 + 3.42 x
3
y
2
sin(y) - 3.77y
2
cos(x)) 0.80 0.92 8604 x + 15570y 19520 xy 2.067 sin(x) 22750 sin(y) +6988
cos(y) + 3453 y sin(x) -8606 x cos(y)– 4249 xy sin(y) -9014 xy cos
(y) - 6988
0.78 1.99
The variable x represents NAA concentration and y represents charcoal concentration
https://doi.org/10.1371/journal.pcbi.1005976.t005
Table 6. Variable sensitivity values of the parameters studied through symbolic regression.
Parameters Sensitivity Percentage positive Percentage negative
x y x y x y
Basal calli diameter (mm) - 1.65 - 0 - 100
Percentage of shoots rooted 0.7 1.78 50 75 50 25
Length of longest roots (mm) 0.3 1.4 0 50 100 50
Number of roots 0.21 1.26 0 50
100 50
Number of lateral roots 0.18 1.94 50 75 50 25
Normalised 0.13 1.64 50 75 50 25
https://doi.org/10.1371/journal.pcbi.1005976.t006
Symbolic regression to plant tissue culture data: An addendum to routine approach
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 8 / 13
procedures work simultaneously on model specification problem and the problem of fitting
coefficients [30]. Thus it provides both optimum model function as well as the optimum vari-
able values in the model. The simple relations derived from GP were more accessible to analyze
the relationships between the input and output variables [31]. Observation-based predictive
models using GP identified that the individualistic effect of charcoal was significant in all the
output parameters. A previous investigation suggested basal callus mass formation as one of
the primary constraints in the culture of this tree species [32]. In the present study, charcoal
has a positive and stimulatory effect in rooting by reducing basal callus formation in shoots.
For each of the functions, generated values can be obtained by increasing /decreasing the
Fig 3. The plot generated for parameters predicted by symbolic regression model for W.tinctoria plantlets as a
function of NAA and charcoal concentration showing basal calli diameter (mm). The cyan coloured dot indicates
the optimum concentration predicted by the model function.
https://doi.org/10.1371/journal.pcbi.1005976.g003
Fig 4. The plot generated for parameters predicted by symbolic regression model for W.tinctoria plantlets as a
function of NAA and charcoal concentration showing rooting percentage. The cyan coloured dot indicates the
optimum concentration predicted by the model function.
https://doi.org/10.1371/journal.pcbi.1005976.g004
Symbolic regression to plant tissue culture data: An addendum to routine approach
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 9 / 13
variables by a unit. After randomly testing the higher and lower limit of the additives with
experiments, the magnitude of the observed parameters can be presumed at any concentration
of the additives within this range using the models generated. It can be extended to analyze
synergistic interactions between two parameters by testing whether increasing both variables
by a unit, gives a higher or a lower value than the sum of the values obtained by increasing
each individually by a unit. The basic requirement for any empirical model includes interpret-
ability, robustness and reliability [33]. Symbolic regression gave comparably lesser RMSE val-
ues in comparison to multiple linear regression, thus adding validity to its use. In plant tissue
culture obtaining an optimum model is crucial when one needs to find the optimum experi-
mental parameters for large-scale production. The procedure adopted in the work can also be
Fig 5. The plot generated for parameters predicted by symbolic regression model for W.tinctoria plantlets as a
function of NAA and charcoal concentration showing the length of the longest roots (mm). The cyan coloured dot
indicates the optimum concentration predicted by the model function.
https://doi.org/10.1371/journal.pcbi.1005976.g005
Fig 6. The plot generated for parameters predicted by symbolic regression model for W.tinctoria plantlets as a
function of NAA and charcoal concentration showing the number of roots. The cyan coloured dot indicates the
optimum concentration predicted by the model function.
https://doi.org/10.1371/journal.pcbi.1005976.g006
Symbolic regression to plant tissue culture data: An addendum to routine approach
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 10 / 13
extended to similar experiments as it is general and computationally efficient. The analysis pre-
dicted the optimum concentration of medium for micropropagation of the selected tree spe-
cies from the model plots derived from the preliminary experimental data. The study indicated
that these models would have significant potential for saving time and expenditure in plant tis-
sue culture laboratories for the commercial establishment of in vitro protocols in tree species.
Fig 7. The plot generated for parameters predicted by symbolic regression model for W.tinctoria plantlets as a
function of NAA and charcoal concentration showing the number of lateral roots. The cyan coloured dot indicates
the optimum concentration predicted by the model function.
https://doi.org/10.1371/journal.pcbi.1005976.g007
Fig 8. The plot generated for the combination of all parameters studied, by symbolic regression model for W.
tinctoria plantlets as a function of NAA and charcoal concentration. The cyan coloured dot indicates the optimum
concentration predicted by the model function.
https://doi.org/10.1371/journal.pcbi.1005976.g008
Symbolic regression to plant tissue culture data: An addendum to routine approach
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 11 / 13
Supporting information
S1 Data. The minimal data set used for the analysis. Legends NAA, CH, BC, R, RL, NR and
NRL represent the concentration of NAA, concentration of charcoal, basal callus diameter,
percentage of shoots rooted, length of longest roots, number of roots and number of lateral
roots, respectively.
(CSV)
Author Contributions
Conceptualization: Meenu R. Mridula, K. Satheesh Kumar.
Data curation: Meenu R. Mridula.
Formal analysis: Meenu R. Mridula, K. Satheesh Kumar.
Funding acquisition: Ashalatha S. Nair.
Investigation: Meenu R. Mridula, Ashalatha S. Nair.
Methodology: K. Satheesh Kumar.
Project administration: Ashalatha S. Nair.
Resources: Ashalatha S. Nair, K. Satheesh Kumar.
Software: Meenu R. Mridula, K. Satheesh Kumar.
Supervision: Ashalatha S. Nair, K. Satheesh Kumar.
Validation: K. Satheesh Kumar.
Visualization: K. Satheesh Kumar.
Writing original draft: Meenu R. Mridula.
Writing review & editing: Ashalatha S. Nair.
References
1. Vladislavleva EJ, Smits GF, Den Hertog D. Order of nonlinearity as a complexity measure for models
generated by symbolic regression via pareto genetic programming. IEEE Transactions on Evolutionary
Computation. 2009 Apr; 13(2):333–49.
2. Gago J, Martı
´nez-Nu
´ñez L, Landı
´n M, Gallego PP. Artificial neural networks as an alternative to the tra-
ditional statistical methodology in plant research. Journal of plant physiology. 2010 Jan 1; 167(1):23–7.
https://doi.org/10.1016/j.jplph.2009.07.007 PMID: 19716625
3. Angeline PJ. Genetic programming: On the programming of computers by means of natural selection:
Koza John R., A Bradford Book, MIT Press, Cambridge MA, 1992,819 pp.
4. Larsen PE, Field D, Gilbert JA. Predicting bacterial community assemblages using an artificial neural
network approach. Nature methods. 2012 Jun; 9(6):621–625. https://doi.org/10.1038/nmeth.1975
PMID: 22504588
5. Andrianasolo FN, Casadebaig P, Maza E, Champolivier L, Maury P, Debaeke P. Prediction of sunflower
grain oil concentration as a function of variety, crop management and environment using statistical
models. European Journal of Agronomy. 2014 Mar 1; 54:84–96.
6. Mehrkesh A, Karunanithi AT. New quantum chemistry-based descriptors for better prediction of melting
point and viscosity of ionic liquids. Fluid Phase Equilibria. 2016 Nov 15; 427:498–503.
7. Xu J, Wang J, Wei Q, Wang Y. Symbolic regression equations for calculating daily reference evapo-
transpiration with the same input to Hargreaves-Samani in arid China. Water resources management.
2016 Apr 1; 30(6):2055–73.
8. Mahouti P, Gu¨neşF, Belen MA, Demirel S. Symbolic Regression for Derivation of an Accurate Analyti-
cal Formulation using “Big Data” An Application Example. Applied Computational Electromagnetics
Society Journal. 2017 May 1; 32(5):372–80.
Symbolic regression to plant tissue culture data: An addendum to routine approach
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 12 / 13
9. Paul T, Stanley K, Osgood N, Bell S, Muhajarine N. Scaling behaviour of human mobility distributions.
InInternational Conference on Geographic Information Science 2016 Sep 27 (pp. 145–159). Springer,
Cham.
10. Bewick V, Cheek L, Ball J. Statistics review 9: one-way analysis of variance. Critical care. 2004 Apr; 8
(2):130. https://doi.org/10.1186/cc2836 PMID: 15025774
11. Compton ME. Statistical methods suitable for the analysis of plant tissue culture data. Plant Cell,Tissue
and Organ Culture. 1994 Jun 1; 37(3):217–42.
12. Gago J, Martı
´nez-Nu
´ñez L, Landı
´n M, Flexas J, Gallego PP. Modeling the effects of light and sucrose
on in vitro propagated plants: a multiscale system analysis using artificial intelligence technology. PLOS
One. 2014 Jan 20; 9(1):e85989. https://doi.org/10.1371/journal.pone.0085989 PMID: 24465829
13. Korns MF. Highly Accurate Symbolic Regression with Noisy Training Data. InGenetic Programming
Theory and Practice XIII 2016 (pp. 91–115). Springer, Cham.
14. Gandomi AH, Alavi AH, Ryan C, editors. Handbook of genetic programming applications. Switzerland:
Springer; 2015 Nov 6.
15. Muttil N, Chau KW. Neural network and genetic programming for modelling coastal algal blooms. Inter-
national Journal of Environment and Pollution. 2006 Jan 1; 28(3-4):223–38.
16. Bonga JM, von Aderkas P. In vitro culture of trees. Springer Science & Business Media; 1992 May 31.
17. Purohit SD, Joshi P, Tak K, Nagori R. Development of high-efficiency micropropagation protocol of an
adult tree—Wrightia tomentosa. InPlant Biotechnology and Molecular Markers 2004 (pp. 217–227).
Springer, Dordrecht.
18. Murashige T, Skoog F. A revised medium for rapid growth and bioassays with tobaccotissue cultures.
Physiologia Plantarum. 1962 Jul 1; 15(3):473–97.
19. Hastie T, Tibshirani R, Friedman J. Unsupervised learning. InThe elements of statistical learning 2009
(pp. 485–585). Springer, New York, NY.
20. Schmidt M, Lipson H. (2013) Eureqa (version 0.98 beta)[software].
21. Schmidt M, Lipson H. Distilling free-form natural laws from experimental data. Science. 2009 Apr 3; 324
(5923):81–5. https://doi.org/10.1126/science.1165893 PMID: 19342586
22. Nagelkerke NJ. A note on a general definition of the coefficient of determination. Biometrika. 1991 Sep
1; 78(3):691–2.
23. Colbourn E, Rowe RC. Neural computing and pharmaceutical formulation. Encyclopedia of Pharmaceu-
tical Technology. New York, USA: Marcel Dekker. 2005:145–57.
24. Faris H, Sheta A. Identification of the Tennessee Eastman chemical process reactor using genetic pro-
gramming. International Journal of Advanced Science and Technology. 2013 Jan; 50:121–40.
25. Klug M, Bagrow JP. Understanding the group dynamics and success of teams. Royal Society open sci-
ence. 2016 Apr 1; 3(4):160007. https://doi.org/10.1098/rsos.160007 PMID: 27152217
26. Sharma PK, Trivedi R, Purohit SD. Activated Charcoal Improves Rooting in in vitro-Derived Acacia leu-
cophloea shoots.
27. Mathai MP, Zachariah JC, Samsudeen K, Rema J, Nirmal Babu K, Ravindran PN. Micropropagation of
Cinnamomum verum (Bercht & Presl.) In Edison S., Ramana KV, Sasikumar B., Babu K. Nirmal and SJ
Eapen (eds) Biotechnology of Spices, Medicinal and Aromatic Plants, ISS, IISR, Calicut. 1997:35–8.
28. Chen C. Application of growth models to evaluate the microenvironmental conditions using tissue cul-
ture plantlets of Phalaenopsis Sogo Yukidian ‘V3’. Scientia Horticulturae. 2015 Aug 6; 191:25–30.
29. Arab MM, Yadollahi A, Shojaeiyan A, Shokri S, Ghojah SM. Effects of nutrient media, different cytokinin
types and their concentrations on in vitro multiplication of G×N15 (hybrid of almond×peach) vegetative
rootstock. Journal of Genetic Engineering and Biotechnology. 2014 Dec 1; 12(2):81–7.
30. Duffy J, Engle-Warnick J. Using symbolic regression to infer strategies from experimental data. InEvolu-
tionary computation in Economics and Finance 2002 (pp. 61-82). Physica, Heidelberg.
31. Koza JR, Bennett FH, Andre D, Keane MA. Genetic programming III: Darwinian invention and problem
solving [Book Review]. IEEE Transactions on Evolutionary Computation. 1999 Sep; 3(3):251–3.
32. Purohit SD, Kukda G. In vitro propagation of Wrightia tinctoria.Biologia Plantarum. 1994 Dec 1; 36
(4):519–26.
33. Vladislavleva EY. Model-based problem solving through symbolic regression via pareto genetic pro-
gramming. CentER, Tilburg University; 2008 Aug.
Symbolic regression to plant tissue culture data: An addendum to routine approach
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1005976 February 27, 2018 13 / 13

Supplementary resource (1)

... To tackle this impediment, machine learning algorithms can be used as an alternative mathematical method [8][9][10][11]. Different machine learning algorithms (e.g., artificial neural networks (ANNs), neuro-fuzzy logic systems, support vector machine (SVM), and random forest) have been recently used for modeling and predicting various in vitro culture systems such as explant sterilization [12,13], in vitro seed germination [14], callogenesis [15][16][17], androgenesis [18], shoot proliferation [19,20], rhizogenesis [21,22], in vitro secondary metabolite production [8,23,24], and gene transformation [25,26]. Among machine learning algorithms, different types of ANNs such as multilayer perceptron (MLP), radial basis function (RBF), and generalized regression neural network (GRNN) have been widely employed to model and predict in vitro culture processes [27]. ...
... The somatic embryogenesis in carrot has been previously studied [21,29,30,[32][33][34][35]. Establishing carrot somatic embryogenesis was associated with different obstacles such as low efficiency of somatic embryogenesis, chimeric callogenesis consisting of both non-embryogenic and embryogenic calli [21,29,30,[32][33][34][35]. ...
... The somatic embryogenesis in carrot has been previously studied [21,29,30,[32][33][34][35]. Establishing carrot somatic embryogenesis was associated with different obstacles such as low efficiency of somatic embryogenesis, chimeric callogenesis consisting of both non-embryogenic and embryogenic calli [21,29,30,[32][33][34][35]. Data mining approaches may help in reducing trial and errors in the process of optimizing carrot somatic embryogenesis. ...
Article
Full-text available
Introduction Developing somatic embryogenesis is one of the main steps in successful in vitro propagation and gene transformation in the carrot. However, somatic embryogenesis is influenced by different intrinsic (genetics, genotype, and explant) and extrinsic (e.g., plant growth regulators (PGRs), medium composition, and gelling agent) factors which cause challenges in developing the somatic embryogenesis protocol. Therefore, optimizing somatic embryogenesis is a tedious, time-consuming, and costly process. Novel data mining approaches through a hybrid of artificial neural networks (ANNs) and optimization algorithms can facilitate modeling and optimizing in vitro culture processes and thereby reduce large experimental treatments and combinations. Carrot is a model plant in genetic engineering works and recombinant drugs, and therefore it is an important plant in research works. Also, in this research, for the first time, embryogenesis in carrot (Daucus carota L.) using Genetic algorithm (GA) and data mining technology has been reviewed and analyzed. Materials and methods In the current study, data mining approach through multilayer perceptron (MLP) and radial basis function (RBF) as two well-known ANNs were employed to model and predict embryogenic callus production in carrot based on eight input variables including carrot cultivars, agar, magnesium sulfate (MgSO4), calcium dichloride (CaCl2), manganese (II) sulfate (MnSO4), 2,4-dichlorophenoxyacetic acid (2,4-D), 6-benzylaminopurine (BAP), and kinetin (KIN). To confirm the reliability and accuracy of the developed model, the result obtained from RBF-GA model were tested in the laboratory. Results The results showed that RBF had better prediction efficiency than MLP. Then, the developed model was linked to a genetic algorithm (GA) to optimize the system. To confirm the reliability and accuracy of the developed model, the result of RBF-GA was experimentally tested in the lab as a validation experiment. The result showed that there was no significant difference between the predicted optimized result and the experimental result. Conclutions Generally, the results of this study suggest that data mining through RBF-GA can be considered as a robust approach, besides experimental methods, to model and optimize in vitro culture systems. According to the RBF-GA result, the highest somatic embryogenesis rate (62.5%) can be obtained from Nantes improved cultivar cultured on medium containing 195.23 mg/l MgSO4, 330.07 mg/l CaCl2, 18.3 mg/l MnSO4, 0.46 mg/l 2,4- D, 0.03 mg/l BAP, and 0.88 mg/l KIN. These results were also confirmed in the laboratory
... This method relies on genetic algorithms and is especially suitable for plant tissue culture. The whole process takes into account the different culture stages of different plant tissues (such as embryo [75], callus [78], bud [79], and root [80]). The modeling, prediction, and optimization of plant genotypes, media, sterilization conditions, different types, and concentrations of plant growth regulators also need to be considered [75]. ...
... As shown in Figure 3, data-driven models are effectively used for different purposes in plant tissue culture. AI and OA have been effectively applied to predict and optimize the length and number of micro-buds [79,81] or roots [80], plant cell culture or hairy root biomass [82], and culture environmental conditions (such as temperature and sterilization [83,84]) to achieve the maximum productivity and efficiency as well as the classification of micro-buds and somatic embryos. Future AI-OA methods could also be used in the development of genetic engineering and genome editing. ...
Article
Full-text available
At present, most precious compounds are still obtained by plant cultivation such as ginsenosides, glycyrrhizic acid, and paclitaxel, which cannot be easily obtained by artificial synthesis. Plant tissue culture technology is the most commonly used biotechnology tool, which can be used for a variety of studies such as the production of natural compounds, functional gene research, plant micropropagation, plant breeding, and crop improvement. Tissue culture material is a basic and important part of this issue. The formation of different plant tissues and natural products is affected by growth conditions and endogenous substances. The accumulation of secondary metabolites are affected by plant tissue type, culture method, and environmental stress. Multi-domain technologies are developing rapidly, and they have made outstanding contributions to the application of plant tissue culture. The modes of action have their own characteristics, covering the whole process of plant tissue from the induction, culture, and production of natural secondary metabolites. This paper reviews the induction mechanism of different plant tissues and the application of multi-domain technologies such as artificial intelligence, biosensors, bioreactors, multi-omics monitoring, and nanomaterials in plant tissue culture and the production of secondary metabolites. This will help to improve the tissue culture technology of medicinal plants and increase the availability and the yield of natural metabolites.
... In contrast, common machine learning approaches, including neural networks and support vector machines, although powerful in fields like computer vision and natural language processing, are typically non-transparent and less interpretable due to their black-box nature [9]. Symbolic regression offers a solution, searching for mathematical relationships between variables without predefined assumptions, distinguishing it from conventional regression, which relies on predefined functions [10]. It explores a broad range of mathematical operations to uncover potentially complex and non-linear relationships which traditional methods may not reveal. ...
... Recently plant tissue culture is recognising the benefits of the application of artificial intelligence and optimization algorithms in spite of their complexity in genetic programming and modelling [5,17]. Radial Basis Function (RBF), Generalized Regression Neural Network (GRNN), Probabilistic Neural Network (PNN), Neuro-Fuzzy Logic (NFL), Support Vector Machine (SVM), and Multilayer Perceptron (MLP) are some of the most widely used artificial neural networks (ANNs) for modeling and protocol optimization in plant tissue culture. ...
Article
Full-text available
In vitro, sterilization is one of the key components for proceeding with plant tissue cultures. Since the effectiveness of sterilization has a direct impact on the culture's final outcomes, there is a crucial need for optimization of the sterilization process. However, compared with traditional optimizing methods, the use of computational approaches through artificial intelligence-based process modeling and optimization algorithms provides a precise optimal condition for in vitro culturing. This study aimed to optimise in vitro sterilization of grape rootstock 3309C using RSM, ANN, and genetic algorithm (GA) techniques. In this context, two output responses, namely, Clean Culture and Explant Viability, were optimised using the models developed by RSM and ANN, followed by a GA, to obtain a globally optimal solution. The most influential independent factors, such as HgCl2, NaOCl, AgNO3, and immersion time, were considered input variables. The significance of the developed models was investigated with statistical and non-statistical techniques and was optimised to determine the significance of selected inputs. The optimal clean culture of 91%, and the explant viability of 89% can be obtained from 1.62% NaOCl at a 13.96 min immersion time, according to MLP-NSGAII. Sensitivity analysis revealed that the clean culture and explant viability were less sensitive to AgNO3 and more sensitive to immersion time. Results showed that the differences between the GA predicted and validation data were significant after the performance validation of predicted and optimised sterilising agents with immersion time combinations were tested. In general, GA, a potent methodology, may open the door to the development of new computational methods in plant tissue culture.
... Symbolic regression has been identified as a promising solution for obtaining explanatory models without imposing any a priori assumptions [57]. Symbolic regression has seen development using both genetic programming and simulated annealing; however, as shown in [58], symbolic regression using simulated annealing generally outperforms symbolic regression based on genetic programming. ...
Article
Full-text available
Process cooling for food production is an energy-intensive industry with complex interactions and restrictions that complicate the ability to utilize energy-flexibility due to unforeseen consequences in production. Therefore, methods for assessing the potential flexibility in individual facilities to enable the active participation of process-cooling facilities in the electricity system are essential, but not yet well discussed in the literature. Therefore, this paper introduces an assessment method based on multi-method simulation and multi-objective optimization for investigating energy flexibility in process cooling, with a case study of a Danish process-cooling facility for canned-meat food production. Multi-method simulation is used in this paper: multi-agent-based simulation to investigate individual entities within the process-cooling system and the system’s behavior; discrete-event simulation to explore the entire process-cooling flow; and system dynamics to capture the thermophysical properties of the refrigeration unit and states of the refrigerated environment. A simulation library is developed, and is able to represent a generic production-flow of the canned-food process cooling. A data-driven symbolic-regression approach determines the complex logic of individual agents. Using a binary tuple-matrix for refrigeration-schedule optimization, the refrigeration-cycle operation is determined, based on weather forecasts, electricity price, and electricity CO2 emissions without violating individual room-temperature limits. The simulation results of one-week’s production in October 2020 show that 32% of energy costs can be saved and 822 kg of CO2 emissions can be reduced. The results thereby show the energy-flexibility potential in the process-cooling facilities, with the benefit of overall production cost and CO2 emissions reduction; at the same time, the production quality and throughput are not influenced.
... Different machine learning algorithms (e.g., ANNs, neuro-fuzzy logic systems, support vector machine (SVM), and random forest) have been recently used for modeling and predicting various in vitro culture systems such as explant sterilization [13,20], in vitro seed germination [21], callogenesis [22][23][24], androgenesis [25], shoot proliferation [15,26], rhizogenesis [27], in vitro secondary metabolite production [28][29][30], and gene transformation [31,32]. Among machine learning algorithms, different types of ANNs, such as MLP, radial basis function (RBF), and generalized regression neural network (GRNN), have been widely employed to model and predict in vitro culture processes [16,33,34]. ...
Article
Full-text available
Background Callus induction is the first step in optimizing plant regeneration. Fit embryogenesis and shooting rely on callus induction. In addition, using artificial intelligence models in combination with an algorithm can be helpful in the optimization of in vitro culture. The present study aimed to evaluate the percentage and speed of callus induction optimization in carrot with a Multilayer Perceptron-Single point discrete genetic algorithm (GA). Materials and methods In this study, the outputs included callus induction percentage and speed, while inputs were different types and concentrations of plant growth regulator (0. 5, 0.2 mg/l 2,4-D, 0.3, 0.2, 0.5 mg/l BAP, 1, 0.2 mg/l Kin, and 2 mg/l NAA), different explants (shoot, root, leaf, and nodal), a different concentration compound of MS medium (1 × MS, 4× MS, and 8× MS) and time of sampling. The data were obtained in the laboratory, and multilayer perceptron (MLP) and radial basis function (RBF), two well-known ANNs, were employed to model. Then, GA was used for optimization, and sensitivity analysis was performed to indicate the inputs’ importance. Results The results showed that MLP had better prediction efficiency than RBF. Based on the results, R ² in training and testing data was 95 and 95% for the percentage of callus induction, while it was 94 and 95% for the speed of callus induction, respectively. In addition, a concentration compound of MS had high sensitivity, while times of sampling had low sensitivity. Based on the MLP-Single point discrete GA, the best results were obtained for shoot explants, 1× MS media, and 0.5 mg/l 2, 4-D + 0.5 mg/l BAP. Further, a non-significant difference was observed between the test result and predicted MLP. Conclusions Generally, MLP-Single point discrete GA is considered a potent tool for predicting treatment and fit model results used in plant tissue culture and selecting the best medium for callus induction.
... 99 % of shoots initiated roots within 21 days of shoot inoculation. Mridula, Nair and Kumar13 reported five roots per shoot explant of WTR inoculated on MS medium which is augmented with NAA (0.07% charcoal and 4µM). Similarly, John et al 10 reported 4.7 roots per nodal shoot explants of WTR on MS media amended with IBA (0.8 mg/L).On the other hand, Purohit and Kukda 14,15 observed 2.8 roots per nodal shoot explants of WTR with maximum root length of 5.4 cm on MS media amended with IBA (100 mg/L), 4.6 roots per nodal shoot explants of WTR with maximum root length of 5.6 cm on MS media amended with IBA (500 mg/L) respectively. ...
Article
Wrightia tinctoria Roxb. (WTR) is a medicinal plant with soft wood that belongs to Apocynaceae family. The therapeutic constituents of this plant are used in treating many human ailments like psoriasis, diabetics and cancer. In view of low percent of natural regeneration, over exploitation due to its medicinal (treatment of human ailments) and commercial importance (toy making and pala indigo dye) and the hurdles faced by the Etikoppaka toy making industry (Vizag, A.P. India), there is dier need for conservation and production of propagules of this valuable soft woody and medicinally important tree. This study is undertaken to optimize and develop large scale regeneration and hardening protocol in Vadlamudi, Guntur Dt., A.P. using nodal explants of fifteen days old (in vitro) seedling. The nodal explants are inoculated on MS media amended with BAP (1.5 mg/L) + NAA (0.1 mg/L), induced 14 shoots per explant in 30 days with a length of 5.2 cm and 85-90% shoot initiation. The shoots when inoculated on Murashige Skoog media reinforce with IBA (3.5mg/L), showed eight roots per one shoot explant with maximum length of 6.9 cm. 99 % shoots initiated roots within 21 days of shoot inoculation. After 28 days of rooting, the plantlets are acclimatized to natural conditions through different stages and the % of survival rates of plantlets in the natural conditions was found to be 55 %.
Article
Plant tissue culture is a process of in vitro regeneration requiring numerous resources and intensive labour to mass produce disease-free clones. Diverse factors such as sterilizing agents, media composition, and environmental conditions contribute toward successful regeneration and decide the production, such as the total shoot number, shoot length, in vitro rooting, and adaptation of plants to the external environment. Plant tissue culture, the successful induction of rapid shoot production, and subsequent root formation in plants are influenced by the utilization of appropriate growing conditions customized to each specific explant type. By carefully manipulating environmental factors, such as temperature, light, and nutrient availability, it is possible to stimulate the growth and development of new shoots in a time-efficient manner. This strategic combination of optimal growing conditions and hormone supplementation holds great promise in the domain of efficient propagation of plants through tissue culture techniques. The recent progress in artificial techniques such as artificial neural networks (ANN) and machine learning (ML) algorithms has presented promising opportunities for the development of sustainable and precise plant tissue culture processes. These techniques are widely recognized as robust techniques for assessing outcomes and enhancing the accuracy of predicting outputs in the domain of plant tissue culture. AI techniques and optimization algorithms have been applied to predict and optimize callogenesis, embryogenesis, several shoots, shoot length, hairy root culture, in vitro rooting, and plant acclimatization by helping predict sterilizing conditions, optimal culture conditions, and formulation of a suitable medium. Patents, modeling, and formulation of each stage of plant tissue culture using tools like artificial neural networks (ANNs), neuro-fuzzy logic, support vector machines (SVMs), decision trees (DT), random forests (FR), and genetic algorithms (GA) are presented. Conclusion In this article, the current state of Artificial Intelligence (AI) algorithms, including their applications in all elements of plant tissue culture, as well as the patents that have been gained for these algorithms, are dissected in great detail.
Article
Advancements in genome mining, high-throughput sequencing and experimental techniques have generated an enormous amount of data on natural products. This has led to the design and development of advanced machine learning (ML) and artificial intelligence (AI) algorithms which have simplified the search for novel natural products in the 21st century. These algorithms could effectively analyse the chemical structure of natural products and predict their biological function. They could also effectively analyse large sets of data in a sophisticated manner. In this context, this article reviews the various AI/ML algorithms employed in natural products-based drug discovery. Particular attention is paid to case studies employing AI tools in plant and microbial research. Challenges associated with the use of AI tools for natural products research have also been discussed.
Article
Full-text available
With emerging of the Big Data era, sample datasets are becoming increasingly large. One of the recently proposed algorithms for Big Data applications is Symbolic Regression (SR). SR is a type of regression analysis that performs a search within mathematical expression domain to generate an analytical expression that fits large size dataset. SR is capable of finding intrinsic relationships within the dataset to obtain an accurate model. Herein, for the first time in literature, SR is applied to derivate a full-wave simulation based analytical expression for the characteristic impedance Z0 of microstrip lines using Big Data obtained from an 3DEM simulator, in terms of only its real parameters which are substrate dielectric constant ϵ, height h and strip width w within 1-10 GHz band. The obtained expression is compared with the targeted simulation data together with the other analytical counterpart expressions of Z0 for different types of error function. It can be concluded that SR is a suitable algorithm for obtaining accurate analytical expressions where the size of the available data is large and the interrelations within the data are highly complex, to be used in Electromagnetic analysis and designs.
Chapter
Full-text available
As symbolic regression (SR) has advanced into the early stages of commercial exploitation, the poor accuracy of SR, still plaguing even the most advanced commercial packages, has become an issue for early adopters. Users expect to have the correct formula returned, especially in cases with zero noise and only one basis function with minimally complex grammar depth. At a minimum, users expect the response surface of the SR tool to be easily understood, so that the user can know a priori on what classes of problems to expect excellent, average, or poor accuracy. Poor or unknown accuracy is a hindrance to greater academic and industrial acceptance of SR tools. In two previous papers, we published a complex algorithm for modern symbolic regression which is extremely accurate for a large class of Symbolic Regression problems. The class of problems, on which SR is extremely accurate, is described in detail in these two previous papers. This algorithm is extremely accurate, in reasonable time on a single processor, for from 25 up to 3000 features (columns). Extensive statistically correct, out of sample training and testing, demonstrated the extreme accuracy algorithm’s advantages over a previously published base line Pareto algorithm in case where the training and testing data contained zero noise. While the algorithm’s extreme accuracy for deep problems with a large number of features on noiseless training data is an impressive advance, there are many very important academic and industrial SR problems where the training data is very noisy. In this chapter we test the extreme accuracy algorithm and compare the results with the previously published baseline Pareto algorithm. Both algorithms’ performance are compared on a set of complex representative problems (from 25 to 3000 features), on noiseless training, on noisy training data, and on noisy training data with range shifted testing data. The enhanced algorithm is shown to be robust, with definite advantages over the baseline Pareto algorithm, performing well even in the face of noisy training data and range shifted testing data.
Article
Full-text available
To present an alternative simple equation for reference evapotranspiration (ET o) estimation, the symbolic regression (SR) method was applied to establish equations with the same inputs to simple Hargreaves-Samani (HS) equation in arid China. For most of the equations derived by SR method for each station, their performance increased with an increase in the equation complex index (CI). The most precise equation performed well although it was always complex and greatly varied in form. On the other hand, the simplest one was uniform in equation structure and performed slightly better than the HS equation for all the five stations, and sometimes better than the local calibrated HS equation. A trade-off equation was selected with almost the same equation form for all the five stations and low CI index. The site-specific trade-off equation performed better than the simplest one and the locally calibrated HS equation. Then parameters in the trade-off equation were unified for all the five stations, it did not perform as good as the site-specific one, but performed better than the HS equation and unified local calibrated HS equation. Thus, the SR method is suitable to determine both the site-specific and the unified equation among stations for daily ET o calculation in arid regions.
Article
Woody plants provide many challenges to the tissue culturist. Although there are many excellent tissue culture books and manuals available, these are generally strongly biased towards herbacious crops. Consequently, they often do not pay sufficient attention to the problems that specifically apply to in vitro culture of tree species. Culture of the latter often poses problems which are either absent or of lesser significance when culturing herbacious species. When trees in the field are used as explant source, the problems can be especially severe. For example, the physiological condition of the explants is difficult to control because of variation in weather and biotic factors. Furthermore, it is often difficult to obtain explants free of contaminants from field grown trees. Lack of genetic uniformity and maturation are additional problems one often has to deal with when culturing tree cells or tissues. These problems are emphasized in this text. In vitro culture of trees is not viewed in isolation. It is considered in conjunction with breeding, traditional cloning and other common tree improvement techniques. The text discusses theoretical as well as practical aspects of the in vitro culture of trees.
Article
Ionic liquids (ILs) are chemicals that are nonvolatile and hence have the potential to replace volatile organic compounds in industrial applications. A large number of ILs, through the combination of different cations and anions, can potentially be synthesized. In this context, it will be useful to intelligently design customized ILs through computer-aided methods. Practical limitations dictate that any successful attempt to design new ILs for industrial applications requires the ability to accurately predict their melting point and viscosity as experimental data will not be available for the designed structures. In this paper, we present two new correlations for precise prediction of melting point and viscosity of ILs solely based on inputs from quantum chemistry calculations (no experimental data or simulation results are needed). To develop these correlations we utilized data related to size, shape, and electrostatic properties of cations and anions that constitutes ILs. In this work, new descriptors such as dielectric energy of cations and anions as well as the values predicted by an ‘ad-hoc’ model for the radii of cations and anions (instead of their van der waals radii) were used. A large number of correlation equations consisting different combination of descriptors (as inputs to the model) were tested and the best correlation for viscosity and melting point were identified. The average relative errors were estimated as 3.16% and 6.45% for melting point, Tm, and ln(vis), respectively.
Book
This contributed volume, written by leading international researchers, reviews the latest developments of genetic programming (GP) and its key applications in solving current real world problems, such as energy conversion and management, financial analysis, engineering modeling and design, and software engineering, to name a few. Inspired by natural evolution, the use of GP has expanded significantly in the last decade in almost every area of science and engineering. Exploring applications in a variety of fields, the information in this volume can help optimize computer programs throughout the sciences. Taking a hands-on approach, this book provides an invaluable reference to practitioners, providing the necessary details required for a successful application of GP and its branches to challenging problems ranging from drought prediction to trading volatility. It also demonstrates the evolution of GP through major developments in GP studies and applications. It is suitable for advanced students who wish to use relevant book chapters as a basis to pursue further research in these areas, as well as experienced practitioners looking to apply GP to new areas. The book also offers valuable supplementary material for design courses and computation in engineering.
Book
The book consists of articles contributed by experts in the respective field of biotechnology. During the last couple of years, biotechnological approaches are being employed for the improvement in quality and quantity of oil crops, cereals, legumes and medicinal plants. Considering the rapid depletion of precious natural wealth, micropropagation through tissue culture has assumed added significance. This book, timely presentation of current status of achievements by active researchers as well as future prospects, deals with some thrust areas of plant biotechnology. For the purpose of prevention of bio-piracy molecular markers have emerged as the most reliable tool for indexing genetic polymorphism of plants. The chapters testify the value of the book at this juncture when plant biotechnology and molecular marker system are making rapid strides towards commercial applications. Depending upon the importance of the crop some of the chapters are plant-specific, while others deal more generally with the area of the subject. Graduate and postgraduate students and researchers who wish to further investigate those areas, will find this book of utmost use. © 2005 Springer Science + Business Media, Inc. All rights reserved.