Content uploaded by Basmattee Boodram
Author content
All content in this area was uploaded by Basmattee Boodram on Nov 04, 2021
Content may be subject to copyright.
Proceedings of the 2019 Winter Simulation Conference
N. Mustafee, K.-H.G. Bae, S. Lazarova-Molnar, M. Rabe, C. Szabo, P. Haas, and Y.-J. Son, eds.
MULTI-OBJECTIVE MODEL EXPLORATION OF HEPATITIS C ELIMINATION IN AN
AGENT-BASED MODEL OF PEOPLE WHO INJECT DRUGS
Eric Tatara
Alexander Gutfraind
Nicholson T. Collier
Scott J. Cotler
Jonathan Ozik
Harel Dahari
Decision and Infrastructure Sciences Division
Division of Hepatology, Dept of Medicine
Argonne National Laboratory
Loyola University Medical Center
5735 S Ellis Ave
2160 S 1st Ave
Chicago, IL 60637, USA
Maywood, IL 60153, USA
Marian Major
Basmattee Boodram
Division of Viral Products
School of Public Health
US Food and Drug Administration
University of Illinois at Chicago
10903 New Hampshire Ave
1603 W Taylor St
Silver Spring, MD 20993, USA
Chicago, IL 60612, USA
ABSTRACT
Hepatitis C (HCV) is a leading cause of chronic liver disease and mortality worldwide and persons who
inject drugs (PWID) are at the highest risk for acquiring and transmitting HCV infection. We developed an
agent-based model (ABM) to identify and optimize direct-acting antiviral (DAA) therapy scale-up and
treatment strategies for achieving the World Health Organization (WHO) goals of HCV elimination by the
year 2030. While DAA is highly efficacious, it is also expensive, and therefore intervention strategies
should balance the goals of elimination and the cost of the intervention. Here we present and compare two
methods for finding PWID treatment enrollment strategies by conducting a standard model parameter
sweep and compare the results to an evolutionary multi-objective optimization algorithm. The evolutionary
approach provides a pareto-optimal set of solutions that minimizes treatment costs and incidence rates.
1 INTRODUCTION
Persons who inject drugs (PWID) are at the highest risk for acquiring and transmitting hepatitis C virus
(HCV) infection (Bruggmann 2013), which is a leading cause of liver disease in the US (El-Serag 2012).
Approximately 32,000 PWID reside in metropolitan Chicago, Illinois (Tempalski et al. 2013) with an
estimated HCV-RNA prevalence of 47% (Gutfraind et al. 2015). Given the public health burden of hepatitis
C, there is a growing need to provide model-based forecasting of current (prevalence) and new (incidence)
infections in the population that accounts for the complex interplay of demographic factors, social networks,
geographic location, and behaviors (Dahari and Boodram 2018). Additionally, the ability to predict the
effectiveness of treatment programs on reducing new infections may be of crucial importance, as the effects
of disease treatment cannot be known a priori and can take several years to a decade to fully appreciate.
We previously developed the Agent-based Pathogen Kinetics (APK) model to simulate the PWID
population in metropolitan Chicago including the social interactions that result in HCV infection, and
applied the model to study and predict changes in HCV prevalence in this population in the absence of
treatment (Gutfraind et al. 2015). In APK, each PWID performs drug-related daily activities and has a state
1008978-1-7281-3283-9/19/$31.00 ©2019 IEEE
Tatara, Collier, Ozik, Gutfraind, Cotler, Dahari, Major, and Boodram
of HCV infection, a location of residency, and maintains a drug sharing network with other PWID. APK
was designed using highly detailed empirical networks and geospatial Chicago-based data from large
epidemiological empirical datasets to model the PWID population in each neighborhood; capturing
associations between demographic characteristics (e.g. age, race/ethnicity) and drug injection behaviors
(e.g. sharing syringes). The main novelty of APK is the detailed simulation of four aspects of the drug
lifestyle: demographic, behavioral, social and geospatial.
In our present study we have ported the APK model to our Hepatitis C Elimination in PWID (HepCEP)
model to identify and optimize direct-acting antiviral (DAA) therapy scale-up and treatment strategies for
achieving the World Health Organization (WHO) goals of reducing new chronic HCV infections by 90%
by 2030. HepCEP includes novel PWID behaviors for DAA therapy enrollment and response to treatment.
DAA treatment is highly efficacious, however the cost of treatment can exceed $25,000 per patient (Bethea
et al. 2018). Therefore, reducing the number of treatments in a population is desirable. This presents an
opportunity to investigate DAA treatment enrollment strategies that can simultaneously minimize treatment
costs and the rate of new chronic infections.
Recruitment of PWID into DAA treatment programs for HCV can be conducted using various methods.
In this paper, we present and compare random to non-random recruitment strategies. To do this, we first
conducted a standard model parameter sweep exploration with an a priori-defined set of parameter values,
then compared the results to an evolutionary genetic algorithm approach that can provide a pareto-optimal
set of solutions that minimizes DAA treatment costs and HCV incidence rates.
2 HEPCEP MODEL
To perform the proposed analyses, we extended the capabilities of our previous APK model (Gutfraind et
al. 2015) to include DAA therapy and to better enable large-scale simulated parameter sweeps over DAA
scale-up rates with the Repast HPC-based (Colllier and North 2013) HepCEP model. The primary behavior
enhancements in HepCEP are the treatment enrollment strategies, which are examined specifically in the
context of achieving significant reductions in HCV incidence. All other PWID behaviors in the APK model
were ported to HepCEP unchanged. The Repast HPC implementation of the model is written entirely in
C++, which provides a nominal performance improvement over the Java APK model; however, it provides
a substantial improvement in the memory footprint. A typical simulation of 32,000 PWID for a 20-year
time duration on a single CPU core requires approximately one hour of computation (wall) time and 600MB
of memory. For single runs on individual workstations, the efficiency improvements observed in HepCEP
are nominal; however, the roughly 7-fold memory footprint reduction compared to APK has a large impact
on cluster-based simulations in which a single compute node may contain 36 cores, but with a shared
memory that can be used across multiple concurrently running simulations.
We used an updated method in HepCEP for creating PWID agents in-memory by sampling from a pre-
generated database of individuals. The APK model generated the PWID population as needed from multiple
surveys of demographic and behavioral information for Chicago area PWIDs (Gutfraind et al. 2015). The
HepCEP PWID input data set, in contrast, contained only the following pre-computed data for each
individual: age, age started injecting drugs, gender, race, zip code, syringe source (harm reduction), HCV
infection status, drug sharing network degree, and parameters for daily injection rates and syringe sharing.
The population demographic and behavioral properties were statistically identical between APK and
HepCEP. The primary benefits of this PWID sampling method are reduced computational complexity since
the sample population is completely file-based, and, more importantly, the elimination of personally
identifiable information (PII) medical data, enabling the use of any open-science high-performance
computing resource without data protection concerns. The HepCEP PWID input data set contains
approximately 100,000 unique entries which are randomly sampled throughout the simulation to create new
PWID agents. The initial model population is a model parameter, and this study uses a population size of
32,000 individuals. PWID agents may die according to an age-related random process, and are replaced
with new PWIDs sampled from the input data set to maintain a nearly constant population size for the entire
course of the simulation.
1009
Tatara, Collier, Ozik, Gutfraind, Cotler, Dahari, Major, and Boodram
2.1 Syringe Sharing Networks
Syringe-sharing is the primary mode of HCV transmission among PWID (Alter 2007) and therefore PWIDs
in HepCEP are connected via syringe sharing networks. Network formation is determined by the probability
of two persons encountering each other in their neighborhood of residence, in outdoor drug purchasing
market areas in Chicago, or random interaction with any other individual. The methods used to calculate
network encounter rates, establishment processes, and removal of networks are detailed in (Gutfraind et al.
2015). The initial PWID population members are placed as nodes in the network, and edges are
probabilistically created based on the above criteria. Each individual has a predetermined number of in-
network PWID partners who provide syringes to the individual and out-network PWID partners who
receive syringes from the individual. The network edge direction determines the flow of HCV-contaminated
syringes between individuals, and thus the direction of HCV transmission. Network connections have an
average lifespan after which they are destroyed, and new connections are formed probabilistically.
2.2 DAA Treatment Enrollment
Treatment enrollment is modelled by selecting HCV RNA positive PWIDs such that infected individuals
are sampled at random from the total PWID population or from syringe-sharing network connections,
depending on the enrollment method type. The annual target enrollment rate is a model parameter that
determines the total annual treatment enrollment as a fraction of the total population, and in this study we
examine annual enrollment rates in the range of 1-10% of the total population. We chose a conservative
treatment duration of 12 weeks. Treatment success probability is a function of treatment non-adherence and
sustained virologic response (SVR) parameters. We used a treatment failure rate of 10% and treatment SVR
of 90%. In this study, treatment re-enrollment is allowed such that PWID who have completed a successful
treatment and become re-infected are eligible for additional re-treatments. DAA cost is assumed $25,000
(USD) per treatment, consistent with recent wholesale prices (Bethea et al. 2018).
The process of treatment enrollment of the PWID population may be scheduled to begin at any time
during a simulation relative to the simulation start date, which is specified via a model parameter, and
repeats every tick interval of 1 day. The total PWID target enrollment for a single day is determined by the
daily mean treatment enrollment, which is the total PWID population multiplied by the annual treatment
enrollment parameter / 365. The daily enrollment target is sampled from a Poisson distribution using the
daily mean treatment enrollment. Multiple treatment enrollment methods are available, and are described
in detail below. Each treatment method has a model parameter enrollment probability of 0-1, and the sum
of all enrollment probabilities must equal 1.0. The total PWID daily recruitment target for each enrollment
type is calculated as the product of the enrollment probability for the enrollment type and the daily
enrollment target. A PWID is only eligible for enrollment in DAA treatment if s/he has a positive HCV
RNA test result (clinical diagnosis of HCV) and is not currently in treatment.
2.3 DAA Treatment Enrollment Methods
2.3.1 Random Recruitment
Random recruitment selects HCV-infected individuals from the PWID population. The list of available
PWID candidates is shuffled and random individuals are selected until the daily enrollment target for the
recruitment is met or no eligible PWID remain for recruitment that day. This sampling process does not
consider demographics or social network connections.
2.3.2 Harm Reduction Program (HRP)
Similar to random recruitment, but individuals must be registered in a harm reduction program, such as
syringe service program (SSP) that provides sterile syringe injection equipment along with risk reduction
1010
Tatara, Collier, Ozik, Gutfraind, Cotler, Dahari, Major, and Boodram
counseling to enrollees. PWID enrolled in these programs are considered a lower risk for HCV transmission
than those with similar injection behaviors who are not enrolled. The list of available PWID candidates is
shuffled and random individuals in HRP are selected until the daily enrollment target for the HRP
recruitment is met or no eligible PWID remain for recruitment that day.
2.3.3 Full Network Recruitment
The network recruitment methods begin by selecting a PWID via the random recruitment method, then
enrolling all other PWID who share syringes with the selected individual (personal injection network).
These include individuals who give and receive syringes from the selected PWID. The list of available
PWID candidates is shuffled and random individuals are selected until the daily enrollment target for the
full network recruitment is met or no eligible PWID remain for recruitment that day. Individuals who are
enrolled as part of the network recruitment are counted toward the daily target. All individuals, whether
the originally selected, or selected through the network, must meet the eligibility criteria outlined above.
2.3.4 In-Partner and Out-Partner Network Recruitment
Similar to the full network recruitment, the in-partner enrollment method only recruits a single social
network “in” edge, or another PWID who provides syringes to the originally selected individual.
Alternately, the out-partner recruitment only recruits a single social network “out” edges, or another PWID
who receives syringes from the originally selected individual.
2.4 Simulation Schedule
HepCEP uses a simulation step size of one day in which multiple model behaviors are scheduled in the
following sequence. First, the main model step is executed that iterates over each PWID agent and executes
the individual’s step method, which probabilistically determines the number of drug sharing episodes
between the individual and other PWID in the individual’s drug sharing network. After each individual has
executed its step, any individuals that have died are removed from the population and new PWID agents
are generated to maintain the population size. Next, a PWID network linking is conducted to update PWID
network connections due to losses or gains from deaths or arriving PWID agents, or from network edges
that have expired. Finally, the treatment enrollment is conducted by creating a candidate pool of all PWID
eligible for enrollment and entering candidates in each of the enrollment methods.
2.5 Exploration of Treatment Enrollment Methods
HepCEP provides the ability to independently specify the total annual target treatment enrollment
percentage of the population and the distribution for each treatment enrollment method. Individual
treatment method enrollment fractions range from 0 (indicating no individuals will be enrolled in treatment
via this method) to 1 (indicating all individuals will be enrolled via this method) and are constrained such
that the sum of all enrollment method fractions must sum to 1. The primary model outcome is the annual
HCV incidence rate (per 1000 person-years) for a given year, which is defined as:
The denominator represents the total number of individuals that are eligible to become infected, while
the numerator is simply the number of new daily infections. This formulation assumes that treated
individuals can be re-infected after cure and that re-infected individuals are treated as new infections, which
is included in the numerator of infected daily.
1011
Tatara, Collier, Ozik, Gutfraind, Cotler, Dahari, Major, and Boodram
To examine the effects of enrollment rate scale up and the distribution of enrollment method fractions,
a series of simulation experiments were conducted using high-performance computing workflows
implemented with the EMEWS framework (Ozik et al. 2016). EMEWS is built on the general-purpose
parallel scripting language Swift/T (Wozniak et al. 2013), which provides the capability of running multi-
language software tasks on anywhere from desktops to peta-scale plus computing resources. While
EMEWS enables running very large, a priori defined parameter sweeps, it also utilizes high-level queue-
like interfaces to integrate more sophisticated and dynamic parameter sampling methods. Here we exploit
this capability by plugging in a Python-based multi-objective optimization algorithm via EQ/Py (EMEWS
Queues for Python). The simulation experiments were conducted on the Bebop cluster run by the
Laboratory Computing Resource Center at Argonne National Laboratory.
3 SIMULATION RESULTS
3.1 Effect of Enrollment Rate on HCV Incidence
To first understand the effect of the enrollment rate parameter on the incidence with random recruitment,
four annual target enrollment rate scenarios were simulated (2.5%, 5%, 7.5%, 10%) with 20 stochastic
replicates each for DAA scale-up. The stochastic elements of the model include those already described,
and additional model parameters described in detail previously (Gutfraind et al. 2015). Each simulation
required approximately one hour of wall time to complete. Using the EMEWS workflow on the Bebop
cluster, the actual compute time was close to one hour since all runs can execute in parallel.
The simulation start date of 2010 was selected based on the PWID demographic data from multiple
surveys in previous years (Gutfraind et al. 2015). The model time step is one day, and treatment enrollment
is started in year 2020 and run until year 2030, with detailed model data collected at daily intervals. We
report the mean incidence per 1000 person-years relative to the mean baseline incidence rate in year 2020
with no treatment (enrollment rate of 0%). The mean relative incidence rate and 95% confidence interval
of the mean incidence rate is determined from the 20 stochastic runs.
The time-series relative incidence rate for the four enrollment rate percentages for random recruitment
are shown in Figure 1. Annual target treatment enrollment rates of 2.5% did not reduce the incidence rate
relative to the year 2020 incidence rate before enrollment started. Enrollment rates ≥ 5% reduced the
incidence rate to near zero percent at year 2029 (5% enrollment), 2026 (7.5% enrollment), and 2025 (10%
enrollment). The initial increase in incidence during the first few years after treatment begins in 2020 is due
to re-infections of treated individuals.
3.2 Large-Scale Parameter Sweep for Enrollment Rate and Treatment Methods
The effect of enrollment rate on HCV incidence for random recruitment provides some insight into how the
total number of randomly selected and treated PWID impacts the incidence over time. This relatively small
parameter space exploration consisting of 80 simulation runs is feasible even on small workstations given
enough time. However, the inclusion of additional simulation parameters led to a rapidly expanding
parameter space requiring a truly large-scale model exploration approach.
We next investigated the effects of all treatment enrollment recruitment methods described in Section
2.3, as a function of the annual target enrollment rate and as a function of the probability of PWIDs being
selected into a specific enrollment method. The treatment method fractions were determined by distributing
5 units of 20% across the five different treatment methods, resulting in 126 possible treatment method
combinations. The annual target enrollment rate has four discrete states: 0.025 (2.5%), 0.05 (5%), 0.075
(7.5%), and 0.1 (10%). The total parameter space contained unique combinations of the
enrollment rate and individual treatment probabilities. For these parameter combinations, we generated 10
random runs, each with a unique random seed, for a total of 5040 simulation runs. The same 10 random
seeds were used for each parameter combination. The mean incidence at year 2030 was calculated for each
parameter combination.
1012
Tatara, Collier, Ozik, Gutfraind, Cotler, Dahari, Major, and Boodram
Figure 1: Projected relative HCV mean incidence among PWID relative to the predicted 2020 incidence
during DAA scale-up for random recruitment and annual target enrollment rates from 2.5-10%. The ribbons
represent the 95% confidence interval around the mean, based on 20 stochastic simulation runs.
Of the 504 experiment combinations, 150 had incidence rates at year 2030 greater than zero, while the
remaining 354 had final incidence rates at or very close to zero. Table 1 shows the treatment count and
costs, and enrollment rate and method percentages for 15 model runs with the lowest treatment counts and
a zero incidence rate in year 2030. The incidence is fully minimized since the rate must be ≥ 0. The
treatment counts across all 504 experiments with zero incidence in year 2030 range from the fewest number
of treatments (11,949) and cost ($298,725,000) to the most number of treatments (13,039) and cost
($325,975,000) (not shown). The cost different between the model runs with zero incidence in year 2030
is $27,250,000 due to the large difference in the number of treatments that result in the same incidence.
Considering the large number of parameter combinations that result in near-zero relative incidence at
year 2030, selecting the “best” set of treatment enrollment strategies can be challenging. Along with
minimizing HCV incidence, minimization of the number of treatments, and thus total cost of treatments,
should be considered. Finally, it’s possible that the desired optimal set of treatment enrollment parameters
was missed entirely due to the fact that the parameter space was discretized on a coarse grid, yet still
required 5,000 core-hours of computation on a High Performance Computing (HPC) resource.
3.3 Model Exploration with a Multi-Objective Evolutionary Genetic Algorithm
The discretized parameter space partitioning produces a large set of model output solutions from which to
select the most desirable, depending on the selection criteria. One approach to more strategically navigate
the model parameter is through the use of sequential optimization techniques that optimize model outputs.
For an ABM like HepCEP in which it is desired to minimize both the number of treatments (cost) and the
HCV incidence, there is not a convenient single objective function evaluation since both objectives need to
be minimized independently. Therefore, a multi-objective optimization approach is needed.
Evolutionary model exploration (ME) approaches can be used to find a set of model solutions that
satisfy a multi-objective optimization problem. Genetic algorithms (GA) are especially suited for
evolutionary exploration. In a typical GA, each individual in a population represents a specific set of model
parameters, consisting of a set of intrinsic properties (genotype), that manifest as a set of apparent
characteristics (phenotype) which are evaluated via a fitness function of model outputs that is optimized
(Holland 1992). Individuals with the best fitness are selected and their properties are combined in a
crossover operation producing offspring individuals that contain the combined characteristics of their
1013
Tatara, Collier, Ozik, Gutfraind, Cotler, Dahari, Major, and Boodram
parents. Through successive generations of parents and offspring, only the individuals with the best fitness
are selected, resulting in a population of individuals with a higher fitness than the initial population
Table 1: Total treatment counts and costs, and enrollment rate and enrollment method percentages for the
15 model runs with the lowest treatment counts and an incidence of zero in year 2030 for the parameter
sweep. Each row is a unique model run.
Treatment
Count
Treatment
Cost (1K $)
Enrollment
Rate (%)
HRP
(%)
Network
Full (%)
Network In-
Partner (%)
Network Out-
Partner (%)
Random
(%)
11,949
298,725
5
0
80
20
0
0
11,968
299,210
5
0
0
100
0
0
12,012
300,310
10
0
20
80
0
0
12,044
301,095
5
0
0
80
20
0
12,047
301,175
10
0
0
100
0
0
12,049
301,233
7.5
0
0
80
0
20
12,054
301,338
10
20
40
40
0
0
12,058
301,438
10
20
20
60
0
0
12,058
301,440
5
20
0
60
20
0
12,068
301,695
5
0
20
60
0
20
12,068
301,708
5
0
40
40
0
20
12,072
301,798
10
0
40
60
0
0
12,083
302,063
7.5
0
40
40
20
0
12,087
302,180
5
20
40
20
0
20
12,093
302,330
10
20
60
0
20
0
A GA can be used to discover the best combination of model parameters based on the fitness of the
model outputs. In the ME context, an individual in a GA population consists of a single set of input
parameters to be explored, e.g. a single model run. All other model input parameters that are constant are
not included in a GA individual. Unlike the parameter sweep, the model parameter space does not need to
be discretized and, instead, model parameters are constrained between minimum and maximum values,
from which the GA selects the parameters randomly when creating the initial population. For each GA
individual, a complete model simulation is run, and the model outputs are evaluated to determine the
individual’s fitness. At each generation, higher-fitness individuals are more likely to be picked to create
new offspring parameter sets which are subsequently used to run new simulations.
We use EMEWS EQ/Py to explore treatment enrollment combinations in HepCEP with a multi-
objective GA approach. We use the DEAP evolutionary computation framework (Fortin et al. 2012) Python
library to provide the GA components of the workflow. The NSGA-II algorithm (Deb et al. 2002) was
selected based on its ability to provide a set of Pareto-optimal solutions using multi-objective optimization,
and is included in the DEAP library. The NSGA-II algorithm maintains a constant population size over
successive generations by selecting only non-dominated individuals at each generation and discarding
lower fitness individuals. The set of non-dominated individuals comprises the Pareto front in the objective
space and are optimal in the sense that neither objective value can be further improved without worsening
the other, and represents the boundary between optimal and sub-optimal solutions in the parameter space.
1014
Tatara, Collier, Ozik, Gutfraind, Cotler, Dahari, Major, and Boodram
3.4 Optimal HepCEP Treatment Parameters with Evolutionary GA
As described above, the initial population of model parameters is randomly sampled based on the minimum
and maximum values for each treatment parameter, and we use the same parameter ranges for the
enrollment rate and treatment method combinations as in the parameter sweep. We use a GA population
size of 100 individuals, and each individual is repeated four times using a different random seed to account
for the noise in the model outputs. The objective function uses the mean values of the four stochastic runs
to assess each GA individual. The GA population is evaluated for 20 generations, where each generation
consists of 400 simulation runs. Thus the complete GA workflow results in 8,000 HepCEP simulation runs.
We note here that the GA parameters and resulting number of runs were selected based on the available
computational budget.
For each GA generation, we record the objective functions of the non-dominated GA individuals, which
include the number of treatments and HCV incidence at year 2030. Figure 2 shows the Pareto-optimal
solutions for each of the 20 GA generations, with each point representing a single model run that resulted
in the objective values for year 2030 incidence (along the y-axis) and total number of treatments (along the
x-axis). Initially the Pareto curve appears to smoothly span the parameter space, however later generations
have a discontinuous set of solutions, with a local cluster forming around treatment counts of around 3,000.
The remaining solutions on the Pareto front continue to vary for HCV incidence of 0 to 0.6, and treatment
counts of around 8,000 to 12,000 treated individuals.
The Pareto curve and treatment enrollment parameter values are combined in Figure 3 for selected GA
generations. As with Figure 2, each point is a Pareto-optimal solution, but we have now visualized the
treatment enrollment combinations as a pie chart for each solution. The fraction of each enrollment method
is displayed as a slice of the pie charts, and the enrollment rate is shown as a numerical annotation (in units
of %) on each solution. The first GA generation contains what appears to be an equal distribution of
enrollment types and enrollment rate parameter values. We also observe an expected relationship between
the enrollment rate parameter and the treatment count and HCV incidence, with larger enrollment rate
values resulting in more treatments, and lower incidence.
Successive generations produce more non-dominated GA individuals as evidenced by the increased
number of solutions in Figure 3. The enrollment rate parameter value is reduced significantly for the later
GA generations, while the total treatment count and incidence rates are maintained. The solution clustering
around treatment counts of 3,000 all use an enrollment rate of 1% which is the lower limit for this parameter.
The GA finds these pareto-optimal solutions, however from a practical point of view the incidence rates for
these solutions are too high to meet the year 2030 reduction targets. Additionally, we observe that for the
pareto-optimal solutions, the dominant treatment enrollment method is the in-partner network method, in
which a PWID who provides syringes to the randomly recruited individual is also enrolled for treatment.
The second most common enrollment methods are the full network and HRP enrollments.
3.5 Comparison of Parameter Sweep and GA Optimization Results
The GA pareto-optimal solutions in generation 20 (Figure 3) provide a well-defined boundary in the multi-
objective space that separates the optimal and sub-optimal solutions. The simpler parameter sweep
exploration shown in Table 1 provides a more limited view of the model objective space, so we directly
compare all of the solutions from the parameter sweep and GA optimization in Figure 4.
Again in Figure 4 we observe the same front of Pareto-optimal solutions produced by the GA shown
as blue circles, along with the parameter sweep solutions shown by black circles. The contrast between the
two model exploration approaches is significant; the parameter sweep results are more spread out over the
multi-objective space, and all of the simple sweep results fall entirely within the region of sub-optimal
solutions located above and to the right of the Pareto front solutions. Furthermore, the sub-optimal sweep
solutions tend to cluster around two locations in the objective space. First, there is a large cluster of solutions
located around a treatment count of approximately 9,000. These solutions have a markedly lower treatment
count that the results shown in Table 1, however the objective for year 2030 incidence for these solutions
1015
Tatara, Collier, Ozik, Gutfraind, Cotler, Dahari, Major, and Boodram
is poor, ranging from 0.6 to 1.0, indicating these combinations of enrollment parameters do not result in the
desired 90% reduction in incidence by year 2030.
Figure 2: Pareto-optimal solutions for each of the 20 GA generations. Each point represents a single model
run in the 2D parameter space for the year 2030 relative incidence rate and the total treatment count.
The second notable clustering of solutions in Figure 4. for the parameter sweep are for solutions with
a year 2030 incidence rate at, or near, zero. These solutions are reflected in the results shown in Table 1.
The sweep solution results with a year 2030 zero incidence rate certainly meet the incidence reduction
goals, though there is considerable variance in the number of treatment counts for these solutions. The GA
pareto-optimal solutions, in contrast to the sweep solutions, result in only optimal solutions for the treatment
counts with year 2030 incidence rates of zero. The best GA optimal solution with zero incidence in year
2030 has 11,818 total treatments, compared to the best parameter sweep solution of 11,949 treatments,
representing a cost savings of $3,275,000 between the two solution approaches.
4 CONCLUSIONS
The HepCEP model provides meaningful insight into the effect of DAA treatment administration on HCV
incidence among PWID over time. Brute-force parameter sweeps with HepCEP were shown to partly
characterize the model parameter space. An evolutionary GA model approach to parameter space
exploration was shown to strategically produce a model solution set of desired characteristics, with results
1016
Tatara, Collier, Ozik, Gutfraind, Cotler, Dahari, Major, and Boodram
that are easier to interpret. Moreover, the GA-based Pareto-optimal solution curves do not provide a single
best solution for reaching the WHO goal, but rather a range of treatment enrollment solutions that could
inform the development of public health policy and strategies that would most effectively promote HCV
elimination among PWID. Although we selected the WHO goal of 90% reduction in incidence by year
2030 as the criterion for treatment enrollments, the GA approach does provide other potentially useful
solutions that allow approaching this goal. In practice, where additional outcome constraints exist, such as
the unavailability of certain treatment methods, the GA approach could be used to select available treatment
methods that result in the lowest possible incidence. The GA approach was also shown to elucidate
enrollment solutions that achieved the target goal at substantially lower cost than the parameter sweep.
Figure 3: Pareto-optimal solutions for GA generations 1, 6, 12, and 20. Each pie-point represents a single
model run in the 2D parameter space for the year 2030 relative incidence rate and the total treatment count.
The treatment enrollment percentage for each solution is shown as an individual pie chart at each solution
point. The numeric labels on each point refer to the treatment enrollment percentage for the solution. The
red dashed horizontal line represents the threshold for 90% reduction in incidence in year 2030.
The treatment enrollment methods that achieve a 90% reduction in incidence by year 2030 are those
that largely combine in-network, full-network recruitment, and harm reduction programs, i.e., policies that
are targeted to specific sub-populations of PWID (e.g., individual with an injection network). Interestingly,
(Zelenev et al. 2018) showed that random strategies were the most effective approach to reduce HCV
prevalence, which is in agreement with a network-based analysis done outside the USA (Hellard et al.
1017
Tatara, Collier, Ozik, Gutfraind, Cotler, Dahari, Major, and Boodram
2014). In contrast, HepCEP showed that, in the Chicago context, exploiting PWID network structures by
targeting individuals who may have transmitted the infection to the primary PWID resulted in the lowest
number of treatments and incidence.
Figure 4: Multi-objective parameter space for all of the parameter sweep solutions (black circles) and GA
generation 20 Pareto front solutions (blue circles). The GA solutions provides a discontinuous Pareto-
optimal front of solutions. Solutions falling above and to the right of the Pareto front are sub-optimal with
respect to the objectives for incidence and treatment count. The red dashed horizontal line represents the
threshold for 90% reduction in incidence in year 2030.
The current multi-objective GA exploration only considers the HCV incidence at year 2030, and the
total number of treatments. However, based on the existing simulation results, it’s clear that the relative
incidence rate in PWID can be reduced to approximately zero before year 2030 depending on the
combination of treatment enrollment methods and enrollment rates. Future studies should include more
multi-objective criteria that account for incidence rates at each year, so that treatment combinations and
enrollment rates that result in earlier reduction of incidence are preferred to those that only reduce incidence
by the last year of treatment. A more complex evaluation of cost may also consider the length of time that
treatment enrollment programs are operating versus an increased initial enrollment rate, along with variable
enrollment rates per year and incorporate estimates of operating costs associated with specific recruitment
strategy activities such as outreach approaches and testing.
ACKNOWLEDGMENTS
This research is supported by NIH grant R01GM121600 and is based upon work supported by the U.S.
Department of Energy, Office of Science, under contract DE-AC02-06CH11357, and was completed with
resources provided by the Research Computing Center at the University of Chicago (Midway2 cluster) and
the Laboratory Computing Resource Center at Argonne National Laboratory (Bebop cluster).
REFERENCES
Alter, M. J. 2007. "Epidemiology of Hepatitis C Virus Infection". World Journal of Gastroenterology 13 (17):2436-2441.
Bethea, E. D., Q. Chen, C. Hur, R. T. Chung, and J. Chhatwal. 2018. "Should We Treat Acute Hepatitis C? A Decision and Cost-
Effectiveness Analysis". Hepatology 67 (3):837-846.
1018
Tatara, Collier, Ozik, Gutfraind, Cotler, Dahari, Major, and Boodram
Bruggmann, P. 2013. "Treatment as Prevention: The Breaking of Taboos Is Required in the Fight against Hepatitis C among People
Who Inject Drugs". Hepatology 58 (5):1523-1525.
Colllier, N., and M. North. 2013. "Parallel Agent-Based Simulation with Repast for High Performance Computing". Simuation 89
(10):1215-1235.
Dahari, H., and B. Boodram. 2018. "How to Eliminate Hcv in People Who Inject Drugs in the USA". The Lancet Infectious Diseases
18 (2):134-135.
Deb, K., A. Pratap, S. Agarwal, and T. Meyarivan. 2002. "A Fast and Elitist Multiobjective Genetic Algorithm: Nsga-Ii". IEEE
Transactions on Evolutionary Computation 6 (2):182-197.
El-Serag, H. B. 2012. "Epidemiology of Viral Hepatitis and Hepatocellular Carcinoma". Gastroenterology 142 (6):1264-1273.
Fortin, F.-A., F.-M. De Rainville, M.-A. Gardner, M. Parizeau, and C. Gagné. 2012. "Deap: Evolutionary Algorithms Made Easy".
Journal of Machine Learning Research 13 (1):2171-2175.
Gutfraind, A., B. Boodram, N. Prachand, A. Hailegiorgis, H. Dahari, and M. E. Major. 2015. "Agent-Based Model Forecasts Aging
of the Population of People Who Inject Drugs in Metropolitan Chicago and Changing Prevalence of Hepatitis C Infections".
PLOS ONE 10 (9):e0137993.
Hellard, M., D. A. Rolls, R. Sacks-Davis, G. Robins, P. Pattison, P. Higgs, C. Aitken, and E. McBryde. 2014. "The Impact of
Injecting Networks on Hepatitis C Transmission and Treatment in People Who Inject Drugs". Hepatology 60 (6):1861-1870.
Holland, J. H. 1992. Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control,
and Artificial Intelligence, Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to
Biology, Control, and Artificial Intelligence. Cambridge, MA: MIT Press
Ozik, J., N. T. Collier, J. M. Wozniak, and C. Spagnuolo. 2016. From Desktop to Large-Scale Model Exploration with Swift/T.
2016 Winter Simulation Conference (WSC), 11-14 Dec. 2016, Washington, DC.
Tempalski, B., E. R. Pouget, C. M. Cleland, J. E. Brady, H. L. Cooper, H. I. Hall, A. Lansky, B. S. West, and S. R. Friedman. 2013.
"Trends in the Population Prevalence of People Who Inject Drugs in Us Metropolitan Areas 1992-2007". PLoS One 8
(6):e64789.
Wozniak, J. M., T. G. Armstrong, M. Wilde, D. S. Katz, E. Lusk, and I. T. Foster. 2013. Swift/T: Large-Scale Application
Composition Via Distributed-Memory Dataflow Processing. 2013 13th IEEE/ACM International Symposium on Cluster,
Cloud, and Grid Computing, 13-16 May 2013.
Zelenev, A., J. Li, A. Mazhnaya, S. Basu, and F. L. Altice. 2018. "Hepatitis C Virus Treatment as Prevention in an Extended
Network of People Who Inject Drugs in the USA: A Modelling Study". Lancet Infectious Diseases 18 (2):215-224.
AUTHOR BIOGRAPHIES
ERIC TATARA, PhD, PE, is a member of the Consortium for Advanced Science and Engineering of the University of Chicago
and a software engineer in the Decision and Infrastructure Sciences Division of Argonne National Laboratory. His email address
is tatara@anl.gov.
NICHOLSON T. COLLIER, PhD, is a member of the Consortium for Advanced Science and Engineering of the University of
Chicago and a software engineer in the Decision and Infrastructure Sciences Division of Argonne National Laboratory. His email
address is ncollier@anl.gov.
JONATHAN OZIK, PhD, is a Senior Scientist in the Consortium for Advanced Science and Engineering of the University of
Chicago and a computational scientist in the Decision and Infrastructure Sciences Division of Argonne National Laboratory, and
Lecturer at the Harris School of Public Policy, University of Chicago. His email address is jozik@anl.gov.
ALEXANDER GUTFRAIND, PhD, is a Lecturer at the Division of Hepatology, Department of Medicine, Loyola University
Chicago, Chicago, IL. His email address is agutfraind.research@gmail.com.
SCOTT J. COTLER, MD, is the Director of the Division of Hepatology at Loyola University Medical Center. His email address
is scott.cotler@luhs.org.
HAREL DAHARI, PhD, is an Associate Professor and co-direct a cross-disciplinary Program for Experimental & Theoretical
Modeling (PETM) in the Division of Hepatology at Loyola University Medical Center. His email address is hdahari@luc.edu.
MARIAN MAJOR, PhD, is currently Chief of the Laboratory of Hepatitis Viruses in the Center for Biologics Research and
Evaluation, US Food and Drug Administration. Her email address is marian.major@fda.hhs.gov.
BASMATTEE BOODRAM, PhD, is an Associate Professor in the Division of Community Health Sciences, School of Public
Health, University of Illinois at Chicago. Her email address is bboodram@uic.edu.
1019