ArticlePDF Available

Major subpopulations of Plasmodium falciparum in sub-Saharan Africa

Authors:
  • Medical Research Council Unit The Gambia at LSHTM

Abstract and Figures

Understanding genomic variation and population structure of Plasmodium falciparum across Africa is necessary to sustain progress toward malaria elimination. Genome clustering of 2263 P. falciparum isolates from 24 malaria-endemic settings in 15 African countries identified major western, central, and eastern ancestries, plus a highly divergent Ethiopian population. Ancestry aligned to these regional blocs, overlapping with both the parasite’s origin and with historical human migration. The parasite populations are interbred and shared genomic haplotypes, especially across drug resistance loci, which showed the strongest recent identity-by-descent between populations. A recent signature of selection on chromosome 12 with candidate resistance loci against artemisinin derivatives was evident in Ghana and Malawi. Such selection and the emerging substructure may affect treatment-based intervention strategies against P. falciparum malaria.
Content may be subject to copyright.
MALARIA
Major subpopulations of Plasmodium
falciparum in sub-Saharan Africa
Alfred Amambua-Ngwa
1
, Lucas Amenga-Etego
2
, Edwin Kamau
3,4
, Roberto Amato
5,6
,
Anita Ghansah
7
, Lemu Golassa
8
, Milijaona Randrianarivelojosia
9
, Deus Ishengoma
10
,
Tobias Apinjoh
11
, Oumou Maïga-Ascofaré
12
, Ben Andagalu
3
, William Yavo
13
,
Marielle Bouyou-Akotet
14
, Oyebola Kolapo
1,15
, Karim Mane
1
, Archibald Worwui
1
,
David Jeffries
1
, Vikki Simpson
4,6
, Umberto DAlessandro
1
,
Dominic Kwiatkowski
5,6
, Abdoulaye A. Djimde
5,16
*
Understanding genomic variation and population structure of Plasmodium falciparum
across Africa is necessary to sustain progress toward malaria elimination. Genome
clustering of 2263 P. falciparum isolates from 24 malaria-endemic settings in
15 African countries identified major western, central, and eastern ancestries, plus a
highly divergent Ethiopian population. Ancestry aligned to these regional blocs,
overlapping with both the parasites origin and with historical human migration. The
parasite populations are interbred and shared genomic haplotypes, especially across drug
resistance loci, which showed the strongest recent identity-by-descent between
populations. A recent signature of selection on chromosome 12 with candidate resistance
loci against artemisinin derivatives was evident in Ghana and Malawi. Such selection
and the emerging substructure may affect treatment-based intervention strategies
against P. falciparum malaria.
The worldwide decline in malaria prevalence
is now stalling and additional knowledge,
new tools, and intervention strategies
will be needed for global malaria elimi-
nation and eradication (1). The burden of
Plasmodium falciparum malaria in particular
remains substantial in sub-Saharan Africa (sSA),
where it involves various vectors and human
populations (2,3). Although interventions have
reduced and disconnected malaria parasite pop-
ulations, they may be driving selection, adapta-
tion, and population fragmentation. Population
fragmentation and reduced diversity can be as-
sessed for refining approaches or tools for elim-
ination (4). Therefore, it is important to determine
the effect of large-scale control interventions on
the structure of the parasite population, which
until recently was considered to be highly diverse
and homogeneously interconnected in sSA (5).
The ancestry, current structure, and gene flow
between different P. falciparum populations
across sSA remain unclear. Previous studies
have used single-nucleotide polymorphism (SNP)
markers to characterize specific geographic
populations and describe genomic variation
and signatures of selection in sSA (6,7). Re-
cent higher-density genomic polymorphisms
from next-generation sequencing technologies
can further resolve African P. falciparum sub-
populations and population-specific genomic
signatures.
The Plasmodium Diversity Network Africa
(PDNA) conducts P. falciparum genomic sur-
veillance across sSA, from the West Atlantic
coastal regions with their high rainfall and
perennial transmission; the Sahel with its short
rainy seasons and seasonal transmission; Central
Africa with its forest-covered areas and perennial
transmission; Eastern Africa with its perennial
and seasonal transmission; to Ethiopia and the
RESEARCH
Amambua-Ngwa et al., Science 365, 813816 (2019) 23 August 2019 1of4
1
Medical Research Council Unit The Gambia at LSHTM,
Banjul, The Gambia.
2
West African Centre for Cell Biology of
Infectious Pathogens (WACCBIP), University of Ghana, Accra,
Ghana.
3
United States Army Medical Research Directorate-
Africa, Kenya Medical Research Institute/Walter Reed
Project, Kisumu, Kenya.
4
Walter Reed Army Institute of
Research, U.S. Military HIV Research Program, Silver Spring,
MD, USA.
5
Wellcome Sanger Institute, Hinxton, UK.
6
MRC
Centre for Genomics and Global Health, Big Data Institute,
University of Oxford, Oxford, UK.
7
Noguchi Memorial Institute
for Medical Research (NMIMR), Accra, Ghana.
8
Aklilu Lemma
Institute of Pathobiology, Addis Ababa University, Addis
Ababa, Ethiopia.
9
Institut Pasteur of Madagascar, Antanarivo,
Madagascar.
10
National Insti tute for Medical Research
(NIMR), Tanga, Tanzania.
11
Department of Biochemistry and
Molecular Biology, University of Buea, Buea, Cameroon.
12
Bernhard Nocht Institute for Topical Medicine (BNITM),
Hamburg, Germany.
13
Unite des Sciences Pharmaceutiques
et Biologiques, University Félix Houphouët-Boigny, Abidjan,
Côte dIvoire.
14
Faculty of Medicine, University of Health
Sciences, Libreville, Gabon.
15
Department of Zoology,
University of Lagos, Lagos, Nigeria.
16
Malaria Research and
Training Centre, University of Science, Techniques and
Technologies of Bamako, Bamako, Mali.
*Corresponding author. Email: adjimde@icermali.org
Fig. 1. Sites, sample sizes,
and genetic groupings
of P. falciparum isolates
across PDNA and Pf3K
studies in Africa.
(A)Sites,P. falciparum
(Pf) prevalence rate, and
studies from which
SNP data of 2263 isolates
were accessed. Map
was extracted from a
malaria atlas showing
P. falciparum prevalence
as brown density within
the ranges of the key
(https://map.ox.ac.uk/
explorer/#/). (B)Com-
plexity of infections
by inbreeding coefficient
(Fws). (C) Scatter plot
from multidimensional
scaling of tess3r
ancestry coefficients
for six predicted
ancestral populations.
on August 24, 2019 http://science.sciencemag.org/Downloaded from
island of Madagascar with their cotransmission
of P. vivax (8). Using high-resolution genome-
wide SNP variants of P. falciparum isolates
across sSA, we reveal the population structure,
admixture, markers of identity-by-descent (IBD),
differentiation, and signatures of selection.
SNP variants (29,998) were extracted from
whole-genome sequences of 2263 P. falciparum
isolates sampled from across 15 African coun-
tries (Fig. 1A and tables S1 and S2). At least 55%
of infections were polygenomic, with up to nine
clones in some infections from Ghana, Guinea,
and Malawi (fig. S1). The proportion of complex
infections [i.e., lower mean inbreeding coefficient
(Fws)] was highest in Kenya and lowest in
Ethiopia (Fig. 1B). Malaria transmission around
the sampling site in Kenya (Kisumu, Western
Kenya) was stable and high (9), probably driving
the high infection complexity. In West Africa,
isolates from The Gambia and Senegal were the
least complex, confirming earlier reports of a
decline in complexity with decreasing preva-
lence, probably due to the scale-up of inter-
ventions (10).
Standard principal components analysis, using
imputed genome haplotypes (fig. S2), resolved
three major groups: western (West Africa and the
more-central countries of Cameroon and Gabon),
eastern [Democratic Republic of the Congo (DR
Congo) and all other sites in East Africa], and a
Amambua-Ngwa et al., Science 365, 813816 (2019) 23 August 2019 2of4
Fig. 2. Genome-wide ancestry proportions. Ancestry proportions for P. falciparum isolates (admixture-like bar plots) or populations (pie charts)
modeled to include donors from all sites (incl. self) or excluding isolates from recipient sampling site (without self). (A) Ancestry per isolate (rows) from
each sampling site (left column). (B) Median ancestry from each sampling site. (C) Median ancestry proportions between isolates from each sampling
site, excluding donors from same site. Country colors are the same as in Fig. 1.
Fig. 3. Genome-wide ancestry proportions for P. falciparum populations in sSA. (A) Ancestry proportions for regional genetic blocs (left column).
Ancestry proportions for each genetic cluster (B) including self-copying and (C) without self-copying.
RESEARCH |REPORT
on August 24, 2019 http://science.sciencemag.org/Downloaded from
distinct Ethiopian population (fig. S3). This sub-
structure was refined to six distinct clusters from
multidimensional scaling of ancestral member-
ship coefficients, splitting DR Congo from East
African populations (Fig. 1C and fig. S4). The six
retained genetic clusters were West African (WAF;
Senegal, Gambia, Guinea, Mali, Côte dIvoire,
Ghana, and Nigeria), Central African (CAF;
Cameroon and Gabon), South Central African
(SCAF; DR Congo), East African (EAF; Kenya
and Tanzania), Southeast African (SEAF; Malawi
and Madagascar), and the Horn of Africa (HAF;
Ethiopia).
Each cluster suggests an ancestral or trans-
mission connectivity supported by geographic
proximity and confirmed by significant isola-
tion by distance (P= 0.03, Mantel test) (fig. S5).
The major population continuums were within
West Africa and East Africa, with several-fold
difference in genetic distance [all fixation index
(F
ST
) values > 0.1] between them and Ethiopia.
Differentiation might also result from differences
in human and vector populations, the history
of interventions on spatial separation, and geo-
graphic barriers (e.g., western Cameroon forest,
the equatorial forest, Congo Basin rivers, and
highlands of Ethiopia). Isolates from DR Congo
and Ethiopia clustered away from geographically
proximal sites in CAF and EAF, respectively.
Human populations from Ethiopia and other
HAF sites, such as Djibouti, have a distinct an-
cestry from the rest of Africa, allowing sympat-
ric transmission of P. vivax, with earlier reports
of divergent P. falciparum populations (11,12).
As in Madagascar, HAF human populations have
higher frequencies of the Duffy antigen, allowing
P. vivax cotransmission. However, isolates from
Madagascar clustered with those from Malawi,
indicating mainland ancestry despite a high pro-
portion of human populations originating from
Southeast Asia and being separated by 1400 km
of land and the Indian Ocean. Therefore, it is not
likely that the divergence of HAF isolates is due
to co-prevalence with P. vivax but might be
driven by other factors such as differences in
vector populations. This could also explain the
differentiation between Congolese and other CAF
isolates where vector populations differ, with
Anopheles funestus being relatively dominant in
DR Congo (13).
Recent studies have shown that P. falciparum
from western great apes jumped into humans
about 10,000 years ago, prior to major human
migrations (14,15). The donation of ancestral
genome chunks from CAF to both western and
eastern P. falciparum populations aligns with
such an origin and the spread of malaria through
historical and more recent human migration in
Africa. Recent human migration brought on by
colonization and slavery may have resulted
in P. falciparum ancestral chunks shared be-
tween distal French colonies like Cameroon,
Mali, and Senegal, whereas ancestry from WAF
sites of Mali, Guinea, and Senegal are present in
DR Congo (Fig. 2 and fig. S6). However, historical
links prior to dispersal of humans and parasites
to West and East Africa may also account for the
shared ancestry between all major population
blocs (Fig. 3). The early human migration from
Central Africa, after the emergence of malaria
in humans, was dominated by Bantu popula-
tions moving westward and southeastward (16).
T-SNE and fineSTRUCTURE clustering of an-
cestral chunk matrices also maintained the
major West and East African subpopulations,
further indicating that isolates from DR Congo
share more eastern ancestry (figs. S7 and S8). Hu-
man population mixing could have facilitated
P. falciparum gene flow, IBD signatures, and
spread of adaptive alleles across Africa (17).
The proportions of isolates sharing IBD (<3%)
was weak and uneven across the genome, as ex-
pected for intensely recombining parasite pop-
ulations (Fig. 4A and fig. S9). However, relatively
high IBD proportions spanned 12 segments of the
genome, including regions coding for candidate
drug resistance loci; Pfaat1 (PF3D7_0629500) on
chromosome 6; known drug resistance genes
Pfmdr1, Pfcrt, and Pfdhps; anda cluster of genes
on chromosome 12 (Pfap2mu, PfATPase, and
Pfap2g2). These genes are involved in drug re-
sponses, transportation, and metabolism (fig. S10).
These results confirm links between Pfcrt and
Pfaat1, which together with Pfap2g2 and PfAT-
Pase2 have been identified as part of the malaria
druggablegenome(18). Pfap2mu in particular
has been linked to artemisinin tolerance in
Africa (19). Strong IBD around Pfap2mu in Ghana
and Malawi (Fig. 4B) may have emerged inde-
pendently and calls for increased vigilance
against artemisinin-based combination therapy
(ACT) efficacy. The introduction or local emer-
gence and sharing of candidate drug resistance
haplotypes would be recent, as IBD detection
was limited to 25 generations. Haplotype paint-
ing across drug resistance loci (table S6) empha-
sized bidirectional gene flow across these loci
(fig. S11). Multiple origins of antifolate markers
were confirmed (20) but also seen for Pfmdr1,
which showed two ancestral lineages dominant
in West and East African populations, respec-
tively (fig. S12). Multiple emergence for a major
quinolone resistance mediator such as Pfmdr1
Amambua-Ngwa et al., Science 365, 813816 (2019) 23 August 2019 3of4
Fig. 4. Pairwise IBD between isolates across sites. (A) Manhattan plot of median IBD between
pairs of P. falciparum isolates, showing each chromosome as numbered on the xaxis. IBD segment
peaks labeled for dihydrofolate reductase (dhfr), multidrug resistance protein 1 (mdr1), amino
acid transporter 1 (aat1), chloroquine resistance transporter (crt), dihydropteroate synthetase
(dhps), AP2 domain transcription factors (ap2-g2 and ap2-mu), and aminophospholipid-
transp orting P-ATPase (atpase2). (B) Heatmap of pairwise IBD between sampled populations
clustered on rows for similar patterns between populations. SNP values are in columns
separated by chromosomes for each pair of populations in rows. Low to high values are color
graded from blue to red on RGB color wheel.
RESEARCH |REPORT
on August 24, 2019 http://science.sciencemag.org/Downloaded from
has not been previously reported. Selection,
emergence, and spread of resistance to drugs is
therefore possible in all malaria endemic sites
across sSA. These findings are important because
artemisinin resistance may emerge independently
in sSA and not necessarily spread from Southeast
Asia. This calls for careful surveillance of artemis-
inin resistance in sSA, where drug pressure from
ACT and seasonal malaria chemoprevention with
sulfadoxine-pyrimethamine and amodiaquine are
being scaled up for elimination. These would also
lead to population differentiation (fig. S13) and
positive selection that could facilitate the devel-
opment of clinical drug resistance.
SNPs related to drug resistance, erythrocyte
invasion, gametocytogenesis, oocyst development,
and antigenic loci were the most differentiated
between populations (fig. S14, A and B, and tables
S7 and S8). These could be due to different envi-
ronmental conditions and varying human and
mosquito populations. Known drug loci (Pfaat1,
Pfmdr1, Pfcrt, Pfdhfr, and Pfdhps) and the IBD
cluster on chromosome 12 showed signatures of
positive selection and haplotype differentiation
across sampled populations (figs. S14, C and D,
S15, and S16, and tables S9 and S10). It would be
important to determine whether variants at these
loci can compromise the efficacy of artemisinins
and/or ACTs.
P. falciparum in sSA is clustered into major
western, central, and eastern subgroups and a
highly divergent Ethiopian subpopulation. These
endogenous genomic lineages are the ancestral
backbone on which adaptive loci such as drug
resistance mutations may have emerged, recom-
bined, and been shared both westerly and easterly
across sSA. This may occur again against current
artemisinin-based treatments, which are already
directionally selecting loci on chromosome 12.
These signal the need for broader molecular
and phenotypic surveillance of P. falciparum in
sSA, including the large swathes of endemic pop-
ulations in Central Africa, where civil strife and
other global health pathogen epidemics could
maintain malaria and threaten elimination efforts.
REFERENCES AND NOTES
1. World Malaria Report 2018 (World Health Organization, 2018).
2. Anopheles gambiae 1000 Genomes Consortium, Nature 552,
96100 (2017).
3. D. Gurdasani et al., Nature 517, 327332 (2015).
4. T. G. Anthony et al., J. Infect. Dis. 191, 15581564 (2005).
5. M. Manske et al., Nature 487, 375379 (2012).
6. A. Amambua-Ngwa et al., Sci. Rep. 8, 9687 (2018).
7. C. W. Duffy et al., Sci. Rep. 8, 15763 (2018).
8. A. Ghansah et al., Science 345, 12971298 (2014).
9. A. Kapesa et al., PLOS ONE 13, e0202031 (2018).
10. A. K. Bei et al., J. Infect. Dis. 217, 622627 (2018).
11. J. A. Hodgson, C. J. Mulligan, A. Al-Meeri, R. L. Raaum,
PLOS Genet. 10, e1004393 (2014).
12. H. Bogreau et al., Am. J. Trop. Med. Hyg. 74, 953959 (2006).
13. M. E. Sinka et al., Parasit. Vectors 5, 69 (2012).
14. D. E. Loy et al., Int. J. Parasitol. 47,8797 (2017).
15. D. A. Joy et al., Science 300, 318321 (2003).
16. E. Patin et al., Science 356, 543546 (2017).
17. L. Henden, S. Lee, I. Mueller, A. Barry, M. Bahlo, PLOS Genet.
14, e1007279 (2018).
18. A. N. Cowell et al., Science 359, 191199 (2018).
19. G. Henriques et al., Antimicrob. Agents Chemother. 59,
25402547 (2015).
20. T. Anderson, PLOS Med. 6, e1000054 (2009).
ACKNOW LEDGM ENTS
We thank the participants and local health workers from PDNA
sites. Special thanks to G. Busby for discussion and advising on
admixture analyses. Genome sequencing was done at the
Wellcome Sanger Institute as part of the MalariaGEN Plasmodium
falciparum Community Project (www.malariagen.net/projects).
We thank the MalariaGEN P. falciparum Community Project and
Pf3K Project for allowing access to non-PDNA data. We thank
K. Rockett, J. Stalker, R. Pearson, and other members of the
MalariaGEN resource center and the staff of Wellcome Sanger
Institute Sample Logistics, Sequencing, and Informatics facilities
for their contributions to sample processing, sequence data
generation, and variant calling pipelines. Funding: A.A.-N., L.A.-E.,
A.G., L.G., D.I., T.A., O.M.-A., B.A., Y.W., M.B.-A., and A.A.D. are
currently supported through the DELTAS Africa Initiative, an
independent funding scheme of the African Academy of Sciences
(AAS)s Alliance for Accelerating Excellence in Science in Africa
(AESA), and are also supported by the New Partnership for Africas
Development Planning and Coordinating Agency (NEPAD Agency)
with funding from Wellcome (DELGEME grant 107740/Z/15/Z) and
the U.K. government. Sample collection in Kenya was funded by
Armed Forces Health Surveillance Center (AFHSB) and its Global
Emerging Infections Surveillance (GEIS) Section, Grant P0209_15_
KY. The views expressed in this publication are those of the
authors and not necessarily those of AAS, NEPAD Agency,
Wellcome, the U.S. Army or the Department of Defense, or the U.K.
government. The investigators have adhered to the policies for
protection of human subjects as prescribed in AR-70. Sequencing
was undertaken in partnership with MalariaGEN and the
Parasites and Microbes program at the Wellcome Sanger Institute
with funding from Wellcome (206194; 090770/Z/09/Z) and by
the MRC Centre for Genomics and Global Health which is jointly
funded by the Medical Research Council and the Department
for International Development (DFID) (G0600718 to D.K.;
M006212). Author contributions: A.G., L.G., M.R., D.I., T.A.,
O.M.-A., B.A., Y.W., O.K., and M.B.-A. contributed samples and
reviewed the manuscript. A.A.-N. and L.A.-E. contributed samples,
conceived of the manuscript, executed data analysis, and
participated in the writing (A.A.-N.) and revision (L.A.-E.) of the
manuscript. E.K. reviewed the analysis and manuscript. R.A.
provided analytical support. K.M., A.W., and D.J. conducted data
analysis and reviewed the manuscript. V.S. coor dinated the
collaboration an d reviewed the m anuscript. U.D. read and
reviewed the manu script. D.K. led the team that generated data,
conceived of the m anuscript, and reviewed the analysis an d
manuscript. A.A .D. coordinated the consortium, contr ibuted
samples conceived of the manuscript, and read and reviewed the
manuscript. Competing interests: The authors declare no
competi ng interest. Data and materials availability: The short-
read sequences used in this publication are available in the ENA
and SRA databases (see table S2 for accession numbers). The
views expressed are those of the authors and should not be
construed to represent the positions of the U.S. Army or the
Department of Defense. The investigators have adhered to the
policies for protection of human subjects as prescribed in AR-70.
SUPPLEMENTARY MATERIALS
science.sciencemag.org/content/365/6455/813/suppl/DC1
Materials and Methods
Figs. S1 to S16
Tables S1 to S10
References (2131)
27 September 2018; accepted 5 July 2019
10.1126/science.aav5427
Amambua-Ngwa et al., Science 365, 813816 (2019) 23 August 2019 4of4
RESEARCH |REPORT
on August 24, 2019 http://science.sciencemag.org/Downloaded from
in sub-Saharan AfricaPlasmodium falciparumMajor subpopulations of
Dominic Kwiatkowski and Abdoulaye A. Djimde
Bouyou-Akotet, Oyebola Kolapo, Karim Mane, Archibald Worwui, David Jeffries, Vikki Simpson, Umberto D'Alessandro,
Randrianarivelojosia, Deus Ishengoma, Tobias Apinjoh, Oumou Maïga-Ascofaré, Ben Andagalu, William Yavo, Marielle
Alfred Amambua-Ngwa, Lucas Amenga-Etego, Edwin Kamau, Roberto Amato, Anita Ghansah, Lemu Golassa, Milijaona
DOI: 10.1126/science.aav5427
(6455), 813-816.365Science
, this issue p. 813; see also p. 752Science P. vivax.malaria parasite, , which may be indicative of coexistence with anotherP. falciparumand that Ethiopia has a distinctive population of
slavery. Furthermore, whole-genome sequencing showed that there is extensive gene flow among the different regions
signatures of selection by antimalarial drugs were detected, along with indications of the effect of colonization and
within Africa that is consistent with human and vector population divergence (see the Perspective by Sibley). Specific
of the Plasmodium Diversity Network Africa found substantial population structureet al.genomics, Amambua-Ngwa
important to know for grasping the risks and dynamics of the spread of drug resistance. Harnessing the power of
across Africa is poorly understood butPlasmodium falciparumThe population genetics of the malaria parasite
Ebb and flow of parasite populations
ARTICLE TOOLS http://science.sciencemag.org/content/365/6455/813
MATERIALS
SUPPLEMENTARY http://science.sciencemag.org/content/suppl/2019/08/21/365.6455.813.DC1
CONTENT
RELATED
http://stm.sciencemag.org/content/scitransmed/11/474/eaau1458.full
http://stm.sciencemag.org/content/scitransmed/11/495/eaav3963.full
http://science.sciencemag.org/content/sci/365/6455/752.full
REFERENCES http://science.sciencemag.org/content/365/6455/813#BIBL
This article cites 30 articles, 9 of which you can access for free
PERMISSIONS http://www.sciencemag.org/help/reprints-and-permissions
Terms of ServiceUse of this article is subject to the
is a registered trademark of AAAS.Science
licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. The title
Science, 1200 New York Avenue NW, Washington, DC 20005. 2017 © The Authors, some rights reserved; exclusive
(print ISSN 0036-8075; online ISSN 1095-9203) is published by the American Association for the Advancement ofScience
on August 24, 2019 http://science.sciencemag.org/Downloaded from
... Analyses of population structure in sub-Saharan Africa have shown high levels of genetic variations in hightransmission regions, with gradual genetic differentiation between east and west Africa. 10 Population structure can be observed at the margins of endemicity, in lower transmission regions such as The Gambia and the Horn of Africa. 10,11 To date, however, no published analyses have reported population structure driven by the selection of complex co-inherited multilocus genetic backgrounds. ...
... 10 Population structure can be observed at the margins of endemicity, in lower transmission regions such as The Gambia and the Horn of Africa. 10,11 To date, however, no published analyses have reported population structure driven by the selection of complex co-inherited multilocus genetic backgrounds. ...
... PCoA plots showed that the first component (PC1) was driven by the differentiation between parasites from west Africa and east Africa (figure 1), as reported previously. 10 Unexpectedly, the second component (PC2) was driven by a diverging cluster, which we named AF1, composed of parasites from multiple countries across Africa, rather than from sites in close geographical proximity. The broad geographical distribution of AF1, including regions of high transmission, suggests that population structure is not driven by low endemicity. ...
... The overall mean FWS value was 0.76, much lower than 1.0 and driven by low values from infections that contained very high levels of parasite genomic diversity. In more recent population samples from across Africa, such profiles have been seen only in areas with intense malaria transmission, whereas in moderate to low transmission areas within-infection diversity is much lower 29,30 . ...
Preprint
Understanding evolution of human pathogens requires looking beyond effects of recent interventions or known epidemiological changes. To study malaria parasites prior to widespread drug selection, Plasmodium falciparum genome sequences were analysed from the oldest set of archived research samples yet identified, placental blood collected in the Gambia between 1966 and 1971. Genomic complexity within infections was high, infections were genetically unrelated, and no drug resistance alleles were detected. Strong signatures of positive selection are clearly seen at multiple loci throughout the genome, most of which encode surface proteins that bind erythrocytes and are targets of acquired antibody responses. Comparison of population samples obtained over a following period of almost 50 years revealed major directional allele frequency changes at several loci apart from drug resistance genes. Exceptional changes over this time are seen at gdv1 that regulates the rate of parasite sexual conversion required for transmission, and at the unlinked Pfsa1 and Pfsa3 loci previously associated with infection of individuals with sickle-cell trait. Other affected loci encode surface and transporter proteins warranting targeted functional analyses. This identification of key long-term adaptions that have not reached equilibrium is important for understanding and potentially managing future evolution of malaria parasites.
... In such strategic decision-making and social interaction, we assume that individuals emulate others' activities. Specifically, they sample other members randomly at a constant rate, and if the payoff of the sampled person is higher, then the sampled strategy is adopted with a probability that is proportional to the expected gain in payoff [1,2]. It is assumed that individuals switch between the two strategies depending on the perceived benefits either from using ITNs properly or improperly. ...
Preprint
Full-text available
The WHO 2021 malaria report revealed that its Global Technical Strategy (GTS) 2020 milestones for morbidity and mortality, based on the 2015 baseline, have not been achieved globally -- the world is off-track by 42% and, can be extended up to 91% in 2030. Most of the Sub-Saharan African (SSA) countries failed to achieve GTS 2020 -- only 4 out of 40 highest burden countries met the goals. By fitting evolutionary game modeling to the malaria case and Insecticide-Treated Nets (ITN) usages data, we identify factors contributing to the GTS 2020 failures of 38 SSA countries. We use optimized projection of our model to evaluate further the potential achievement of GTS 2025 and 2030 objectives and discuss strategies for attaining goals in situations where these milestones seem unattainable. Our findings categorize all 38 countries based on the possibility to achieve their future milestone, either through increased campaigns, or by economic assistance, or through enhancing the efficacy of ITNs at minimal expense.
... genomic studies of P. falciparum have enabled monitoring of drug resistance markers 6 , facilitated the identification of promising vaccine candidates 7 , uncovered the structure of parasite populations 8 , and identified evolutionary forces shaping their demography 9,10 . Much less is known about non-falciparum species, especially their comparative evolutionary history and susceptibility to malaria control interventions focused on P. falciparum. ...
Article
Full-text available
Plasmodium ovale curtisi (Poc) and Plasmodium ovale wallikeri (Pow) are relapsing malaria parasites endemic to Africa and Asia that were previously thought to represent a single species. Amid increasing detection of ovale malaria in sub-Saharan Africa, we present a population genomic study of both species across the continent. We conducted whole-genome sequencing of 25 isolates from Central and East Africa and analyzed them alongside 20 previously published African genomes. Isolates are predominantly monoclonal (43/45), with their genetic similarity aligning with geography. Pow shows lower average nucleotide diversity (1.8×10⁻⁴) across the genome compared to Poc (3.0×10⁻⁴) (p < 0.0001). Signatures of selective sweeps involving the dihydrofolate reductase gene have been found in both species, as are signs of balancing selection at the merozoite surface protein 1 gene. Differences in the nucleotide diversity of Poc and Pow may reflect unique demographic history, even as similar selective forces facilitate their resilience to malaria control interventions.
... These mutations are distinct from those found in artemisinin-resistant Asian P. falciparum strains. These observations call into question the relevance of molecular surveillance targeting a single marker that is characteristic of Asian P. falciparum strains [16,17]. As East Africa has historically been the gateway to drug-resistant P. falciparum on the continent, it is urgent to document therapeutic failures after ACT in the region using suitable molecular tools. ...
Article
Full-text available
Djibouti is confronted with malaria resurgence, with malaria having been occurring in epidemic proportions since a decade ago. The current epidemiology of drug-resistant Plasmodium falciparum is not well known. Molecular markers were analyzed by targeted sequencing in 79 P. falciparum clinical isolates collected in Djibouti city in 2023 using the Miseq Illumina platform newly installed in the country. The objective of the study was to analyze the key codons in these molecular markers associated with antimalarial drug resistance. The prevalence of the mutant Pfcrt CVIET haplotype (92%) associated with chloroquine resistance and mutant Pfdhps-Pfdhfr haplotypes (7.4% SGEA and 53.5% IRN, respectively) associated with sulfadoxine-pyrimethamine resistance was high. By contrast, Pfmdr1 haplotypes associated with amodiaquine (YYY) or lumefantrine (NFD) resistance were not observed in any of the isolates. Although the “Asian-type” PfK13 mutations associated with artemisinin resistance were not observed, the “African-type” PfK13 substitution, R622I, was found in a single isolate (1.4%) for the first time in Djibouti. Our genotyping data suggest that most Djiboutian P. falciparum isolates are resistant to chloroquine and sulfadoxine-pyrimethamine but are sensitive to amodiaquine, lumefantrine, and artemisinin. Nonetheless, the presence of an isolate with the R622I PfK13 substitution is a warning signal that calls for a regular surveillance of molecular markers of antimalarial drug resistance.
Article
Background: Insecticide resistance challenges the vector control efforts towards malaria elimination and proving the development of complementary tools. Targeting the genes that are involved in mosquito fertility and susceptibility to Plasmodium with small molecule inhibitors has been a promising alternative to curb the vector population and drive the transmission down. However, such an approach would require a comprehensive knowledge of the genetic diversity of the targeted genes to ensure the broad efficacy of new tools across the natural vector populations. Methods: Four fertility and parasite susceptibility genes were identified from a systematic review of the literature. The Single Nucleotide Polymorphisms (SNPs) found within the regions spanned by these four genes, genotyped across 2784 wild-caught Anopheles gambiae s.l. from 19 sub-Saharan African (SSA) countries, were extracted from the whole genome SNP data of the Ag1000G project (Ag3.0). The population genetic analysis on gene-specific data included the determination of the population structure, estimation of the differentiation level between the populations, evaluation of the linkage between the non-synonymous SNPs (nsSNPs), and a few statistical tests. Results: As potential targets for small molecule inhibitors to reduce malaria transmission, our set of four genes associated with Anopheles fertility and their susceptibility to Plasmodium comprises the mating-induced stimulator of oogenesis protein (MISO, AGAP002620), Vitellogenin (Vg, AGAP004203), Lipophorin (Lp, AGAP001826), and Haem-peroxidase 15 (HPX15, AGAP013327). The analyses performed on these potential targets of small inhibitor molecules revealed that the genes are conserved within SSA populations of An. gambiae s.l. The overall low Fst values and low clustering of principal component analysis between species indicated low genetic differentiation at all the genes (MISO, Vg, Lp and HPX15). The low nucleotide diversity (>0.10), negative Tajima’s D values, and heterozygosity analysis provided ecological insights into the purifying selection that acts to remove deleterious mutations, maintaining genetic diversity at low levels within the populations. None of MISO nsSNPs were identified in linkage disequilibrium, whereas a few weakly linked nsSNPs with ambiguous haplotyping were detected at other genes. Conclusions: This integrated finding on the genetic features of major malaria vectors’ biological factors across natural populations offer new insights for developing sustainable malaria control tools. These loci were reasonably conserved, allowing for the design of effective targeting with small molecule inhibitors towards controlling vector populations and lowering global malaria transmission.
Article
Full-text available
Malaria disease caused by parasites of genus Plasmodium places an enormous disease burden across tropical regions of the world. The circumsporozoite protein (CSP) of Plasmodium has several key functions in binding and accessing host cells, with functions subdivided across multiple protein regions. While its key roles during infection make the gene a primary target for malaria vaccine development, the evolutionary dynamics that could affect the forecasting of useful strains remain poorly understood. We tested whether the gene undergoes multiple DNA substitution processes and whether these are divided across gene regions using a phylogenetic mixture model, and a global sample of CSP sequences specific to P. falciparum. These analyses reveal evolutionary processes unique to the central repeat region and the C-terminus. The central repeat region is dominated by synonymous substitutions (putatively neutral) and heavy C-T substitution bias, while the C-terminus undergoes mostly non-synonymous changes. These evolutionary processes are not strongly geographically restricted, and lineages from Africa and Asia where the parasite is most abundant appear to drive evolution across all CSP gene regions. We propose that insights about DNA substitution processes can help forecast the variants of importance to vaccine development, aided by state-of-the-art evolutionary modelling.
Article
Genetic studies of Plasmodium parasites increasingly feature relatedness estimates. However, various aspects of malaria parasite relatedness estimation are not fully understood. For example, relatedness estimates based on whole-genome-sequence (WGS) data often exceed those based on sparser data types. Systematic bias in relatedness estimation is well documented in the literature geared towards diploid organisms, but largely unknown within the malaria community. We characterise systematic bias in malaria parasite relatedness estimation using three complementary approaches: theoretically, under a non-ancestral statistical model of pairwise relatedness; numerically, under a simulation model of ancestry; and empirically, using data on parasites sampled from Guyana and Colombia. We show that allele frequency estimates encode, locus-by-locus, relatedness averaged over the set of sampled parasites used to compute them. Plugging sample allele frequencies into models of pairwise relatedness can lead to systematic underestimation. However, systematic underestimation can be viewed as population-relatedness calibration, i.e., a way of generating measures of relative relatedness. Systematic underestimation is unavoidable when relatedness is estimated assuming independence between genetic markers. It is mitigated when relatedness is estimated using WGS data under a hidden Markov model (HMM) that exploits linkage between proximal markers. The extent of mitigation is unknowable when a HMM is fit to sparser data, but downstream analyses that use high relatedness thresholds are relatively robust regardless. In summary, practitioners can either resolve to use relative relatedness estimated under independence, or try to estimate absolute relatedness under a HMM. We propose various tools to help practitioners evaluate their situation on a case-by-case basis.
Article
Surveillance of drug resistance and the discovery of novel targets—key objectives in the fight against malaria—rely on identifying resistance-conferring mutations in Plasmodium parasites. Current approaches, while successful, require laborious experimentation or large sample sizes. To elucidate shared determinants of antimalarial resistance that can empower in silico inference, we examined the genomes of 724 Plasmodium falciparum clones, each selected in vitro for resistance to one of 118 compounds. We identified 1448 variants in 128 recurrently mutated genes, including drivers of antimalarial multidrug resistance. In contrast to naturally occurring variants, those selected in vitro are more likely to be missense or frameshift, involve bulky substitutions, and occur in conserved, ordered protein domains. Collectively, our dataset reveals mutation features that predict drug resistance in eukaryotic pathogens.
Article
Full-text available
Abstract Parasites infect hosts in widely varying environments, encountering diverse challenges for adaptation. To identify malaria parasite genes under locally divergent selection across a large endemic region with a wide spectrum of transmission intensity, genome sequences were obtained from 284 clinical Plasmodium falciparum infections from four newly sampled locations in Senegal, The Gambia, Mali and Guinea. Combining these with previous data from seven other sites in West Africa enabled a multi-population analysis to identify discrete loci under varying local selection. A genome-wide scan showed the most exceptional geographical divergence to be at the early gametocyte gene locus gdv1 which is essential for parasite sexual development and transmission. We identified a major structural dimorphism with alternative 1.5 kb and 1.0 kb sequence deletions at different positions of the 3′-intergenic region, in tight linkage disequilibrium with the most highly differentiated single nucleotide polymorphism, one of the alleles being very frequent in Senegal and The Gambia but rare in the other locations. Long non-coding RNA transcripts were previously shown to include the entire antisense of the gdv1 coding sequence and the portion of the intergenic region with allelic deletions, suggesting adaptive regulation of parasite sexual development and transmission in response to local conditions.
Article
Full-text available
Background Passive surveillance of malaria in health facilities remains vital for implementation of control and elimination programs. It is therefore essential understanding current age profile of clinical malaria morbidity, mortality and presentations in areas with variant infection susceptibility. This study aimed at understanding the current malaria morbidity and mortality in Western Kenya. Methods Surveillance of clinical and asymptomatic parasitological positivity rates of all malaria suspected patients and school children were respectively determined from June 2015 to August 2016. From 2014 to 2016, register books in hospitals were referred and the confirmed malaria cases in conjunction with total number of monthly outpatient visits (OPD) counted. All registered malaria admissions were counted together with other causes of admissions. Moreover, outcome of malaria admissions in terms of discharge or death was recorded using inpatient charts within the same time frame. Prospective surveillance of severe malaria collected information on clinical features of the disease. Giemsa stained blood slides confirmed existence of malaria parasitemia. Chi-square and analysis of variance tests were used, respectively, to compute proportions and means; then a comparison was made between different age groups, periods, and study areas. Results During the survey of asymptomatic infections among school children, overall blood slide positivity ranged from 6.4% at the epidemic prone site to 38.3% at the hyperendemic site. During the clinical malaria survey, school age children (5–14) presented with overall the highest (45%) blood slide positivity rate among those suspected to have the infection at the epidemic prone study site. The survey of all malaria confirmed and registered cases at OPD found 17% to 27% of all consultations among <5 children and 9.9% to 20.7% of all OPD visits among the ≥5 patients were due to malaria. Moreover, survey of all registered causes of admission in hospitals found 47% of admissions were due to malaria. The disease was a major cause of admission in epidemic prone setting where 63.4% of the <5 children and 62.8% of the ≥5 patients were admitted due to malaria (p>0.05) and 40% of all malaria admissions were school age children. Malaria related death rate was highest among <5 years at the hyperendemic site, that is 60.9 death per 1000 malaria <5 admissions. Conversely, the epidemic prone setting experienced highest malaria related death among ≥15 years (18.6 death per 1000 admissions) than the < 15 years (5.7 death per 1000 admissions of the <15 years) (p< 0.001). Surveillance of severe form of the disease found that hyperpyrexia, hyperparastemia, prostration and convulsions as common presentations of severe disease. Conclusion Malaria is still the major cause of hospital consultations in Western Kenya with an alarming number of severe forms of the disease among the school aged children at the epidemic prone setting. Mortalities were higher among <5 children years in high infection transmission setting and among ≥15 years in low and moderate transmission settings. Surveillance of asymptomatic and symptomatic malaria along with evaluation of current interventions in different age groups should be implemented in Kenya.
Article
Full-text available
Genome sequences of 247 Plasmodium falciparum isolates collected in The Gambia in 2008 and 2014 were analysed to identify changes possibly related to the scale-up of antimalarial interventions that occurred during this period. Overall, there were 15 regions across the genomes with signatures of positive selection. Five of these were sweeps around known drug resistance and antigenic loci. Signatures at antigenic loci such as thrombospodin related adhesive protein (Pftrap) were most frequent in eastern Gambia, where parasite prevalence and transmission remain high. There was a strong temporal differentiation at a non-synonymous SNP in a cysteine desulfarase (Pfnfs) involved in iron-sulphur complex biogenesis. During the 7-year period, the frequency of the lysine variant at codon 65 (Pfnfs-Q65K) increased by 22% (10% to 32%) in the Greater Banjul area. Between 2014 and 2015, the frequency of this variant increased by 6% (20% to 26%) in eastern Gambia. IC50 for lumefantrine was significantly higher in Pfnfs-65K isolates. This is probably the first evidence of directional selection on Pfnfs or linked loci by lumefantrine. Given the declining malaria transmission, the consequent loss of population immunity, and sustained drug pressure, it is important to monitor Gambian P. falciparum populations for further signs of adaptation.
Article
Full-text available
Identification of genomic regions that are identical by descent (IBD) has proven useful for human genetic studies where analyses have led to the discovery of familial relatedness and fine-mapping of disease critical regions. Unfortunately however, IBD analyses have been underutilized in analysis of other organisms, including human pathogens. This is in part due to the lack of statistical methodologies for non-diploid genomes in addition to the added complexity of multiclonal infections. As such, we have developed an IBD methodology, called isoRelate, for analysis of haploid recombining microorganisms in the presence of multiclonal infections. Using the inferred IBD status at genomic locations, we have also developed a novel statistic for identifying loci under positive selection and propose relatedness networks as a means of exploring shared haplotypes within populations. We evaluate the performance of our methodologies for detecting IBD and selection, including comparisons with existing tools, then perform an exploratory analysis of whole genome sequencing data from a global Plasmodium falciparum dataset of more than 2500 genomes. This analysis identifies Southeast Asia as having many highly related isolates, possibly as a result of both reduced transmission from intensified control efforts and population bottlenecks following the emergence of antimalarial drug resistance. Many signals of selection are also identified, most of which overlap genes that are known to be associated with drug resistance, in addition to two novel signals observed in multiple countries that have yet to be explored in detail. Additionally, we investigate relatedness networks over the selected loci and determine that one of these sweeps has spread between continents while the other has arisen independently in different countries. IBD analysis of microorganisms using isoRelate can be used for exploring population structure, positive selection and haplotype distributions, and will be a valuable tool for monitoring disease control and elimination efforts of many diseases.
Article
Full-text available
Dissecting Plasmodium drug resistance Malaria is a deadly disease with no effective vaccine. Physicians thus depend on antimalarial drugs to save lives, but such compounds are often rendered ineffective when parasites evolve resistance. Cowell et al. systematically studied patterns of Plasmodium falciparum genome evolution by analyzing the sequences of clones that were resistant to diverse antimalarial compounds across the P. falciparum life cycle (see the Perspective by Carlton). The findings identify hitherto unrecognized drug targets and drug-resistance genes, as well as additional alleles in known drug-resistance genes. Science , this issue p. 191 ; see also p. 159
Article
Full-text available
The sustainability of malaria control in Africa is threatened by the rise of insecticide resistance in Anopheles mosquitoes, which transmit the disease. To gain a deeper understanding of how mosquito populations are evolving, here we sequenced the genomes of 765 specimens of Anopheles gambiae and Anopheles coluzzii sampled from 15 locations across Africa, and identified over 50 million single nucleotide polymorphisms within the accessible genome. These data revealed complex population structure and patterns of gene flow, with evidence of ancient expansions, recent bottlenecks, and local variation in effective population size. Strong signals of recent selection were observed in insecticide-resistance genes, with several sweeps spreading over large geographical distances and between species. The design of new tools for mosquito control using gene-drive systems will need to take account of high levels of genetic diversity in natural mosquito populations.
Article
Full-text available
ELife digest Our genomes contain a record of historical events. This is because when groups of people are separated for generations, the DNA sequence in the two groups’ genomes will change in different ways. Looking at the differences in the genomes of people from the same population can help researchers to understand and reconstruct the historical interactions that brought their ancestors together. The mixing of two populations that were previously separate is known as admixture. Africa as a continent has few written records of its history. This means that it is somewhat unknown which important movements of people in the past generated the populations found in modern-day Africa. Busby et al. have now attempted to use DNA to look into this and reconstruct the last 4000 years of genetic history in African populations. As has been shown in other regions of the world, the new analysis showed that all African populations are the result of historical admixture events. However, Busby et al. could characterize these events to unprecedented level of detail. For example, multiple ethnic groups from The Gambia and Mali all show signs of sharing the same set of ancestors from West Africa, Europe and Asia who mixed around 2000 years ago. Evidence of a migration of people from Central West Africa, known as the Bantu expansion, could also be detected, and was shown to carry genes to the south and east. An important next step will be to now look at the consequences of the observed gene-flow, and ask if it has contributed to spreading beneficial, or detrimental, mutations around Africa. DOI: http://dx.doi.org/10.7554/eLife.15266.002
Article
Full-text available
The t-distributed stochastic neighbor embedding t-SNE is a new dimension reduction and visualization technique for high-dimensional data. t-SNE is rarely applied to human genetic data, even though it is commonly used in other data-intensive biological fields, such as single-cell genomics. We explore the applicability of t-SNE to human genetic data and make these observations: (i) similar to previously used dimension reduction techniques such as principal component analysis (PCA), t-SNE is able to separate samples from different continents; (ii) unlike PCA, t-SNE is more robust with respect to the presence of outliers; (iii) t-SNE is able to display both continental and sub-continental patterns in a single plot. We conclude that the ability for t-SNE to reveal population stratification at different scales could be useful for human genetic association studies.
Article
Full-text available
Bantu languages are spoken by about 310 million Africans, yet the genetic history of Bantu-speaking populations remains largely unexplored. We generated genomic data for 1318 individuals from 35 populations in western central Africa, where Bantu languages originated. We found that early Bantu speakers first moved southward, through the equatorial rainforest, before spreading toward eastern and southern Africa. We also found that genetic adaptation of Bantu speakers was facilitated by admixture with local populations, particularly for the HLA and LCT loci. Finally, we identified a major contribution of western central African Bantu speakers to the ancestry of African Americans, whose genomes present no strong signals of natural selection. Together, these results highlight the contribution of Bantu-speaking peoples to the complex genetic history of Africans and African Americans. © 2017, American Association for the Advancement of Science. All rights reserved.
Article
Dramatic changes in transmission intensity can impact Plasmodium population diversity. Using samples from 2 distant time-points in the Dielmo/Ndiop longitudinal cohorts from Senegal, we applied a molecular barcode tool to detect changes in parasite genotypes and complexity of infection that corresponded to changes in transmission intensity. We observed a striking statistically significant difference in genetic diversity between the 2 parasite populations. Furthermore, we identified a genotype in Dielmo and Ndiop previously observed in Thiès, potentially implicating imported malaria. This genetic surveillance study validates the molecular barcode as a tool to assess parasite population diversity changes and track parasite genotypes.