ArticlePDF Available

Robust Image Processing Framework for Intelligent Multi-Stage Malaria Parasite Recognition of Thick and Thin Smear Images

Authors:

Abstract and Figures

Malaria is a pressing medical issue in tropical and subtropical regions. Currently, the manual microscopic examination remains the gold standard malaria diagnosis method. Nevertheless, this procedure required highly skilled lab technicians to prepare and examine the slides. Therefore, a framework encompassing image processing and machine learning is proposed due to inconsistencies in manual inspection, counting, and staging. Here, a standardized segmentation framework utilizing thresholding and clustering is developed to segment parasites’ stages of P. falciparum and P. vivax species. Moreover, a multi-stage classifier is designed for recognizing parasite species and staging in both species. Experimental results indicate the effectiveness of segmenting thick smear images based on Phansalkar thresholding garnered an accuracy of 99.86%. The employment of variance and new transferring process for the clustered members, enhanced k-means (EKM) clustering has successfully segmented all malaria stages with accuracy and an F1-score of 99.20% and 0.9033, respectively. In addition, the accuracies of parasite detection, species recognition, and staging obtained through a random forest (RF) accounted for 86.89%, 98.82%, and 90.78%, respectively, simultaneously. The proposed framework enables versatile malaria parasite detection and staging with an interactive result, paving the path for future improvements by utilizing the proposed framework on all others malaria species.
Content may be subject to copyright.
Citation: Aris, T.A.; Nasir, A.S.A.;
Mustafa, W.A.; Mashor, M.Y.;
Haryanto, E.V.; Mohamed, Z. Robust
Image Processing Framework for
Intelligent Multi-Stage Malaria
Parasite Recognition of Thick and
Thin Smear Images. Diagnostics 2023,
13, 511. https://doi.org/10.3390/
diagnostics13030511
Academic Editor: Ahsan
Khandoker
Received: 14 October 2022
Revised: 16 January 2023
Accepted: 26 January 2023
Published: 31 January 2023
Copyright: © 2023 by the authors.
Licensee MDPI, Basel, Switzerland.
This article is an open access article
distributed under the terms and
conditions of the Creative Commons
Attribution (CC BY) license (https://
creativecommons.org/licenses/by/
4.0/).
diagnostics
Article
Robust Image Processing Framework for Intelligent
Multi-Stage Malaria Parasite Recognition of Thick and Thin
Smear Images
Thaqifah Ahmad Aris 1, Aimi Salihah Abdul Nasir 1, Wan Azani Mustafa 1,2,* , Mohd Yusoff Mashor 3,
Edy Victor Haryanto 4and Zeehaida Mohamed 5
1Faculty of Electrical Engineering and Technology, Universiti Malaysia Perlis, UniCITI Alam Campus,
Sungai Chuchuh, Padang Besar 02100, Malaysia
2Advanced Computing (AdvCOMP), Centre of Excellence, Universiti Malaysia Perlis, Pauh Putra Campus,
Arau 02600, Malaysia
3Faculty of Electronic Engineering and Technology, Universiti Malaysia Perlis, Pauh Putra Campus,
Arau 02600, Malaysia
4Faculty of Engineering and Computer Science, Universitas Potensi Utama, Medan 20241, Indonesia
5
Department of Medical Microbiology and Parasitology, School of Medical Sciences, Universiti Sains Malaysia,
Kubang Kerian 16150, Malaysia
*Correspondence: wanazani@unimap.edu.my
Abstract:
Malaria is a pressing medical issue in tropical and subtropical regions. Currently, the
manual microscopic examination remains the gold standard malaria diagnosis method. Nevertheless,
this procedure required highly skilled lab technicians to prepare and examine the slides. Therefore, a
framework encompassing image processing and machine learning is proposed due to inconsistencies
in manual inspection, counting, and staging. Here, a standardized segmentation framework utilizing
thresholding and clustering is developed to segment parasites’ stages of P. falciparum and P. vivax
species. Moreover, a multi-stage classifier is designed for recognizing parasite species and staging in
both species. Experimental results indicate the effectiveness of segmenting thick smear images based
on Phansalkar thresholding garnered an accuracy of 99.86%. The employment of variance and new
transferring process for the clustered members, enhanced k-means (EKM) clustering has successfully
segmented all malaria stages with accuracy and an F1-score of 99.20% and 0.9033, respectively. In
addition, the accuracies of parasite detection, species recognition, and staging obtained through a
random forest (RF) accounted for 86.89%, 98.82%, and 90.78%, respectively, simultaneously. The
proposed framework enables versatile malaria parasite detection and staging with an interactive
result, paving the path for future improvements by utilizing the proposed framework on all others
malaria species.
Keywords:
plasmodium parasite; image segmentation; thresholding; clustering; species and staging;
machine learning
1. Introduction
In this day and age, malaria remains a threat to public health worldwide, especially in
tropical regions, such as Africa. This can be due to the environmental conditions suitable
for the growth of mosquitoes and exacerbated by poor socio-economic conditions. Malaria
is caused by plasmodium parasites transmitted through the infected female Anopheles
mosquito bite. There are five species of malaria, namely P. falciparum,P. vivax,P. malariae,
P. ovale, and P. knowlesi. However, among these species, P. falciparum and P. vivax species
are the two most common species that typically cause malaria infection, while others are
considered to be occasionally seen. As stated in World Health Organization (WHO) report
in 2020, there were an estimated 241 million malaria cases and 627,000 malaria deaths
Diagnostics 2023,13, 511. https://doi.org/10.3390/diagnostics13030511 https://www.mdpi.com/journal/diagnostics
Diagnostics 2023,13, 511 2 of 36
worldwide [
1
]. Additionally, 77% of all malaria deaths occurred in children under five
years old and remained the highest recorded mortality rate in 2020 [1].
Despite advances in rapid point-of-care diagnostics, rapid diagnostic tests (RDTs),
and manual microscopic examination based on thick and thin blood smears remain the
most recommended diagnostic methods for routine malaria diagnosis of suspected malaria
cases. On top of that, microscopic examination is the “gold standard” method for malaria
diagnosis [
2
]. In this procedure, both thick and thin blood smears are used to identify the
parasites and counted to quantify the level of parasitemia [
3
]. Here, the analysis of a thick
blood smear is used for parasite detection. This is followed by the analysis of a thin blood
smear to identify malaria parasite species and its life-cycle stages. Thick blood smears
utilize a greater volume of blood and multiple layers of red blood cells (RBCs), which is
about 6
µ
L of blood, making it more sensitive for parasite detection [
3
]. Meanwhile, thin
blood smears utilize a single layer of uniformly spread RBCs, which is only about 2–3
µ
L
of blood, making it unsuitable for parasite detection [
3
]. However, due to alcohol fixation
prior to staining, the RBCs in the thin blood smear are preserved, which makes it more
sensitive to analyze parasite morphology and parasitic forms that may be present.
Currently, lab technicians will manually count and examine thick and thin blood smear
samples under a microscope. Although the manual microscopic examination is acknowl-
edged for its high sensitivity and being competently specific, the sensitivity and specificity
of microscopy differ based on the quality of the stained slide and the expertise of the lab
technicians. Nevertheless, this manual evaluation may vary amongst slide readers and be
inaccurate [
4
]. Furthermore, this procedure is labor-intensive. The lab technicians usually
examine 30 to 40 slides per day while being responsible for diagnosing other diseases [
5
].
In addition, highly skilled lab technicians are needed to prepare and examine the slides,
and distinguish between the parasite species and stages. This is because some parasite
morphologies are almost similar, such as those of P. vivax,P. ovale species, and the early
trophozoite stage of P. falciparum and P. knowlesi species [
6
]. Moreover, microscopic exami-
nation and RDTs have limitations, which are ineffective when detecting sub-microscopic
infections. Therefore, due to the shortcomings of the currently recommended diagnostic
methods, new diagnostic procedures are compulsory.
Thus, a computerized system for malaria diagnosis consisting of image processing
and machine learning approaches is necessary to examine more blood slides compared to
the manual diagnosis. Yet, several issues arise upon developing a computerized system,
such as the emergence of noise, shadow, random backgrounds, overlapping objects, and
illumination issues, and making segmenting medical images, such as blood images, and
difficult work in image analysis and processing [
7
]. In addition, the low image quality and
the usage of staining solution significantly impact the parasite’s appearance, contributing
to the false analysis in the computerized system. In addition, the small size of the parasites
is also one of the main factors that make the computerized system difficult to isolate the
parasites from stained blood images on thick blood smear samples. Thus, relevant image-
processing procedures and machine learning algorithms are necessary to provide good
results in analyzing thick and thin blood smear images. Recently, many studies have been
conducted to diagnose malaria based on thick and thin blood smear images with various
transfer learning techniques. Hence, novel network architectures and image-processing
procedures were proposed to improve the performance of malaria diagnosis [810].
Azif et al. [
11
] reported accuracy and sensitivity of 61.53% and 98.04%, respectively, for
parasite detection based on thick blood smear images. Correspondingly, Gitonga et al. [
12
]
reported 96.2% accuracy in identifying plasmodium species and 99.9% classification of life-
cycle stages on infected RBC images using an artificial neural network (ANN) classifier. On
the other hand, Fatima and Farid [
13
] reported an accuracy of 91.80% for parasite detection
based on thin blood smear images using a segmentation approach of bilateral filter and
adaptive thresholding technique. Additionally, Parveen et al. [
14
] proposed a multilayer
perceptron (MLP) neural network to classify malarial parasites into three different species.
The study reported accuracy of 85% was achieved when performing the classification using
Diagnostics 2023,13, 511 3 of 36
the proposed classifier. Alternatively, Nugroho et al. [
15
] have identified three malaria
stages of P. falciparum species by utilizing MLP training by the backpropagation algorithm.
The results indicate that the proposed method achieves an accuracy of 87.8%, a sensitivity
of 81.7%, and a specificity of 90.8% for detecting infected RBCs.
Certainly, the segmentation process is important since further steps, such as feature
extraction and classification, depend on the accurate segmentation of the region of interest
(ROI). Therefore, a good image segmentation procedure that can be used for obtaining
segmented malaria parasites in thick and thin blood smear images is required to be analyzed
thoroughly. Furthermore, segmentation quality is important for the success of the image
analysis process. Many researchers have proposed various image segmentation approaches
to improve malaria diagnosis. This includes the application of k-means (KM) clustering [
16
],
fast k-means (FKM) clustering [
17
], cascaded EKM [
18
], fuzzy c-means (FCM) clustering [
18
],
watershed distance transform [
19
], Otsu’s thresholding [
20
], and adaptive thresholding [
21
].
In addition to segmentation, a good image classification technique is also important in
producing a reliable malaria diagnosis system. It can be used to classify malaria parasites
in thick blood smear images and types of species in thin blood smear images. In addition, it
can also classify stages in thin blood smear images required to diagnose malaria efficiently.
The significance and inspiration for conducting this study have been described in this
paragraph. The recent advances in machine learning have provided an opportunity to
utterly explore automating the detection of parasites from thick and thin smears images.
Nonetheless, the literature lacks proposed works to detect and segment malaria parasites
in thick and thin smears based on one complete standard framework. Most of the work
presented focused on either detecting thick parasite smears for counting or classifying
parasites extracted from thin smear images based on frameworks presented in different
studies. Furthermore, to the best of the authors’ knowledge, most of the existing references
have paid significant attention to processing only the thin blood smear samples compared
to the thick blood smear [
22
24
]. As for the thick smear, most of the literature has focused
on counting the parasites to estimate the level of parasitemia after the segmentation is
complete, lacking a framework for enhancing the parasite count with machine learning.
Despite significant improvements in model performance in thin smear analysis, there is
currently no widely used tool for the automated detection and staging of malaria parasites.
The main objective of this research work is to develop an intelligent multi-stage
malaria parasite recognition for microscopy diagnosis of both thick and thin smear images.
Additionally, the sub-objective of this research is to find the best image segmentation
technique to segment thick and thin smear images, find the best classifier to classify malaria
parasites in both thick and thin images, and classify species and stages in thin smear images.
Note that this research will focus on processing the most common species responsible for
malaria infection in most cases. They are P. falciparum and P. vivax species of thick and
thin smear samples of the ring, trophozoite, schizont, and gametocyte stages. In the
experimental analysis, eleven relevant image segmentation techniques and three machine
learnings have been compared to select the most suitable approach for malaria parasite
recognition with the purpose to improve the standardization on thick and thin smear
inspection for malaria research and potential field diagnosis. The contributions of this
paper can be explicitly stated below:
A full framework constituted from image processing and machine learning is designed
to address the problem of detecting parasites in thick smear microscopy, classifying
parasite species, and staging for thin smear microscopy.
A standard segmentation framework based on thresholding and clustering algorithms
is developed to segment malaria parasites in thick and thin smears of all stages for
both P. falciparum and P. vivax species.
The proposed segmentation framework is applicable in producing good segmentation
results, though the captured malaria image varies in terms of lighting conditions, and
each stage of the malaria parasite consists of different shapes and sizes.
Diagnostics 2023,13, 511 4 of 36
The use of several machine-learning algorithms for classifying parasite species and
life-cycle stages resulted in benchmark results that can be compared with deep-
learning algorithms.
2. Related Literature of Malaria Diagnosis
Various applications of image segmentation procedures and artificial intelligent classi-
fiers have been proposed by previous researchers to improve the network performances to
perform malaria diagnosis, including the classification of malaria parasites, their species,
and life-cycle stages. For instance, Nanoti et al. [
25
] have proposed malaria detection
of species and life-cycle stages based on thin blood smear images. In this study, several
pre-processes were applied to the input image: geometric mean filter, median filter, and
forward discrete curvelet transform, to improve the image quality. Subsequently, partial
contrast stretching was utilized to enhance the contrast of the parasite region. Here, the
enhanced image was converted into RGB (red, green, blue), HIS (hue, intensity, saturation),
and L*A*B color spaces and applied with KM clustering. Based on this study, the A*B com-
ponent produces a good, segmented image. Thus, it is selected to be extracted by various
texture and shape-based features, such as Gray Level Co-occurrence Matrix (GLCM), Gray
Level Run Length Matrix (GLRLM), perimeter, area, entropy, local binary pattern, Hu’s
seven invariant moments, and histogram features. A total of 300 images were extracted
and d on the k-nearest neighbor and support vector machine (SVM). The finding shows
that classification using k-nearest neighbor produces higher accuracy and sensitivity with
values of 90.17% and 90.23%, respectively, compared to SVM.
Hendrawan et al. [
26
] have proposed image-processing procedures to identify malaria
infection in four types of plasmodium species and three different life-cycle stages. First,
the input image was enhanced using a color normalization process followed by gamma
correction to improve the quality of the image. Consequently, edge enhancement was
applied to sharpen the parasite’s edge. The image was then converted to grayscale and
green color components before segmenting using fuzzy c-means (FCM) clustering. After
that, erosion and closing morphological operation was conducted to remove unwanted
pixels, and close the holes produced inside the parasite’s region. This proposed analysis was
tested on 574 malaria image datasets. The experimental results present that the proposed
image-processing procedures successfully obtain a good malaria-segmented image. It was
done by applying a green color component with a segmentation performance sensitivity of
97.91%, specificity of 98.61%, and accuracy of 98.26%.
On the other hand, Dave [
27
] has analyzed several image-processing steps for malaria
parasite detection based on thick blood smear images. At first, the malaria image was
converted into hue, saturation, and value (HSV) color space, and the saturation component
was extracted. Correspondingly, the average filter was applied to the saturation component
to reduce the noise produced in the malaria image. After that, the image was segmented
using adaptive thresholding, and several morphological operations, such as filling holes and
closing operations, were applied to eliminate unwanted pixels. Using the final segmented
images, various features based on shape, texture, color, and frequency domain are extracted
to be fed as input to the SVM classifier. The finding shows that classification using SVM
has produced a good discrepancy in parasite count, sensitivity, and specificity with values
of 7.18%, 86.34%, and 96.60%, respectively.
Aggarwal et al. [
28
] have proposed image-processing procedures for detecting malaria
parasites using thin blood smear samples. First, the input image was enhanced using
the histogram equalization technique to improve the image’s contrast. Subsequently, a
median filter was applied to discard noises produced on the enhanced image. The image is
then converted to binary using Otsu thresholding. This proposed analysis was tested on
60 malaria image datasets. The average evaluation results demonstrate that the proposed
image-processing procedures successfully obtained a good malaria-segmented image with
an accuracy of 93%.
Diagnostics 2023,13, 511 5 of 36
Setianingrum et al. [
29
] have proposed image-processing procedures to identify
malaria parasites in trophozoite and schizont stages based on thin blood smear images of P.
falciparum species. At first, the malaria image was converted into grayscale and enhanced
by contrast stretching. Next, the enhanced image was segmented using Otsu thresholding.
After that, the GLCM and morphological features were extracted from the malaria image to
be fed as input to the SVM classifier. The result shows that SVM classification has achieved
91.67% accuracy in detecting the two stages of P. falciparum species.
Mustafa et al. [
30
] have analyzed several thresholding methods, such as Wolf,
Bradley, Bernsen, Triangle, Deghost, and FCM algorithms, on the malaria image dataset
to simplify the malaria image into something easier to examine. At first, the malaria
image was converted to a grayscale image before applying the selected thresholding
methods. This proposed analysis has been tested on 30 malaria image datasets. Note
that the average evaluation results present that the FCM algorithm method successfully
obtains a good malaria-segmented image by having a better segmentation performance
compared to other methods with a sensitivity of 95.51% and specificity of 75.1%, and
overall performance of 85.31%.
Alternatively, Nugroho et al. [
31
] proposed a segmentation procedure to segment
infected red blood cells (RBCs) in thin blood smear images using threshold and morpholog-
ical operation. In this study, the malaria image was enhanced and converted to grayscale
before being applied to a multilevel Otsu threshold. Overall, the proposed procedure
was tested on a total of 30 malaria. The results showed that the proposed segmentation
procedure achieved 96.74% of accuracy, 76.77% of sensitivity, 99.74% of specificity, 97.84%
of prediction value positive, and 96.61% of prediction value negative. They indicate that
this proposed procedure provides a consistent result for segmenting parasites in thin blood
smear images.
Sifat and Islam [
32
] have proposed identifying the malaria parasite species and life-
cycle stages based on thin blood smear images. The combination of median and geometric
mean filters was utilized in the proposed methodology to improve image quality. After
that, the image was enhanced by applying a partial contrast technique before extracting
the RBC from blood smear images using the U-Net segmentation technique. Subsequently,
a convolutional neural network (CNN) was used to identify infected RBCs. In contrast,
the visual geometry group 16 (VGG16) network was utilized to distinguish between the
various species and the malaria life cycle. This proposed method has been practiced on
5512 malaria images. Using the CNN model, the detection accuracy and specificity of
infected RBC were 100% and 95%, respectively. Meanwhile, an average accuracy of 95.55%,
and specificity of 94.75% was achieved for species identification, while an overall accuracy
of 96.25%, and specificity of 94.82% were achieved in the ring life cycle.
Nugroho et al. [
33
] proposed the classification of malaria species and the life cycle
stages of P. falciparum,P. vivax,P. malariae, and P. ovale species. In this research, the input
image was converted to a grayscale image. After that, the Otsu thresholding was applied
to the grayscale image to produce a binary image. A morphological operation called
the opening operation was applied to the binary image to remove unwanted regions.
Consequently, a morpho-geometrical feature has been extracted and fed as input to Naïve
Bayes (NB) classifier. The application of the proposed algorithm achieved a sensitivity
value of 84.37%, a positive predictive value of 80.60%, and an F1 score of 80.60%, which
shows that the proposed algorithm is reliable.
Swastika et al. [
34
] have employed regularization on several CNN architectures. It
was to investigate the impact of regularization on increasing malaria parasite detection
accuracy using thin blood smear images. This study has applied regularization methods,
including data augmentation, dropout layer, and L2 regularization, to ResNet-50, MicroVG-
GNet, and BaselineNet-1 models. The analysis was conducted on a total of 27,588 images,
consisting of 13,779 uninfected blood cells and 13,779 parasite-infected images. It was then
compared between the CNN model with regularization and without regularization. The
findings indicate that 94.92% accuracy was attained when BaselineNet1 was used without
Diagnostics 2023,13, 511 6 of 36
regularization. Meanwhile, the accuracy of 97.12%, 95.64%, and 96.28% was attained by
applying regularization on ResNet-50, MicroVGGNet, and BaselineNet-1, respectively.
On the other hand, Taha and Liza [
35
] have utilized two types of deep learning models
to identify malaria from blood smear images based on CNN. In this study, 27,558 thin
blood smear images have been resized into the same size array with a width and height
of 299
×
299 pixels. Two deep learning models were used and compared: basic CNN and
InceptionResNetV2 model. Based on the findings, the InceptionResNetV2 provides much
higher accuracy with a value of 97.10%. This includes high precision, recall, F1-score, and
Matthew’s correlation coefficient, with 96.93%, 97.28%, 97.10%, and 94.20%, respectively.
Compared to the basic CNN with five convolutional layers, it obtained an accuracy of
96.46% with precision, recall, F1-score, and Matthew’s correlation coefficient being 98.30%,
94.56%, 96.39%, and 92.99%.
Widiawati et al. [
36
] have proposed a malaria detection method based on thick blood
smear images. This study applied several pre-processing techniques, such as color model
conversion, contrast stretching, and median filter, on 38 thick blood smear images. Corre-
spondingly, the images were segmented using Bottom-Hat and Adaptive Entropy Thresh-
olding. After that, several texture features were extracted from the green component image:
mean, standard deviation, skewness, energy, and entropy. Here, three different classifiers
have been compared: NB, SVM, and MLP. The best results in the classification of species
were obtained from the NB classification with an accuracy of 82.05%.
Maqsood et al. [
37
] have analyzed the efficiency of several current deep-learning
models on malaria detection using blood smear images. Additionally, the author also
proposed an efficient deep-learning method for classifying infected and uninfected malaria
cells utilizing CNN without using a hand-crafted feature. This study employed image
augmentation and bilateral filtering techniques to emphasize the features of RBCs before
training the datasets. Here, the modified CNN model is generalized, and over-fitting can
be prevented because of image augmentation approaches. The analysis has been conducted
on a total of 27,588 malaria datasets obtained from the National Institute of Health (NIH)
website. The results indicate that the proposed algorithm can perform malaria detection
with an accuracy of 96.82%.
Other than that, Setiawan et al. [
38
] have proposed a computer-aided diagnosis to
detect and count the malaria parasites in thick blood smears. This study utilizes a color-
based segmentation to detect the parasite using three different color spaces: RGB, YCbCr,
and LAB. Initially, the malaria image was converted to several color components. Next,
Otsu thresholding was used to segment the converted malaria image. This proposed study
has been applied to 27,558 images of 13,779 infected malaria images and 13,779 uninfected
malaria images. The best results are achieved by color-based segmentation in RGB color
space with an accuracy of 94.75%, sensitivity of 96.02%, and specificity of 93.47%.
Rameen et al. [
39
] proposed supervised machine learning techniques for malaria identi-
fication based on samples of blood smears. This study involves several pre-processing, such
as resizing the input images and converting the images into grayscale. Subsequently, Otsu
thresholding was implemented to produce the binary image. Afterwards, the GoogLeNet
was applied to extract features from the segmented blood smear image to be fed as input
to the SVM classifier. Based on the experimental result, the application of the proposed
algorithm achieved an accuracy value of 95.80%, which proves that the proposed algorithm
is reliable.
Aris et al. [
40
] have analyzed the utilization of several clustering techniques to acquire
fully segmented malaria parasites based on thick blood smear images of P. falciparum and
P. vivax species. At first, the author applied modified global contrast stretching, followed
by gray world techniques, to improve the image quality. Next, the enhanced image was
converted into a blue component of RGB color spaces to ease the segmentation process.
Here, KM, FKM, and EKM clustering techniques were applied to 100 thick blood smear
images. Eventually, the segmentation performances were measured and demonstrated that
Diagnostics 2023,13, 511 7 of 36
segmentation using the FKM clustering algorithm produced a good segmentation result,
with an accuracy of 99.91%, sensitivity of 75.75%, and specificity of 99.93%.
Apart from that, Shal and Gupta [
41
] examined the application of different classifier
models to classify uninfected and infected cells for malaria detection. In this study, classifier
models, namely CNN, YOLOv4, YOLOv5, and single shot detector, were applied on
27,560 images consisting of 13,780 infected images and 13,780 uninfected cell images.
Furthermore, these four classifier models were tested on several performance evaluation
techniques, such as accuracy, true positivity rate, and precision. Based on this analysis, the
best results are achieved in YOLOv5 with an accuracy of 94.67%, a true positivity rate of
50.30%, and a precision of 95.25%.
Razin et al. [
42
] developed a model for malaria parasite detection by implementing
the CNN and YOLOv5 model to detect the malaria parasites and classify the uninfected
and infected malaria images. This research was applied to a total of 27,588 thin blood
smear images obtained from the NIH website, a publicly accessible dataset. In accordance
with the experimental outcomes, the YOLOv5 model obtained an accuracy of 95.37% for
infected and 97.05% for uninfected samples. In addition, the CNN model can obtain the
highest accuracy with a value of 96.21% for both infected and uninfected blood samples.
Additionally, this CNN model also achieved high precision and recall rates of 95.42% and
97.05%, respectively.
On the other hand, Zarima et al. [
43
] utilize hand-crafted features and deep learning
techniques on blood smear images to diagnose malaria. In this study, a texture feature
known as Local Binary Pattern (LBP) has been extracted from a total of 27,588 images,
consisting of 13,779 uninfected blood cells images and 13,779 parasite’s infected images.
This study analyzed several deep-learning models, namely ResNet-34, VGG16, Inception
V3, and EfficientNet, to determine which model yield the best accuracy. Based on the
experimental result, the EfficientNet model has obtained the highest accuracy, with a value
of 91%, compared to the VGG16, ResNet-34, and InceptionV3 models, with 87%, 81%, and
77%, respectively.
Gummadi, Ghoosh, and Vootla [
44
] employed a transfer-learning-based CNN architec-
ture to differentiate between uninfected and infected malaria images using global average
pooling (GAP) and heat maps. In this study, a total of 27,588 thin blood smear images
have been used on three transfer learning-based neural network architectures, known as
VGG16, VGG19, and InceptionResNetV2 architecture. These three classifiers were trained
and compared. Based on the study finding, InceptionResNetV2 architecture yielded a
maximum accuracy of 96.88%, with sensitivity, specificity, F1-score, and precision of 97.35%,
96.41%, 96.89%, and 96.44%, respectively. In addition, this study also interfaced with a
newly developed web service that anyone can easily access.
Abdurahman, Fante, and Aliy [
45
] have proposed an automated malaria parasite
detection based on thick blood smear images using modified YOLOv3 and YOLOv4
models. In this study, these modified models were called YOLOv3-MOD1, YOLOv3-
MOD2, and YOLOv4-MOD. The effectiveness of the modified YOLOv3 and YOLOv4
models was assessed using a total of 1182 blood smear images consisting of 948 malaria-
infected images and 234 normal images. Based on the study verdict, YOLOv3-MOD1,
YOLOv3-MOD2, and YOLOv4-MOD models have superior mean average precision (mAP),
recall, precision, F1-score, and average IOU compared to their original versions, faster
region-based convolutional neural network (FR-CNN), and single shot multi-box detector
(SSD). However, among all the models, YOLOv4-MOD has the highest detection accuracy,
with a mAP of 96.32%, recall of 94%, precision of 95%, F1-score of 94%, and average IOU of
62.12% as compared to YOLOv3-MOD1 and YOLOv3-MOD2 models.
Based on the findings from previous researchers’ work, various segmentation tech-
niques have been proposed and successfully applied for the segmentation of malaria
parasites or infected RBCs. In addition, it can be found that segmentation using thresh-
olding and clustering offers several advantages in segmenting malaria images. Generally,
the thresholding algorithm is suitable to be applied when the objects are not overlapped,
Diagnostics 2023,13, 511 8 of 36
and the gray levels are distinct from the background level. However, this algorithm is less
suitable to be applied on images blurred at the object boundaries or for the overlapped
ROI area as it will result in over-segmentation. Meanwhile, the clustering algorithm will
segment the image into clusters, having pixels with similar characteristics, such as group-
ing the pixels based on intensity similarity. As a result, both thresholding and pixel-level
segmentation based on clustering will also be added to the framework for segmenting the
malaria parasites. This is an initiative to find suitable segmentation techniques, reducing
the confounding of neighboring cells on parasite detection due to over-segmentation.
Subsequently, based on the review of intelligent classifiers, it is found that several
research has utilized SVM to perform the malaria detection process and classify malaria
parasite species and their life-cycle stages. Although SVM is a popular method utilized
to classify malaria parasites due to its good generalization performance, SVM has some
limitations. The biggest limitation of SVM lies in the selection of the kernel [
46
]. Therefore,
SVM will not be utilized for classification in this study due to these limitations. Addition-
ally, the performance of the extreme learning machine (ELM) network will be compared
with other types of intelligent classifiers, online sequential-ELM (OS-ELM), and RFs, to
determine the capability of these methods for the classification process. The OS-ELM
has been chosen to be compared because it is an upgraded version of ELM. It has been
proven to be faster than the other well-known sequential algorithms and produces better
generalization performance with lower training time [
47
]. As for RF, it is an ensemble
classifier that can be used for complex classification tasks.
3. Materials and Methods
In this study, the processes to obtain thick and thin blood smear segmented images
and classify malaria parasites, types of species, and their life-cycle stages consist of three
main phases. The first phase deals with data collection and image acquisition setup, while
the second deals with segmentation. Lastly, the third phase deals with classification. This
section describes various applications of image segmentation and machine learning techniques
applied to thick and thin blood smear images. The full proposed procedures for thick and
thin blood smear images included image enhancement, extraction of color components, image
segmentation, watershed segmentation (only for thick blood smear images), removal of
unwanted pixels, feature extraction, and classification will be explained in next sections.
3.1. Data Collection and Image Acquisition Setup
In this research, the malaria slides of P. falciparum and P. vivax species of both thick and
thin blood smear samples were obtained from the Department of Medical Microbiology and
Parasitology, Hospital Universiti Sains Malaysia (HUSM). The blood slides were examined
under 100
×
oil immersion using an OLYMPUS BX41 digital microscope to capture the thick
blood smear images, as shown in Figure 1. The images of the slides were captured using an
OLYMPUS XC50 digital camera with a resolution of 1294
×
980 pixels and saved in a jpg
format. The blood slides were examined under the 100
×
oil immersion objective of the Leica
DLMA 1200 digital microscope for the thin blood smear images. The images of the slides
were captured using a Luminera Infinity-2 digital camera with a resolution of 800
×
600 pixels
and saved in bitmap (*.bmp) format.
Diagnostics 2023, 13, x FOR PEER REVIEW 9 of 38
Figure 1. OLYMPUS BX41 digital microscope.
This study captured 500 thick blood smear images from four malaria slides contain-
ing both P. falciparum and P. vivax species. Meanwhile, 4035 thin blood smear images with
various conditions were captured from 34 malaria slides consisting of both P. falciparum
and P. vivax species in the ring, mature trophozoite, schizont, and gametocyte stages. Ad-
ditionally, both thick and thin blood smear images were captured under several lighting
conditions, such as underexposed, overexposed, and normal, as seen in Figure 2. Figure 2
represents the samples of the captured blood images from P. falciparum and P. vivax in
thick and thin blood smear samples. As seen in Figure 2, the stain highlights the malaria
parasites and other components inside the blood sample, such as white blood cells
(WBCs), platelets, and artifacts. As a result, the WBCs and platelets also share a similar
color property with the malaria parasites, as seen in images (b) and (f). Figure 2d,e,i exhibit
the samples of the captured malaria images with the presence of Herring-bone scratches
in the glass slide, air bubbles under the coverslip of the slide, and artifacts, respectively.
(a) (b) (c) (d) (e)
(f) (g) (h) (i) (j)
Figure 2. The samples of the captured blood images consisting of (a) P. falciparum of thick smear;
(be) P. falciparum of thin smear in the ring, trophozoite, schizont, gametocyte; (f) P. vivax of thick
smear; (gj) P. vivax of thin smear in the ring, mature trophozoite, schizont, gametocyte.
3.2. Image Segmentation Using Various Local Thresholding and Clustering Techniques on Thick
and Thin Blood Smear Images
Image segmentation separates an image into homogenous segments based on spe-
cific criteria [48]. The basic aim of this separation was to ensure the images were easily
analyzed and interpreted by preserving the image quality. Note that segmentation quality
is one of the important elements for the success of the image analysis process. Segmenta-
tion, such as thresholding and clustering, offers several benefits compared to other meth-
ods. The significant benefit of the thresholding technique was that it could be employed
on non-overlapping objects because their grey levels were easily distinguished from the
background level. Meanwhile, the significant benefit of the clustering technique was ideal
for segmenting the medical image as the number of clusters could be determined from the
image with specific anatomical regions. Therefore, this research used several local thresh-
Figure 1. OLYMPUS BX41 digital microscope.
Diagnostics 2023,13, 511 9 of 36
This study captured 500 thick blood smear images from four malaria slides containing
both P. falciparum and P. vivax species. Meanwhile, 4035 thin blood smear images with
various conditions were captured from 34 malaria slides consisting of both P. falciparum
and P. vivax species in the ring, mature trophozoite, schizont, and gametocyte stages.
Additionally, both thick and thin blood smear images were captured under several lighting
conditions, such as underexposed, overexposed, and normal, as seen in Figure 2. Figure 2
represents the samples of the captured blood images from P. falciparum and P. vivax in thick
and thin blood smear samples. As seen in Figure 2, the stain highlights the malaria parasites
and other components inside the blood sample, such as white blood cells (WBCs), platelets,
and artifacts. As a result, the WBCs and platelets also share a similar color property with the
malaria parasites, as seen in Figure 2b,f. Figure 2d,e,i exhibit the samples of the captured
malaria images with the presence of Herring-bone scratches in the glass slide, air bubbles
under the coverslip of the slide, and artifacts, respectively.
Diagnostics 2023, 13, x FOR PEER REVIEW 9 of 38
Figure 1. OLYMPUS BX41 digital microscope.
This study captured 500 thick blood smear images from four malaria slides contain-
ing both P. falciparum and P. vivax species. Meanwhile, 4035 thin blood smear images with
various conditions were captured from 34 malaria slides consisting of both P. falciparum
and P. vivax species in the ring, mature trophozoite, schizont, and gametocyte stages. Ad-
ditionally, both thick and thin blood smear images were captured under several lighting
conditions, such as underexposed, overexposed, and normal, as seen in Figure 2. Figure 2
represents the samples of the captured blood images from P. falciparum and P. vivax in
thick and thin blood smear samples. As seen in Figure 2, the stain highlights the malaria
parasites and other components inside the blood sample, such as white blood cells
(WBCs), platelets, and artifacts. As a result, the WBCs and platelets also share a similar
color property with the malaria parasites, as seen in images (b) and (f). Figure 2d,e,i exhibit
the samples of the captured malaria images with the presence of Herring-bone scratches
in the glass slide, air bubbles under the coverslip of the slide, and artifacts, respectively.
(a) (b) (c) (d) (e)
(f) (g) (h) (i) (j)
Figure 2. The samples of the captured blood images consisting of (a) P. falciparum of thick smear;
(be) P. falciparum of thin smear in the ring, trophozoite, schizont, gametocyte; (f) P. vivax of thick
smear; (gj) P. vivax of thin smear in the ring, mature trophozoite, schizont, gametocyte.
3.2. Image Segmentation Using Various Local Thresholding and Clustering Techniques on Thick
and Thin Blood Smear Images
Image segmentation separates an image into homogenous segments based on spe-
cific criteria [48]. The basic aim of this separation was to ensure the images were easily
analyzed and interpreted by preserving the image quality. Note that segmentation quality
is one of the important elements for the success of the image analysis process. Segmenta-
tion, such as thresholding and clustering, offers several benefits compared to other meth-
ods. The significant benefit of the thresholding technique was that it could be employed
on non-overlapping objects because their grey levels were easily distinguished from the
background level. Meanwhile, the significant benefit of the clustering technique was ideal
for segmenting the medical image as the number of clusters could be determined from the
image with specific anatomical regions. Therefore, this research used several local thresh-
Figure 2.
The samples of the captured blood images consisting of (
a
)P. falciparum of thick smear;
(
b
e
)P. falciparum of thin smear in the ring, trophozoite, schizont, gametocyte; (
f
)P. vivax of thick
smear; (gj)P. vivax of thin smear in the ring, mature trophozoite, schizont, gametocyte.
3.2. Image Segmentation Using Various Local Thresholding and Clustering Techniques on Thick
and Thin Blood Smear Images
Image segmentation separates an image into homogenous segments based on specific
criteria [
48
]. The basic aim of this separation was to ensure the images were easily analyzed
and interpreted by preserving the image quality. Note that segmentation quality is one
of the important elements for the success of the image analysis process. Segmentation,
such as thresholding and clustering, offers several benefits compared to other methods.
The significant benefit of the thresholding technique was that it could be employed on
non-overlapping objects because their grey levels were easily distinguished from the back-
ground level. Meanwhile, the significant benefit of the clustering technique was ideal
for segmenting the medical image as the number of clusters could be determined from
the image with specific anatomical regions. Therefore, this research used several local
thresholding and clustering techniques to segment malaria images. This local thresholding
technique was superior in segmenting uneven lighting images because this threshold-
ing technique determines the threshold values according to the local area information.
Explanations about the types of thresholding and clustering algorithms were presented
as follows:
Adaptive thresholding
: Local adaptive thresholding was a basic and simple algorithm
to separate the foreground from the background with non-uniform illumination. For
each pixel in the image, a threshold had to be calculated. If the pixel value is below the
threshold, it is set to the background value. Otherwise, it assumes the foreground value. The
assumption behind this method is that the smaller image regions were more likely to have
approximately uniform illumination, being more suitable for thresholding. Here, the input
image was partitioned into several intersecting sub-images. After that, the histogram of
Diagnostics 2023,13, 511 10 of 36
each sub-image was analyzed to discover the ideal threshold for that particular sub-image.
By approximating the outputs of the sub-images, the threshold for each pixel was obtained.
The neighborhood size had to be large enough to cover good foreground and background
pixels; otherwise, a poor threshold was chosen. On the other hand, choosing regions that
are too large could violate the assumption of approximately uniform illumination. In the
adaptive thresholding method, the local threshold was determined as the mean of the local
intensity distribution of each pixel [49]. The threshold for each pixel was computed by:
T=max +min
2. (1)
Niblack
: Niblack algorithm was one of the most popular local adaptive algorithms.
It determines the threshold value of every pixel by sliding a particular window size
throughout the image pixel location to set the threshold value. It is performed according to
the local standard deviation and local mean [
50
]. The threshold was obtained based on the
following equation:
T=m+kσ. (2)
Here, kis referred to as a constant value of
0.2 fixed by the Niblack algorithm author,
while the window size could be determined manually. Therefore, both sliding window
size and the k-factor parameter might affect binarization quality. However, this method
had a drawback: It makes excessive noises in the emptied windows [
51
]. Additionally, the
Niblack algorithm did not suffice to adapt non-uniform illumination images.
Sauvola
: Sauvola algorithm was inspired by the Niblack algorithm, which works by
computing the threshold value using a dynamic range of standard deviation in the gray
value image [
52
]. By implementing various gray-level values in the images, this method
addresses the Niblack weakness of excessive noises based on the standard deviation effect.
In Sauvola, the interpolation was utilized for the remaining pixels after the initial threshold
had been determined. This approach gives a more precise threshold value for each pixel.
Thus, using the following equation, the threshold value was determined:
T=m1k1σ
R. (3)
Here, kis referred to as a factor that took on positive values and was often set to
0.5, while Rwas the dynamics of the standard deviation with a fixed value of 128. In
addition, the noise issue was resolved by this algorithm, but it was unable to perform well
on the low-contrast image. In the Sauvola method, the threshold was computed similarly
to the Niblack method. Still, it used the average of an image’s highest and lowest gray
value to modify the local standard deviation to emphasize segmentation on low-contrast
information images.
Bradley
: Bradley algorithm utilized adaptive thresholding to adjust the spatial differ-
ences in illumination by employing the integral image of the input data [
53
]. This method
is effective and easy to use and is more vulnerable to variations in image illumination. To
produce the integral image,
I(x,y)
should be maintained at each position as the sum of
those
f(x,y)
terms toward the left being above the pixel
(x,y)
, which can be computed by
the following equation for each pixel:
x2
x=x1
y2
y=y1
f(x,y)=I(x2,y2)I(x2,y11)I(x11, y2)+I(x11, y11). (4)
This algorithm computes the average window size of pixels centered on each pixel.
The average calculation was completed in constant time by utilizing an integral image.
This approach computed the integral image using the input image in the first phase. In a
subsequent phase, a comparison analysis was used to determine whether a pixel belonged
to the foreground or background region. This was done by determining the window size
average for each pixel using the integral image in constant time. Moreover, inconsistent
Diagnostics 2023,13, 511 11 of 36
luminance images could be effectively binarized using this algorithm. Finally, the Bradley
method determines the local threshold based on the mean of the neighboring pixels using
the integral image as the input.
Phansalkar
: Phansalkar algorithm is the modification of Sauvola’s method, which
was invented to overcome Sauvola’s weakness, specifically to deal with low-contrast
images [
54
]. In Sauvola’s method, if the contrast in the local neighborhood were low, then
the local mean values would be lower, and the threshold value would fall below the mean,
causing the dark regions of the background to be removed. The Phansalkar algorithm
functions similarly to Sauvola’s method, which is based on the local mean and employs the
combination of the local standard deviation and parameters to constrain the calculation
in certain conditions. In addition, this method worked similarly to Sauvola’s method for
high values of the local mean. However, in Phansalkar, if the value of the local means is
low, then the threshold value determined should be much higher than the value of the
local mean and the value determined by Sauvola’s method. This method could be done by
computing the threshold value using the following equation:
T(x,y)=m(x,y)1+peq.m(x,y)+ks(x,y)
R1, (5)
where
k
,
p
, and
q
were constants. Here, the value of
q
was chosen above the value of the
local mean. On the other hand, the exponential term in the equation becomes negligible as
the equation functions similarly to Sauvola’s method. Subsequently, the constant
p
decides
the magnitude to which the exponential term affects the threshold. If the
p
values were low,
such as (0
<=p<=
1), then it would function similarly to Sauvola’s method. However,
if the
p
value is high, such as (
p>
5), the threshold would become high, and too many
background pixels would be classified as foreground.
Feng
: Feng’s method was built on the idea of two local windows [
55
]. This algorithm
functioned locally and calculated the dynamic range of gray-level value standard deviation.
Here, the values of the minimum gray level, standard deviation, and local mean were
calculated in the primary local window to determine the local threshold value. Meanwhile,
to compensate for the negative effect of uneven illumination, the dynamic range standard
deviation was calculated in the secondary window as it could tolerate different degrees
of uneven illumination. Therefore, the respective equation was used to calculate the
threshold value:
T=(1a1)m+a2s
Rs(mM)+a3M, (6)
where
a1
is between 0.1–0.2, while
a2=k1s
Rsγ
and
a3=k2s
Rsγ
. These
a1
,
a2
, and
a3
were defined as adaptive weighting. The values of other parameters, such as
k1
and
k2
were
proposed in the ranges between 0.15–0.25 and 0.01–0.05, respectively. In the Feng algorithm,
the local threshold value was bilinearly interpolated to obtain the threshold values for
all pixels to minimize the computational load. In Feng’s method, the local threshold was
determined by considering two local windows, such as primary and secondary. It calculates
the standard deviation of the gray value from the whole image using both windows.
Nick
: Nick’s algorithm was a modified version of the Niblack and Sauvola algo-
rithm [
51
]. This algorithm was created to solve the limitation in the Niblack algorithm
regarding excessive noises in the emptied windows and the problem in the Sauvola al-
gorithm, which is unable to perform well on the low-contrast image. The thresholding
formula of this algorithm was derived based on the basic Niblack algorithm. Apart from
that, this algorithm had been improvised from the Niblack and Sauvola algorithm by shift-
ing the thresholding value downward [
51
]. The following equation was used to determine
the thresholding value:
T=m+krpi2m2
N. (7)
Diagnostics 2023,13, 511 12 of 36
Here, krefers to a constant parameter with a value between
0.1 to
0.2,
pi
refers
to the value of the grayscale pixel levels in the image, and Nis the total number of pixels
in the image. The value of the kfactor could vary from
0.1 to
0.2 depending on the
application requirement. If the kvalue is close to
0.2, the noises were almost eliminated
but could rupture the ROI. Meanwhile, with values close to
0.1, some unwanted pixels
might remain still, but the ROI could be extracted unscathed. Additionally, the need to
determine the kfactor manually is also considered as this algorithm’s drawback. In the
Nick method, the local threshold was determined based on the mean and square root of
the difference between the sums of the squares of pixel intensities inside a local window.
K-means
: KM clustering algorithm was one of the most popular unsupervised and
iterative learning algorithms because of its simplicity. Throughout the process of employing
KM clustering for image segmentation, each pixel in the image was allocated to the closest
cluster center. Note that the distance between the pixel and the center was computed based
on the Euclidean distance [
56
]. Due to its simplicity, KM clustering has become popular for
many clustering tasks. However, the KM clustering algorithm had some limitations that
normally arise, such as the possibility of converging to local minima, dead centers problem,
and being sensitive to initialization of cluster centers.
Fuzzy c-means
: FCM clustering was conducted by designating a membership function
towards each data point according to every cluster center on the justification of distance
measures between the data point and cluster center. Instead of allocating each data to a
single center as in the KM algorithm, each data in the FCM clustering algorithm were given
a membership grade representing the amount it belongs to each center [
57
]. However, due
to the fuzzification in the FCM algorithm, certain data points could fall to more than one
cluster. Yet, FCM performs effectively for overlapping data sets in comparison to KM.
Fast k-means
: FKM clustering was developed to get through the drawbacks of the KM
clustering algorithm. According to the traditional KM clustering algorithm, the distance value
in each loop should be determined between each pixel and each clustering core [
58
]. Hence,
a lot of time was consumed by this event. Therefore, the FKM clustering algorithm used the
histogram value and the KM algorithm’s discrete function to reduce time consumption for
training image cluster centers, and also got control of the retraining cluster center problem.
The exact explanation regarding FKM clustering steps can be referred to in [57].
Enhanced k-Means
: The EKM clustering was a modified clustering algorithm that
was enhanced by the conventional KM clustering algorithm. This clustering technique
was a part of the author’s previous work proposed by Nasir et al. and published in [
59
].
Here, EKM clustering has been invented to address the shortcomings of the KM clustering
algorithm regarding the assignation of the cluster center. Furthermore, to facilitate the
distribution of data to the relevant cluster to generate the finest segmented image, clustering
has invented a new transferring mechanism for the clustered members that employ the
idea of variance [
59
]. In addition, the variance value was calculated and often checked
through the clustering process to assess the relationship between the center and its members.
This action should be done because the data assigned to the center that did not meet a
predetermined criterion should naturally be relocated to the appropriate cluster. A detailed
explanation of how the EKM clustering algorithm works can be referred to in [59].
3.3. Proposed Segmentation Procedures for Thick and Thin Blood Smear Images
This section provides detailed information regarding the proposed image-processing
procedures applied to thick and thin blood smear images. Thus, the full framework based
on image-processing techniques that were designed to segment both thick and thin blood
smear images was disclosed in the following sub-sections.
3.3.1. Thick Blood Smear Segmentation Procedure
The framework of the proposed image-processing procedure for thick smear segmen-
tation is shown in Figure 3. In this study, the thick smear images were captured under
several lighting conditions, such as underexposed, overexposed, and normal, which may
Diagnostics 2023,13, 511 13 of 36
result in low-quality images. To obtain a fully segmented malaria parasite region, sev-
eral image-processing techniques were utilized to process the thick smear images. These
processes include image enhancement, color conversion to extract color components, and
image segmentation.
Diagnostics 2023, 13, x FOR PEER REVIEW 13 of 38
distribution of data to the relevant cluster to generate the finest segmented image, clus-
tering has invented a new transferring mechanism for the clustered members that employ
the idea of variance [59]. In addition, the variance value was calculated and often checked
through the clustering process to assess the relationship between the center and its mem-
bers. This action should be done because the data assigned to the center that did not meet
a predetermined criterion should naturally be relocated to the appropriate cluster. A de-
tailed explanation of how the EKM clustering algorithm works can be referred to in [59].
3.3. Proposed Segmentation Procedures for Thick and Thin Blood Smear Images
This section provides detailed information regarding the proposed image-processing
procedures applied to thick and thin blood smear images. Thus, the full framework based
on image-processing techniques that were designed to segment both thick and thin blood
smear images was disclosed in the following sub-sections.
3.3.1. Thick Blood Smear Segmentation Procedure
The framework of the proposed image-processing procedure for thick smear segmen-
tation is shown in Figure 3. In this study, the thick smear images were captured under
several lighting conditions, such as underexposed, overexposed, and normal, which may
result in low-quality images. To obtain a fully segmented malaria parasite region, several
image-processing techniques were utilized to process the thick smear images. These pro-
cesses include image enhancement, color conversion to extract color components, and im-
age segmentation.
Figure 3. The framework of the proposed image-processing procedure for thick smear segmenta-
tion.
Here, the image enhancement technique was applied to malaria images for two main
purposes. First, it was to improve the visual quality of malaria images, such as blurriness
or poor contrast. Second, it was to reduce the effect of staining variation on the malaria
image. Therefore, the image enhancement process was divided into two steps, namely
contrast enhancement and color constancy. Contrast enhancement was performed on ma-
laria images to enhance the region contrast of malaria parasite for easing the segmentation
process.
Figure 3.
The framework of the proposed image-processing procedure for thick smear segmentation.
Here, the image enhancement technique was applied to malaria images for two main
purposes. First, it was to improve the visual quality of malaria images, such as blurriness or
poor contrast. Second, it was to reduce the effect of staining variation on the malaria image.
Therefore, the image enhancement process was divided into two steps, namely contrast
enhancement and color constancy. Contrast enhancement was performed on malaria images
to enhance the region contrast of malaria parasite for easing the segmentation process.
In contrast, color constancy was performed on malaria images to standardize the color
of background areas to appear as if they were captured under similar color staining. In the
image enhancement process, the thick smear images were first enhanced using the modified
global contrast stretching (MGCS) technique to increase the image contrast [
59
]. MGCS is
the modified version of global contrast stretching (GCS) techniques. Additionally, MGCS
techniques work based on the computation of total pixels in a histogram. The selected
minimum and maximum percentage pixel values within the RGB image’s histogram were
used to enhance the ROI. Note that the required parameter of minimum and maximum
percentage values of minp = 0.1 and maxp = 10 was selected to produce new minimal
and maximal RGB values for the contrast stretching process. The MGCS algorithm was
described in more depth in [
60
]. After that, the color constancy technique, known as the
gray world, was applied to the MGCS image. It was to remove color casts and portray
the true colors of the malaria parasites by standardizing the color of background areas
in malaria images to appear as if they were captured under similar color staining. The
gray world was operated based on the presumption that the average pixel values for R, G,
and B components will converge to the same mean value [
61
]. This presumption allows
the elimination of color casts and the image’s color to be uniformly comparable to the
actual scene.
Once the malaria image was enhanced, the image proceeded with color conversion
to extract the color components for easing the segmentation process. Based on the initial
analysis and the findings presented in [
17
], the blue (B) color component was extracted
Diagnostics 2023,13, 511 14 of 36
from the RGB color model to facilitate the segmentation process. Afterwards, the B compo-
nents were segmented by 11 segmentation techniques consisting of local thresholding and
clustering techniques. Adaptive thresholding, Bradley, Niblack, Nick, Sauvola, Phansalkar,
Feng, KM, FCM, FKM, and EKM clustering were the segmentation algorithms used on the
thick smear images, in which the performance of local thresholding methods depends on
the window size selection. Windows sizes were used to get the local information by subdi-
viding the images into several blocks. In the adaptive thresholding method, the selected
parameter of window size was 65
×
65. As for Bradley, Nick, Sauvola, and Phansalkar,
the selected window size parameter was 15
×
15. In the Niblack method, the selected
parameter of window size was 250
×
250. Meanwhile, in Feng’s method, the local threshold
was determined by considering two local windows, such as primary and secondary. The
selected parameter of the primary window size was 5
×
5, and the secondary window size
was 150 ×150.
Meanwhile, the performance of clustering methods depended on the selection of
the cluster number. The clusters for KM, FCM, FKM, and EKM clustering were set into
three clusters, representing the malaria parasites, artifacts, and background region. Thus,
watershed segmentation with the Euclidean distance transform approach was used after
the segmentation process to separate the touching malaria parasite cells. The Euclidean
distance transform calculates the inequality within the neighborhood pixel to separate each
touching parasite.
After isolating the touching malaria parasites using watershed segmentation, some un-
wanted pixels, such as artifacts, staining precipitation, and WBCs, were still encountered in
the segmented images. These unwanted pixels were removed to enhance the segmentation
performance and avoid misidentifying the parasites in the further classification process.
In order to proceed with this procedure, two operations were applied to the resultant
segmented images. The first operation was to obtain the total pixel’s area in the ROI,
where the ROI referred to the malaria parasite. Here, the parasite’s size was one of the
crucial morphological characteristics used to differentiate between the malaria parasite
and non-malaria parasite. Meanwhile, the second operation was to remove the unwanted
pixels after the segmentation process. In order to obtain the total pixel’s area in the ROI, a
regional feature descriptor was used to extract the area in pixels of the segmented malaria
parasite. Based on this operation, it was found that a typical malaria parasite size might
have a value of the area in pixels between 7 to 280 pixels. Thus, any regions that were not
in these pixel ranges were labeled as non-malaria parasites. By utilizing the morphological
operation process, any pixels lower than 7 pixels and higher than 280 pixels were removed
from the segmented image to obtain the final segmented image.
3.3.2. Thin Blood Smear Segmentation Procedure
The framework of the proposed image-processing procedure for thin smear segmenta-
tion is shown in Figure 4. In this study, the thin smear images were captured under various
conditions, as mentioned in Section 3.1. To obtain a fully segmented malaria parasite region
from thin smear samples, several image-processing techniques were utilized to process
the thin smear images. These processes include image enhancement, extraction of color
components, image segmentation, and removal of unwanted pixels.
An image enhancement technique was applied to thin smear images to improve the
visual quality of thin smear images. In addition, the thin smear images were also enhanced
for the image enhancement process using the MGCS technique to increase the image
contrast. Note that the required parameter of minimum and maximum percentage values
of minp = 0.5 and maxp = 10 were selected to produce new minimal and maximal RGB values
for the contrast stretching process. After the malaria image was enhanced, color conversion
was used to extract the color components and facilitate better segmentation. As for the thin
smear image, the green (G) component information was extracted from the enhanced image.
Subsequently, the G components were segmented by 11 segmentation techniques consisting
of local thresholding and clustering techniques. Here, the segmentation algorithms used
Diagnostics 2023,13, 511 15 of 36
on the thin smear images were adaptive thresholding, Bradley, Niblack, Nick, Sauvola,
Phansalkar, Feng, KM, FCM, FKM, and EKM clustering. As mentioned previously, the
performance of local thresholding methods depends on the window size selection. In
the thin smear image, the window size selection for adaptive thresholding, Bradley, Nick,
Sauvola, Phansalkar, and Niblack, including the Feng method, were 175
×
175. Meanwhile,
the cluster number for KM, FCM, FKM, and EKM clustering was set to the value of three
clusters, which stand for the malaria parasites, artifacts, and background region.
Diagnostics 2023, 13, x FOR PEER REVIEW 15 of 38
that were not in these pixel ranges were labeled as non-malaria parasites. By utilizing the
morphological operation process, any pixels lower than 7 pixels and higher than 280 pix-
els were removed from the segmented image to obtain the final segmented image.
3.3.2. Thin Blood Smear Segmentation Procedure
The framework of the proposed image-processing procedure for thin smear segmen-
tation is shown in Figure 4. In this study, the thin smear images were captured under
various conditions, as mentioned in Section 3.1. To obtain a fully segmented malaria par-
asite region from thin smear samples, several image-processing techniques were utilized
to process the thin smear images. These processes include image enhancement, extraction
of color components, image segmentation, and removal of unwanted pixels.
Figure 4. The framework of the proposed image-processing procedure for thin smear segmenta-
tion.
An image enhancement technique was applied to thin smear images to improve the
visual quality of thin smear images. In addition, the thin smear images were also enhanced
for the image enhancement process using the MGCS technique to increase the image con-
trast. Note that the required parameter of minimum and maximum percentage values of
𝑚𝑖𝑛𝑝 = 0.5 and 𝑚𝑎𝑥𝑝 = 10 were selected to produce new minimal and maximal RGB values
for the contrast stretching process. After the malaria image was enhanced, color conver-
sion was used to extract the color components and facilitate better segmentation. As for
the thin smear image, the green (G) component information was extracted from the en-
hanced image. Subsequently, the G components were segmented by 11 segmentation tech-
niques consisting of local thresholding and clustering techniques. Here, the segmentation
algorithms used on the thin smear images were adaptive thresholding, Bradley, Niblack,
Nick, Sauvola, Phansalkar, Feng, KM, FCM, FKM, and EKM clustering. As mentioned
previously, the performance of local thresholding methods depends on the window size
selection. In the thin smear image, the window size selection for adaptive thresholding,
Bradley, Nick, Sauvola, Phansalkar, and Niblack, including the Feng method, were 175 ×
175. Meanwhile, the cluster number for KM, FCM, FKM, and EKM clustering was set to
the value of three clusters, which stand for the malaria parasites, artifacts, and background
region.
After the segmented image was obtained through the image segmentation process,
some unwanted regions, such as small background pixels, were still encountered in the
image. Therefore, the median filter was used as a noise removal to remove these small
background pixels from the segmented image. It was performed by replacing a current
pixel’s value with its neighboring pixels’ median value. In addition, the median filter
could also be used to fill the small holes that might appear on the segmented malaria
parasite due to its good smoothing performance and preserving the edge well without
blurring it. Here, the neighborhood of n × n (n = 7) pixels was used because large neigh-
borhoods produce more severe smoothing. After segmentation and filtering processes
were performed on the malaria image, there was a tendency for the unwanted regions,
such as segmented RBCs, platelets, and artifacts, to appear on the segmented image. This
Figure 4.
The framework of the proposed image-processing procedure for thin smear segmentation.
After the segmented image was obtained through the image segmentation process,
some unwanted regions, such as small background pixels, were still encountered in the
image. Therefore, the median filter was used as a noise removal to remove these small
background pixels from the segmented image. It was performed by replacing a current
pixel’s value with its neighboring pixels’ median value. In addition, the median filter could
also be used to fill the small holes that might appear on the segmented malaria parasite due
to its good smoothing performance and preserving the edge well without blurring it. Here,
the neighborhood of n
×
n(n= 7) pixels was used because large neighborhoods produce
more severe smoothing. After segmentation and filtering processes were performed on
the malaria image, there was a tendency for the unwanted regions, such as segmented
RBCs, platelets, and artifacts, to appear on the segmented image. This can be due to some
properties, such as color and size, which these unwanted regions share with the malaria
parasite. Thus, these unwanted regions should be excluded from the segmented image to
enhance the segmentation performance and to avoid the misdiagnosis of the infected cell
in the further classification process.
In order to reduce this problem, a modified version of the conventional seed-based
region growing algorithm, namely the seeded region growing area extraction (SRGAE)
algorithm proposed by Harun et al. [
62
], was applied to the segmented image. This
algorithm was applied for two main purposes. First, calculate the total area in pixels for the
ROI, where the ROI is referred to as the malaria parasite. Second, remove any unwanted
regions that were bigger in size that cannot be cleaned using the 7
×
7 pixels median filter.
Based on analysis of malaria parasites that had been segmented manually in several malaria
images, it had been found that a typical malaria parasite (ring and mature trophozoite of
P. falciparum) might have a value of the area between 150 to 1000 pixels. Meanwhile, the
other types of the malaria parasite (schizont and gametocyte of P. falciparum; ring, mature
trophozoite, schizont, and gametocyte of P. vivax) might have a value of the area between
3500 to 30,000 pixels. Thus, regions not in these ranges were labeled non-malaria parasites
and excluded from the segmented image during the region-growing process to obtain the
final image. Here, the analysis of different segmentation techniques was done to find the
best segmentation technique that should be used for segmenting thin smear images in all
stages of thin blood smear images.
After analyzing different segmentation techniques, a standard segmentation frame-
work was developed using proposed image-processing procedures to segment malaria
parasites in all stages of thin blood smear images. The proposed procedures for the seg-
Diagnostics 2023,13, 511 16 of 36
mentation of the malaria parasite, consisting of the best image segmentation techniques
determined after the segmentation analysis, were shown in Algorithm 1. The procedures in
Algorithm 1 were conducted on ring, mature trophozoite, schizont, and gametocyte images
captured from the thin blood smears of P. falciparum and P. vivax samples. The detailed
explanation regarding the proposed image-processing procedures of Algorithm 1 if the
malaria parasite had not been segmented properly and a large number of RBCs were still
included in these images. Therefore, these types of images should encounter a second
segmentation process. These processes include enhancing the malaria image using the
MGCS technique with minp = 0.05% and maxp = 10%, extracting the green component image
from the enhanced image, and segmenting the image using the best image segmentation
techniques determined after performing the analysis of segmentation previously.
After a properly segmented malaria image was obtained, the image was filtered using
a media filter to remove small background pixels and provide other segmented regions. At
this time, the neighborhood of 3
×
3 pixels was chosen to prevent the severe smoothing
effect on the segmented image. Afterwards, the SRGAE algorithm was applied to remove
the large unwanted regions in the malaria image. When applying the SRGAE algorithm,
regions not in the 150–1000 pixels and 3500–30,000 pixels were considered non-malaria
parasites and removed from the image during this process. Later, the holes inside the
malaria parasite were filled by applying region filling using a 5
×
5 pixels minimum filter.
Subsequently, 5
×
5 pixels of erosion process was performed to erode the boundary so that
the shape of the malaria parasite was well defined. If the unwanted regions appeared on
the image, the image would again be processed using the SRGAE algorithm. By applying
the proposed segmentation procedures, a smoother malaria parasite region and a clean
segmented malaria image without or less appearance of RBCs region were obtained. The
implementations of the proposed procedures for the segmentation of malaria parasite were
summarized as follows:
Algorithm 1: Procedures for segmentation of thin blood smears of P. falciparum and P. vivax
Input: Ring, trophozoite, schizont, and gametocyte images
Output: Segmented images of P. falciparum and P. vivax
Procedures:
1. Apply the MGCS contrast enhancement technique to the original malaria image.
2. Extract the G colour component from the enhanced RGB image.
3.
Apply the selected segmentation technique that had been determined from the analysis of different segmentation techniques.
4. Check the area of background region, Area[bg] that satisfies this condition,
a.
If the value of Area[bg] >= 450,000 pixels, apply the 3×3 pixels median filter.
b.
Else, go to step 1.
5. Apply the seeded region growing area extraction algorithm.
Note: Regions not in the ranges of 150–1000 and 3500–30,000 were labeled as non-malaria parasites and excluded from the
segmented image during this process.
6. Apply the 5 ×5 pixels minimum filter.
7. Perform the 5 ×5 pixels of the erosion process.
8. Check the area of the foreground region, Area[fg] that satisfies this condition,
a.
If the value of Area[fg] >= 9000 pixels, apply the seeded region growing area extraction algorithm
Note: Regions not in the 3500–30,000 range were labeled as non-malaria parasites and excluded from the segmented
image during this process.
b.
Colorize the segmented image based on the enhanced RGB image.
9. End.
Diagnostics 2023,13, 511 17 of 36
3.4. Evaluation of Image Segmentation Quality
In the present study, image segmentation quality was evaluated using three objective
indices, accuracy, sensitivity, and specificity. The quality of segmentation of an image
was determined based on the pixel similarity of the resultant segmented image against
the manually segmented image. The manually segmented image was obtained by man-
ually editing the original malaria image using Adobe Photoshop CS5.1 software, so that
the image only consists of the malaria parasite. In contrast, the other regions had been
removed from the image. Then, the lab technician validated the visual evaluation of the
resultant manual segmented images. In this research, the primary criterion for assessing
the performance of image segmentation was based on its ability to produce a fully seg-
mented malaria parasite region and segment the malaria parasite from its complicated
blood cell background. The accuracy, sensitivity, and specificity were defined based on
Equations (8)–(10), respectively [63].
Accuracy =TP +TN
TP +T N +FP +FN ×100 (8)
S peci f ici ty =TP
TP +F N ×100 (9)
Sensitivity =TN
TN +FP ×100 (10)
Consequently, for thin blood smears, the performances of the proposed segmentation
method were further evaluated using recall, precision, and F1-score. Precision and recall
are used to assess whether the segmented image is over-segmented or under-segmented.
A low recall value indicated under-segment, whereas a low precision value indicated
over-segment. Here, recall and sensitivity were comparable. A good segmentation would
result in an F1-score of 1, which combines precision and recall as a single statistical measure
for the segmented image [63].
Precision =TP
TP +FP (11)
Recall =TP
TP +F N (12)
F1score =2×Precision ×Recal l
Precision +Recall (13)
3.5. Feature Extraction for Thick and Thin Blood Smear Images
To classify the malaria parasite or non-malaria parasite successfully, the combination
of different types of features, such as size, shape, and color, was used to represent the types
of the malaria parasite. For thick smear, a number 32 input features consisting of size,
shape, texture, and color-based features had been extracted from the segmented objects,
which were given as inputs to the intelligent classifiers. The features used on the segmented
thick blood smear images in this study can be classified into four main categories. These
features are:
Size-based feature: Area, perimeter, solidity, and eccentricity of the segmented object.
Shape-based feature: Hu’s moment invariant of the segmented object.
Texture-based feature: Haralick and Hausdorff dimension.
Color-based feature: Mean and standard deviation of the segmented object’s red,
green, and blue color components.
Meanwhile, for thin smear, 28 input features consisting of size, shape, and color-
based features were extracted from the segmented objects and were given as inputs to the
intelligent classifiers. The features used in this study’s segmented thin blood smear images
can be classified into three main categories. These features are:
Diagnostics 2023,13, 511 18 of 36
Size-based feature: area of the segmented object.
Shape-based feature: Second and third order central moment, affine invariant moment,
and Hu’s moment invariant of the segmented object.
Color-based feature: Mean and variance of red, green, blue, intensity, and saturation
color components of the segmented object.
3.6. Malaria Diagnosis Using Intelligent Classifier
After the features had been extracted, the suitability of utilizing these features for the
classification of the thick and thin smear blood samples was tested using three different
classifiers. These three different classifiers were single-hidden layer feed-forward neural
networks (SLFN) trained by ELM and OS-ELM, and RFs. The classifications of the indi-
vidual segmented object (malaria parasite and non-malaria parasite) inside the thick and
thin smear blood samples using the three intelligent classifiers consist of one task in thick
smear blood samples and five tasks in thin smear blood samples. As for thick smear, the
classification task was to classify malaria and non-malaria parasites. Meanwhile, five tasks
in thin smear are:
Task 1: Classification between the malaria parasite and non-malaria parasite.
Task 2: Classification of malaria parasite as either P. falciparum or P. vivax species.
Task 3: Classification of P. falciparum species into the ring, mature trophozoite, schizont,
and gametocyte stages.
Task 4: Classification of P. vivax species into the ring, mature trophozoite, schizont,
and gametocyte stages.
Task 5: Classification of the individual segmented object inside the malaria sample into
eight categories, such as P. falciparum (Ring, Trophozoite, Schizont, and Gametocyte)
and P. vivax (Ring, Trophozoite, Schizont, and Gametocyte).
Thick blood smear training and validation data distributions were tabulated in Table 1,
whereas thin blood smear training, validation, and testing data distributions were tabulated in
Table 2. In an attempt to classify the malaria parasite and non-malaria parasite inside the thick
blood smear sample, a number of 7500 segmented objects were obtained from 300 malaria
images. A total of 4500 segmented objects were used as training data, and 3000 were used
as validation data. Meanwhile, in a thin blood smear sample, a number of 6300 segmented
objects had been obtained from 4035 malaria images. A total of 3380 segmented objects were
used as training data, while 1260 were used as validation and testing data.
Table 1. Training and validation data distributions for the thick smear.
Categories Training Data Validation Data
Non-malaria parasite 1000 1000
PF 1750 1000
PV 1750 1000
Total Data 4500 3000
Table 2. Training, validation, and testing data distributions for the thin smear.
Categories Training Data Validation Data Testing Data
Non-malaria parasite 720 240 240
PF_Ring 720 240 240
PF_Trophozoite 300 100 100
PF_Schizont 360 120 120
PF_Gametocyte 360 120 120
PV_Ring 300 100 100
PV_Trophozoite 300 100 100
PV_Schizont 360 120 120
PV_Gametocyte 360 120 120
Total Data 3780 1260 1260
Diagnostics 2023,13, 511 19 of 36
3.6.1. Classification Using Extreme Learning Machine (ELM)
In this study, the ELM classification process for training the SLFN proposed by Huang
et al. [
63
] was used to classify the target in thick and thin blood smear images. ELM is a
supervised batch learning algorithm that employs random computational nodes in the
hidden layer and computes its output weights analytically by solving a general linear
system equation. Basically, ELM is made of three layers: input, hidden, and output. Here,
a sigmoid activation function was used for both hidden and output nodes. Note that
these activation functions can effectively avoid the insufficient expression ability of neural
networks caused by linear mapping. During the ELM training process, the analysis of
finding the best number of states and the best number of hidden nodes was conducted to
obtain the best classification result. Here, the number of hidden nodes was optimized by
varying the hidden nodes from 1 to 50 with the step of 1. The state is a seed that can be
used to initialize weights to get consistent results. As for the analysis of states, the number
of states varied between the value of 1 and 4 to optimize the classification performance.
The full implementation of ELM for training an SLFN could be referred to in [64].
3.6.2. Classification Using Online Sequential Extreme Learning Machine (OS-ELM)
In this study, the next classification process was conducted using an SLFN trained
by OS-ELM, which was proposed by Liang et al. [
47
]. OS-ELM is an upgraded online
sequential version of the ELM. As for the OS-ELM, this algorithm can learn the training
data one-by-one and chunk-by-chunk, discarding the data for which training had already
been done [
47
]. Similar to ELM, the analysis of finding the best number of states and
hidden nodes was conducted to obtain the best classification result. However, in sequential
learning, only one epoch was needed, and the training stopped right after all the training
patterns were processed. In terms of performance, the OS-ELM has been proven to be
faster than the other well-known sequential algorithms and produces better generalization
performance with lower training time. The full implementation of OS-ELM for training an
SLFN could be referred to in Liang et al. [47].
3.6.3. Classification Using Random Forest
RF is one of the potential algorithms for building classifiers introduced by Breiman [
65
].
This classifier consists of many binary decision trees, where each tree is a classifier by itself,
given a certain weight for its classification output. A binary decision tree is a method of
using nodes in a tree structure to test the attributes of a dataset. The results of these tests
were used to split the training data into subsets, which were then passed onto the next
layer of the tree. This continues until each subset at a node contains only one class. As an
ensemble of models, the RF used majority voting for the classification task and averaging
for regression to make a final prediction [
65
]. For classification using RF, the decision
tree algorithm used during the classification process is Classification and Regression Tree
(CART) [
65
]. In order to obtain the best classification result in this study, the optimization
parameter for the number of trees in the forest was analyzed. Therefore, the number of
trees would vary from 1 to 50 to optimize this parameter. Note that the procedures for
finding the best number of trees were applied for each classification task.
4. Results and Discussion
This section provides the results of segmentation performance using various seg-
mentation techniques and the result of classification performance of thick and thin blood
smears using three different intelligent classifiers. Here, a total of 11 segmentation tech-
niques: adaptive threshold, Niblack, Sauvola, Phansalkar, Nick, Bradley, Feng, KM, FKM,
EKM, and FCM have been used on thick and thin blood smear images of P. falciparum
and P. vivax species to obtain the best segmented resultant image as can be seen from
Figures 57, respectively.
Diagnostics 2023,13, 511 20 of 36
Diagnostics 2023, 13, x FOR PEER REVIEW 20 of 38
nodes was conducted to obtain the best classification result. However, in sequential learn-
ing, only one epoch was needed, and the training stopped right after all the training pat-
terns were processed. In terms of performance, the OS-ELM