ArticlePublisher preview available

A new approach for microscopic diagnosis of malaria parasites in thick blood smears using pre-trained deep learning models

Authors:
To read the full-text of this research, you can request a copy directly from the authors.

Abstract and Figures

One of the deadly endemic diseases in sub-Saharan Africa is malaria. Its prevalence is promoted by lack of sufficient expertise to carry out accurate and timely diagnosis using the standard microscopy method. Where lab technicians are available, the results are usually subjective due to variations in expert judgement. To address this challenge, prompt interventions to improve disease control are needed. The emerging technologies of machine learning that can learn complex image patterns have accelerated research in medical image analysis. In this study, on a dataset of thick blood smear images, we evaluate and compare performance of three pre-trained deep learning architectures namely; faster regional convolutional neural network (faster R-CNN), single-shot multi-box detector (SSD) and RetinaNet through a Tensorflow object detection API. Data augmentation method was applied to optimise performance of the meta architectures. The possibility for mobile phone detector deployment was also investigated. The results revealed that faster R-CNN was the best trained model with a mean average precision of over 0.94 and SSD, was the best model for mobile deployment. We therefore deduce that faster R-CNN is best suited for obtaining high rates of accuracy in malaria detection while SDD is best suited for mobile deployment.
This content is subject to copyright. Terms and conditions apply.
Vol.:(0123456789)
SN Applied Sciences (2020) 2:1255 | https://doi.org/10.1007/s42452-020-3000-0
Research Article
A new approach formicroscopic diagnosis ofmalaria parasites inthick
blood smears using pre‑trained deep learning models
RoseNakasi1· ErnestMwebaze2· AminahZawedde1· JeremyTusubira1· BenjaminAkera1· GilbertMaiga1
Received: 6 December 2019 / Accepted: 1 June 2020 / Published online: 20 June 2020
© Springer Nature Switzerland AG 2020
Abstract
One of the deadly endemic diseases in sub-Saharan Africa is malaria. Its prevalence is promoted by lack of sucient
expertise to carry out accurate and timely diagnosis using the standard microscopy method. Where lab technicians are
available, the results are usually subjective due to variations in expert judgement. To address this challenge, prompt
interventions to improve disease control are needed. The emerging technologies of machine learning that can learn com-
plex image patterns have accelerated research in medical image analysis. In this study, on a dataset of thick blood smear
images, we evaluate and compare performance of three pre-trained deep learning architectures namely; faster regional
convolutional neural network (faster R-CNN), single-shot multi-box detector (SSD) and RetinaNet through a Tensorow
object detection API. Data augmentation method was applied to optimise performance of the meta architectures. The
possibility for mobile phone detector deployment was also investigated. The results revealed that faster R-CNN was the
best trained model with a mean average precision of over 0.94 and SSD, was the best model for mobile deployment. We
therefore deduce that faster R-CNN is best suited for obtaining high rates of accuracy in malaria detection while SDD is
best suited for mobile deployment.
Keywords Deep learning· Malaria detection· Thick blood smear· Mobile detector
1 Introduction
Malaria is one of the top causes of death in sub-Saharan
Africa[1]. Of the 438,000 malaria cases registered, an
estimated 92% resulted in deaths; two-thirds of which
occurred among children under 5years of age[1]. In highly
endemic areas like Uganda, Malaria is the leading cause of
death accounting for over 27% of loss of lives[2]. There-
fore, detecting malaria parasites is key to malaria diagnosis
as it contributes immensely to the prevention and treat-
ment of the deadly disease[3].
The conventional approach for the diagnosis of malaria
is microscopy[3, 4] that requires interpretation of the
results by a skilled technician. Using this approach, a
sample of blood is drawn from a patient and smeared
on a glass slide stained with Giemsa. This blood sample
can be either thin or thick. A thick smear is more ecient
for parasite detection with an 11 times higher sensitivity
rate[5]. Although higher accuracy has been recorded with
thick blood smears, most of the research in automation of
malaria diagnosis has been done with thin blood smears
thus calling for more research on thick smears[6].
Furthermore, there are few skilled lab technicians in the
highly endemic areas to eectively interpret the micros-
copy diagnosis results. When they do, the interpretation
of the results is aected by the level of human variabil-
ity in observation of the parasites which leads to bias in
Supported by SIDA.
* Rose Nakasi, g.nakasirose@gmail.com | 1College ofComputing andInformation Sciences, Makerere University, Kampala,
Uganda. 2Makerere AI Research, Kampala, Uganda.
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
... However, recent developments in pre-trained deep learning models have the potential to revolutionize medical image analysis, especially in scenarios where data are limited and heterogeneous [23]. These models [24][25][26] can serve as feature extractors for a new model trained on a dataset with fewer images, improving performance in different tasks of computer vision. Recent studies have applied pre-trained deep learning models to the classification and detection tasks of differentiating infected parasitized from microscopic images, achieving classification and detection accuracies above 95% and outperforming traditional machine learning algorithms [27]. ...
... Although YOLOv8 is the latest version, there is still ongoing debate about the comparisons between the YOLO versions. While YOLOv5 has higher precision and speed than other state-of-the-art algorithms, such as Faster R-CNN [32] and SSD [26], YOLOv5 has higher accuracy and detection speed compared to YOLOv4 and YOLOv8. However, due to the complexity of the YOLOv8 network, a more simplified version of YOLOv5 has been designed to maximize detection speed and improve computational efficiency. ...
Article
Full-text available
Intestinal parasitic infections pose a grave threat to human health, particularly in tropical and subtropical regions. The traditional manual microscopy system of intestinal parasite detection remains the gold standard procedure for diagnosing parasite cysts or eggs. This approach is costly, time-consuming (30 min per sample), highly tedious, and requires a specialist. However, computer vision, based on deep learning, has made great strides in recent years. Despite the significant advances in deep convolutional neural network-based architectures, little research has been conducted to explore these techniques’ potential in parasitology, specifically for intestinal parasites. This research presents a novel proposal for state-of-the-art transfer learning architecture for the detection and classification of intestinal parasite eggs from images. The ultimate goal is to ensure prompt treatment for patients while also alleviating the burden on experts. Our approach comprised two main stages: image pre-processing and augmentation in the first stage, and YOLOv5 algorithms for detection and classification in the second stage, followed by performance comparison based on different parameters. Remarkably, our algorithms achieved a mean average precision of approximately 97% and a detection time of only 8.5 ms per sample for a dataset of 5393 intestinal parasite images. This innovative approach holds tremendous potential to form a solid theoretical basis for real-time detection and classification in routine clinical examinations, addressing the increasing demand and accelerating the diagnostic process. Our research contributes to the development of cutting-edge technologies for the efficient and accurate detection of intestinal parasite eggs, advancing the field of medical imaging and diagnosis.
... Transfer learning refers to a machine learning paradigm whereby a previously trained model is utilized to improve performance on a related, but different task (Nakasi et al. 2020;Minaee et al. 2020;Choi et al. 2017). The work of Yoo et al. 2020a; Asnaoui and Chawki 2020) used the Inception-V3 model for transfer learning, which entails leveraging the knowledge garnered from training a model on a given dataset and applying it to another dataset that bears some semblance to the initial one, with the aim of enhancing the model's efficacy (Kermany et al. 2018;Yoo et al. 2019). ...
Article
Full-text available
Inflammations that result in severe pharyngitis are frequently brought on by streptococcal pharyngitis. A low average survival rate for oral cancer distinguishes it as an aggressive illness. Tonsillitis is a disease of viral nature that can lead to severe obstructive pulmonary disease, ischemic heart disease and kidney disease. The worldwide COVID-19 pandemic has rekindled interest in using telemedicine to treat patients with pharyngitis, oral cancer and tonsillitis. Deep learning technology has advanced quickly, making it possible to complete categorization training tasks more quickly. The technique of Bag of Visual Words (BOVW) had shown remarkable improvement in accuracy of classification of images. On the other hand, the training of Convolutional Neural Network (CNN) on problem specific dataset blended with transfer learning from popular models like Inception-v3, MobileNet-v2 is taking hours of time. So, in this paper, first we have proposed a improved bag of visual words and results were presented. Secondly, we have applied feature fusion-based approach which mixes the features computed from improved bag of visual words and features taken from Convolutional Neural Network where training took place on four classes (Pharyngitis, Oral Cancer, Tonsillitis, and Healthy) of images with transfer learning for relatively low number of epochs. These features from both CNN and bag of visual words are given to ML models like Light Gradient Boosting Machine and Extreme Gradient Boosting. The experimental results showed significant improvement in accuracy, precision and recall of machine learning model in proposed two cases.
... Although YOLOv8 is the latest version, there is still ongoing debate about the comparisons between the YOLO versions. While YOLOv5 has higher precision and speed than other state-of-the-art algorithms like Faster R-CNN [26] and SSD [27], YOLOv5 has higher accuracy and detection speed compared to YOLOv4 and YOLOv8. However, due to the complexity of the YOLOv8 network, a more simplified version of YOLOv5 has been designed to maximize detection speed and improve computational efficiency. ...
Preprint
Full-text available
Intestinal parasitic infections pose a grave threat to human health, particularly in tropical and subtropical regions. The traditional manual microscopy system of intestinal parasites detection remains the golden standard procedure for the diagnosis of parasites cyst or eggs, but this approach is costly time-consuming (30min/sample), highly tedious, and also required specialist. However, computer vision based on deep learning has made great stride in recent time. Despite the significant advances in deep convolutional neural network-based architectures, little research has been conducted to explore the potential of these techniques in the field of parasitology, specifically for intestinal parasites. Therefore, the goal of this research is to evaluate the performance of proposed state-of-the-art transfer learning architecture for detecting and classifying intestinal parasite eggs from images. We would ensure that patients receive prompt treatment while also relieving experts of extra work if we used such an architecture. Here, in stage first, we applied image pre-processing and augmentation to the dataset, and in stage second, we utilized the YOLOv5 algorithms for detection and classification and then compared their performance based on different parameters. Our algorithms achieved a mean average precision of 97% approximatiely and 8.5 ms detection time per sample for 5,393 intestinal parasite images. Thus, this approach may form a solid theoretical basis for real-time detection and classification in routine clinical examinations while accelerating the process to satisfy increasing demand.
... Various object detection pre-trained models such as Faster R-CNN, Single-shot multi-box detector, and RetinaNet were utilised for malaria parasite detection. The results demonstrated that Faster R-CNN is the most well-trained model with an average precision of over 0.94 [29]. For automatic diagnosis of malaria, stacked CNN [30,31] and an ensemble of residual networks [32] were recently used, with accuracy rate of 99.96% and 98.08%, respectively. ...
Chapter
Manual evaluation of TB bacteria from Ziehl-Neelsen images is a tedious process. Computer Aided Diagnosis for image analysis helps increase the throughput. In Ziehl-Neelsen sputum smear images, bacilli appear very small and often overlap with WBC. This results in a setback in the automation process. Recent advances in CNNs in solving medical image-related diagnosis help in faster diagnosis. The present study is focused on the application of an object detection model in TB bacteria detection process. The study showed that the RetinaNet-based object detection model is suitable for microscopic ZN images resulting in an Average Precision of 0. 91 for WBC and 0.94 for bacilli bacteria.
Article
Full-text available
Squamous cell carcinoma is the most common type of cancer that occurs in squamous cells of epithelial tissue. Histopathological evaluation of tissue samples is the gold standard approach used for carcinoma diagnosis. SCC detection based on various histopathological features often employs traditional machine learning approaches or pixel-based deep CNN models. This study aims to detect keratin pearl, the most prominent SCC feature, by implementing RetinaNet one-stage object detector. Further, we enhance the model performance by incorporating an attention module. The proposed method is more efficient in detection of small keratin pearls. This is the first work detecting keratin pearl resorting to the object detection technique to the extent of our knowledge. We conducted a comprehensive assessment of the model both quantitatively and qualitatively. The experimental results demonstrate that the proposed approach enhanced the mAP by about 4% compared to default RetinaNet model.
Article
Full-text available
Background & objectives: The highly sensitive method for a true understanding of malaria prevalence is of utmost importance for India's elimination strategy. The PCR reaction type with rapid detection, cost-effectiveness, and less workforce should be preferable. Multiplex PCR type accomplishes the present requirement by saving time and resources to find true surveillance data for malaria, especially in low-parasitemia/asymptomatic groups or populations. Methods: The present study focuses on designing multiplex PCR (mPCR) to detect simultaneously Plasmodium genus (PAN) and two common Plasmodium species found in India. It is compared to standard nested PCR on 195 clinical samples to diagnose malaria. The mPCR was designed with a minimum number of primers, leading to less clogging and effective and enhanced detection. It contains one common reverse primer and three forward primers amplifying three targeted genes corresponding to P. falciparum, P. vivax, and Plasmodium genus. Results: The sensitivity and specificity for mPCR were 94.06 and 95.74, respectively. The limit of detection for mPCR was 0.1 parasites/µl. The study has shown a ROC curve area for the mPCR of 0.949 for Plasmodium genus and P. falciparum and 0.897 for P. vivax with standard nPCR. Interpretation & conclusion: The mPCR is rapid in detecting species together, cost-effective, and requires fewer human resources than the standard nPCR. Therefore, the mPCR can be used as an alternative technique for the higher sensitive detection of the malaria parasite. It could also become a vital tool for determining malaria prevalence, facilitating the application of the most effective measures.
Chapter
Full-text available
Artificial Intelligence (AI) technology is becoming one of the master instruments in the new world for many companies worldwide. The future of learning and the process of development have been bundled with innovations in the coming years. The application of AI helps to enhance the quality of employees through training and development at par with industrial needs. AI involves in the functional areas of the human resource segment such as recruitment, training, performance assessment, and employee retention. AI creates a next-gen technological workplace that succeeds in the seamless association between organisational systems and employees. Hence, human capitals are not obsolescent, but their efficiencies are bolstered by upcoming technology. In reality, AI offers organisations with a good amount of freeing up resources for greater tasks. The AI-enabled advanced software will read the employee based on their documents entered. Moreover, the AI application helps to identify various issues of the employees through staff data related to their stress level, late coming, unnecessary leaves, and so on. It has great market potential and is applicable not only in HR operations but also across the organisation. Therefore, AI helps the HR professional to make the decision-making process easier and smarter.
Chapter
Malaria is a threat to the universe, especially in Asian and African countries. In malaria-affected nations, malaria is diagnosed through a visual examination of blood under a microscope. The reliability of the result depends upon the skills of the lab technician. Any Individual or community will be affected if they are misdiagnosed. To overcome this issue, an automated malaria diagnosis system using nearly 27,000 thin blood smear images has been proposed. It involves two steps: 1. Pre-processing the digitized blood smear images to remove the noise and segmenting them to extract the Region of Interest. 2. A lightweight customized deep learning model to classify digital smears either as infected or normal. We have also implemented an automated malaria detection system using various pre-trained models. The result revealed that our model performed with an accuracy of over 99% and a sensitivity of 100%.
Article
Full-text available
Malaria is a blood disease caused by the Plasmodium parasites transmitted through the bite of female Anopheles mosquito. Microscopists commonly examine thick and thin blood smears to diagnose disease and compute parasitemia. However, their accuracy depends on smear quality and expertise in classifying and counting parasitized and uninfected cells. Such an examination could be arduous for large-scale diagnoses resulting in poor quality. State-of-the-art image-analysis based computer-aided diagnosis (CADx) methods using machine learning (ML) techniques, applied to microscopic images of the smears using hand-engineered features demand expertise in analyzing morphological, textural, and positional variations of the region of interest (ROI). In contrast, Convolutional Neural Networks (CNN), a class of deep learning (DL) models promise highly scalable and superior results with end-to-end feature extraction and classification. Automated malaria screening using DL techniques could, therefore, serve as an effective diagnostic aid. In this study, we evaluate the performance of pre-trained CNN based DL models as feature extractors toward classifying parasitized and uninfected cells to aid in improved disease screening. We experimentally determine the optimal model layers for feature extraction from the underlying data. Statistical validation of the results demonstrates the use of pre-trained CNNs as a promising tool for feature extraction for this purpose.
Article
Full-text available
Malaria remains a major burden on global health, with roughly 200 million cases worldwide and more than 400,000 deaths per year. Besides biomedical research and political efforts, modern information technology is playing a key role in many attempts at fighting the disease. One of the barriers toward a successful mortality reduction has been inadequate malaria diagnosis in particular. To improve diagnosis, image analysis software and machine learning methods have been used to quantify parasitemia in microscopic blood slides. This article gives an overview of these techniques and discusses the current developments in image analysis and machine learning for microscopic malaria diagnosis. We organize the different approaches published in the literature according to the techniques used for imaging, image preprocessing, parasite and cell segmentation, feature computation, and automatic cell classification. Readers will find the different techniques listed in tables, with the relevant articles cited next to them, for both thin and thick blood smear images. We also discussed the latest developments in sections devoted to deep learning and smartphone technology for future malaria diagnosis.
Article
Full-text available
The ability to automatically learn task specific feature representations has led to a huge success of deep learning methods. When large training data is scarce, such as in medical imaging problems, transfer learning has been very effective. In this paper, we systematically investigate the process of transferring a Convolutional Neural Network, trained on ImageNet images to perform image classification, to kidney detection problem in ultrasound images. We study how the detection performance depends on the extent of transfer. We show that a transferred and tuned CNN can outperform a state-of-the-art feature engineered pipeline and a hybridization of these two techniques achieves 20\% higher performance. We also investigate how the evolution of intermediate response images from our network. Finally, we compare these responses to state-of-the-art image processing filters in order to gain greater insight into how transfer learning is able to effectively manage widely varying imaging regimes.
Conference Paper
Full-text available
We present a method for detecting objects in images using a single deep neural network. Our approach, named SSD, discretizes the output space of bounding boxes into a set of default boxes over different aspect ratios and scales per feature map location. At prediction time, the network generates scores for the presence of each object category in each default box and produces adjustments to the box to better match the object shape. Additionally, the network combines predictions from multiple feature maps with different resolutions to naturally handle objects of various sizes. SSD is simple relative to methods that require object proposals because it completely eliminates proposal generation and subsequent pixel or feature resampling stages and encapsulates all computation in a single network. This makes SSD easy to train and straightforward to integrate into systems that require a detection component. Experimental results on the PASCAL VOC, COCO, and ILSVRC datasets confirm that SSD has competitive accuracy to methods that utilize an additional object proposal step and is much faster, while providing a unified framework for both training and inference. For \(300 \times 300\) input, SSD achieves 74.3 % mAP on VOC2007 test at 59 FPS on a Nvidia Titan X and for \(512 \times 512\) input, SSD achieves 76.9 % mAP, outperforming a comparable state of the art Faster R-CNN model. Compared to other single stage methods, SSD has much better accuracy even with a smaller input image size. Code is available at https:// github. com/ weiliu89/ caffe/ tree/ ssd.
Article
Full-text available
Machine learning and data mining techniques have been used in numerous real-world applications. An assumption of traditional machine learning methodologies is the training data and testing data are taken from the same domain, such that the input feature space and data distribution characteristics are the same. However, in some real-world machine learning scenarios, this assumption does not hold. There are cases where training data is expensive or difficult to collect. Therefore, there is a need to create high-performance learners trained with more easily obtained data from different domains. This methodology is referred to as transfer learning. This survey paper formally defines transfer learning, presents information on current solutions, and reviews applications applied to transfer learning. Lastly, there is information listed on software downloads for various transfer learning solutions and a discussion of possible future research work. The transfer learning solutions surveyed are independent of data size and can be applied to big data environments.
Article
Recent results indicate that the generic descriptors extracted from the convolutional neural networks are very powerful. This paper adds to the mounting evidence that this is indeed the case. We report on a series of experiments conducted for different recognition tasks using the publicly available code and model of the $backslash$overfeat network which was trained to perform object classification on ILSVRC13. We use features extracted from the $backslash$overfeat network as a generic image representation to tackle the diverse range of recognition tasks of object image classification, scene recognition, fine grained recognition, attribute detection and image retrieval applied to a diverse set of datasets. We selected these tasks and datasets as they gradually move further away from the original task and data the $backslash$overfeat network was trained to solve. Astonishingly, we report consistent superior results compared to the highly tuned state-of-the-art systems in all the visual classification tasks on various datasets. For instance retrieval it consistently outperforms low memory footprint methods except for sculptures dataset. The results are achieved using a linear SVM classifier (or $L2$ distance in case of retrieval) applied to a feature representation of size 4096 extracted from a layer in the net. The representations are further modified using simple augmentation techniques e.g. jittering. The results strongly suggest that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks.
Conference Paper
The ability to automatically learn task specific feature representations has led to a huge success of deep learning methods. When large training data is scarce, such as in medical imaging problems, transfer learning has been very effective. In this paper, we systematically investigate the process of transferring a Convolutional Neural Network, trained on ImageNet images to perform image classification, to kidney detection problem in ultrasound images. We study how the detection performance depends on the extent of transfer. We show that a transferred and tuned CNN can outperform a state-of-the-art feature engineered pipeline and a hybridization of these two techniques achieves 20 % higher performance. We also investigate how the evolution of intermediate response images from our network. Finally, we compare these responses to state-of-the-art image processing filters in order to gain greater insight into how transfer learning is able to effectively manage widely varying imaging regimes.
Article
Point of care diagnostics using microscopy and computer vision methods have been applied to a number of practical problems, and are particularly relevant to low-income, high disease burden areas. However, this is subject to the limitations in sensitivity and specificity of the computer vision methods used. In general, deep learning has recently revolutionised the field of computer vision, in some cases surpassing human performance for other object recognition tasks. In this paper, we evaluate the performance of deep convolutional neural networks on three different microscopy tasks: diagnosis of malaria in thick blood smears, tuberculosis in sputum samples, and intestinal parasite eggs in stool samples. In all cases accuracy is very high and substantially better than an alternative approach more representative of traditional medical imaging techniques.