A preview of this full-text is provided by Springer Nature.
Content available from Multimedia Tools and Applications
This content is subject to copyright. Terms and conditions apply.
Vol.:(0123456789)
Multimedia Tools and Applications
https://doi.org/10.1007/s11042-024-19638-2
1 3
A lightweight deep learning model withknowledge
distillation forpulmonary diseases detection inchest X‑rays
MohammedA.Asham1· AsmaA.Al‑Shargabi2· RaeedAl‑Sabri1· IbrahimMeftah3
Received: 18 January 2024 / Revised: 31 May 2024 / Accepted: 7 June 2024
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024
Abstract
Accurate and timely diagnosis of pulmonary diseases is critical in the field of medical
imaging. While deep learning models have shown promise in this regard, the current meth-
ods for developing such models often require extensive computing resources and complex
procedures, rendering them impractical. This study focuses on the development of a light-
weight deep-learning model for the detection of pulmonary diseases. Leveraging the bene-
fits of knowledge distillation (KD) and the integration of the ConvMixer block, we propose
a novel lightweight student model based on the MobileNet architecture. The methodology
begins with training multiple teacher model candidates to identify the most suitable teacher
model. Subsequently, KD is employed, utilizing the insights of this robust teacher model
to enhance the performance of the student model. The objective is to reduce the student
model’s parameter size and computational complexity while preserving its diagnostic
accuracy. We perform an in-depth analysis of our proposed model’s performance compared
to various well-established pre-trained student models, including MobileNetV2, ResNet50,
InceptionV3, Xception, and NasNetMobile. Through extensive experimentation and evalu-
ation across diverse datasets, including chest X-rays of different pulmonary diseases such
as pneumonia, COVID-19, tuberculosis, and pneumothorax, we demonstrate the robust-
ness and effectiveness of our proposed model in diagnosing various chest infections. Our
model showcases superior performance, achieving an impressive classification accuracy of
97.92%. We emphasize the significant reduction in model complexity, with 0.63 million
parameters, allowing for efficient inference and rapid prediction times, rendering it ideal
for resource-constrained environments. Outperforming various pre-trained student models
in terms of overall performance and computation cost, our findings underscore the effec-
tiveness of the proposed KD strategy and the integration of the ConvMixer block. This
highlights the importance of incorporating advanced techniques and innovative architec-
tural elements in the development of highly effective models for medical image analysis.
Keywords Pneumonia Detection· Knowledge Distillation· Transfer learning· ConvMixer
Block· Chest Infection Classification· Medical Imaging
Extended author information available on the last page of the article
Content courtesy of Springer Nature, terms of use apply. Rights reserved.