A preview of this full-text is provided by Springer Nature.
Content available from The Journal of Supercomputing
This content is subject to copyright. Terms and conditions apply.
Vol.:(0123456789)
The Journal of Supercomputing (2024) 80:17895–17922
https://doi.org/10.1007/s11227-024-06096-8
1 3
Qsmix: Q‑learning‑based task scheduling approach
formixed‑critical applications onheterogeneous
multi‑cores
FatemehAfshari1· AthenaAbdi1
Accepted: 20 March 2024 / Published online: 6 May 2024
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature
2024
Abstract
In this paper, a Q-learning-based task scheduling approach for mixed-critical appli-
cation on heterogeneous multi-cores (QSMix) to optimize their main design chal-
lenges is proposed. This approach employs reinforcement learning capabilities to
optimize execution time, power consumption, reliability and temperature of the het-
erogeneous multi-cores during task scheduling process. In QSMix, a reward function
is defined to consider all target design parameters simultaneously and is tuned based
on applying punishment for unwanted conditions during the learning. The learning
process of QSMix is led by utilizing the defined reward function during constructing
the Q-table for various execution scenarios. Afterward, the best solution is selected
from the constructed Q-table based on the system’s policy to achieve a near-opti-
mal solution that meets the existing trade-offs among objectives while considering
its constraints properly. To evaluate our proposed QSMix, several experiments are
performed to show its effectiveness in finding appropriate solutions and its gradual
behavior during learning process. Moreover, the performance of QSMix in terms
of optimizing the target design parameters is compared to various related research.
The results confirm that QSMix has average improvement about 9% over related
studies in joint optimization of execution time, power consumption, reliability and
temperature.
Keywords Task scheduling· Mixed-critical applications· Heterogeneous multi-
cores· Q-learning· Multi-objective optimization
* Athena Abdi
a_abdi@kntu.ac.ir
1 Faculty ofComputer Engineering, K. N. Toosi University ofTechnology, Tehran, Iran
Content courtesy of Springer Nature, terms of use apply. Rights reserved.