... Another extensively studied method to reduce the dimensionality of the policy space is to use a decentralized reinforcement learning (RL) training approach [63,64,65,66,67,68,69,70,71,72,73,74,75,24,76,77,78,79,80,81,67,82,83,84,85,86,87,88,89,25,29,31,33,37]. In this approach, vehicles are treated as homogeneous and uncooperative agents that independently choose their own optimal actions based on a shared Q-function. ...