Xiangbin Zhu’s scientific contributions

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (1)


Combining Dynamic Reward Shaping and Action Shaping for Coordinating Multi-Agent Learning
  • Conference Paper

November 2013

·

24 Reads

·

3 Citations

Xiangbin Zhu

·

·

Coordinating multi-agent reinforcement learning provides a promising approach to scaling learning in large cooperative multi-agent systems. It allows agents to learn local decision policies based on their local observations and rewards, and, meanwhile, coordinates agents' learning processes to ensure the global learning performance. One key question is that how coordination mechanisms impact learning algorithms so that agents' learning processes are guided and coordinated. This paper presents a new shaping approach that effectively integrates coordination mechanisms into local learning processes. This shaping approach uses two-level agent organization structures and combines reward shaping and action shaping. The higher-level agents dynamically and periodically produce the shaping heuristic knowledge based on the learning status of the lower-level agents. The lower-level agents then uses this knowledge to coordinate their local learning processes with other agents. Experimental results show our approach effectively speeds up the convergence of multi-agent learning in large systems.

Citations (1)


... It is extra information which is incorporated by the designer of the system and estimated on the basis of knowledge of the problem [24]. The studies related reward shaping can be categorized into single agent and multi-agent based on RL [25][26][27][28][29]. ...

Reference:

Cooperative Merging Control Based on Reinforcement Learning With Dynamic Waypoint
Combining Dynamic Reward Shaping and Action Shaping for Coordinating Multi-Agent Learning
  • Citing Conference Paper
  • November 2013