An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning
Tianpei Yang, Weixun Wang, Hongyao Tang, Jianye Hao, Zhaopeng Meng,, Hangyu Mao, Dong Li, Wulong Liu, Chengwei Zhang, Yujing Hu, Yingfeng Chen and, Changjie Fan

TL;DR
This paper introduces a novel multiagent transfer learning framework that improves MARL efficiency by modeling policy transfer as option learning and addressing partial observability challenges.
Contribution
It proposes MAPTF, a framework that learns when and which agent's policy to transfer, and introduces successor representation option learning to enhance accuracy under partial observability.
Findings
Significantly boosts MARL performance in discrete and continuous spaces.
Effectively models policy transfer as option learning.
Addresses partial observability with successor representation.
Abstract
Transfer Learning has shown great potential to enhance single-agent Reinforcement Learning (RL) efficiency. Similarly, Multiagent RL (MARL) can also be accelerated if agents can share knowledge with each other. However, it remains a problem of how an agent should learn from other agents. In this paper, we propose a novel Multiagent Policy Transfer Framework (MAPTF) to improve MARL efficiency. MAPTF learns which agent's policy is the best to reuse for each agent and when to terminate it by modeling multiagent policy transfer as the option learning problem. Furthermore, in practice, the option module can only collect all agent's local experiences for update due to the partial observability of the environment. While in this setting, each agent's experience may be inconsistent with each other, which may cause the inaccuracy and oscillation of the option-value's estimation. Therefore, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Smart Grid Energy Management
