Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning
Shariq Iqbal, Fei Sha

TL;DR
This paper introduces a hierarchical framework for multi-agent reinforcement learning that uses coordinated intrinsic rewards and dynamic exploration strategies to improve performance in sparse reward environments.
Contribution
It proposes a novel hierarchical approach that enables multi-agent coordination through intrinsic rewards and adaptive exploration mode selection.
Findings
Outperforms state-of-the-art methods in cooperative sparse reward tasks.
Effectively adapts exploration strategies to different multi-stage tasks.
Enhances coordination among agents through learned intrinsic reward mechanisms.
Abstract
Solving tasks with sparse rewards is one of the most important challenges in reinforcement learning. In the single-agent setting, this challenge is addressed by introducing intrinsic rewards that motivate agents to explore unseen regions of their state spaces; however, applying these techniques naively to the multi-agent setting results in agents exploring independently, without any coordination among themselves. Exploration in cooperative multi-agent settings can be accelerated and improved if agents coordinate their exploration. In this paper we introduce a framework for designing intrinsic rewards which consider what other agents have explored such that the agents can coordinate. Then, we develop an approach for learning how to dynamically select between several exploration modalities to maximize extrinsic rewards. Concretely, we formulate the approach as a hierarchical policy where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Robot Manipulation and Learning
