Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning
Xinran Li, Zifan Liu, Shibo Chen, Jun Zhang

TL;DR
This paper introduces ICES, a novel method for multi-agent reinforcement learning that uses individual contribution-based intrinsic rewards to improve exploration in sparse reward environments, leveraging global transition information during training.
Contribution
The paper proposes ICES, a new approach that assesses individual agent contributions to guide exploration, separating exploration and exploitation policies for better learning.
Findings
ICES outperforms baselines in cooperative tasks with sparse rewards.
The method effectively guides agents to impactful actions during training.
Experimental results on GRF and SMAC show improved exploration capabilities.
Abstract
In multi-agent reinforcement learning (MARL), effective exploration is critical, especially in sparse reward environments. Although introducing global intrinsic rewards can foster exploration in such settings, it often complicates credit assignment among agents. To address this difficulty, we propose Individual Contributions as intrinsic Exploration Scaffolds (ICES), a novel approach to motivate exploration by assessing each agent's contribution from a global view. In particular, ICES constructs exploration scaffolds with Bayesian surprise, leveraging global transition information during centralized training. These scaffolds, used only in training, help to guide individual agents towards actions that significantly impact the global latent state transitions. Additionally, ICES separates exploration policies from exploitation policies, enabling the former to utilize privileged global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
