TL;DR
This paper introduces BAMS, a neuro-symbolic belief map system that improves multi-agent cooperative learning by providing consistent feedback, leading to faster training and better performance in complex environments.
Contribution
The paper proposes BAMS, a novel belief-map assisted training method that enhances message passing and learning efficiency in multi-agent systems using reinforcement learning.
Findings
BAMS reduced training epochs by 66%.
Agents with BAMS completed tasks 34.62% faster on average.
BAMS improved cooperation in complex environments.
Abstract
In a multi-agent system, agents share their local observations to gain global situational awareness for decision making and collaboration using a message passing system. When to send a message, how to encode a message, and how to leverage the received messages directly affect the effectiveness of the collaboration among agents. When training a multi-agent cooperative game using reinforcement learning (RL), the message passing system needs to be optimized together with the agent policies. This consequently increases the model's complexity and poses significant challenges to the convergence and performance of learning. To address this issue, we propose the Belief-map Assisted Multi-agent System (BAMS), which leverages a neuro-symbolic belief map to enhance training. The belief map decodes the agent's hidden state to provide a symbolic representation of the agent's understanding of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
