From General Relation Patterns to Task-Specific Decision-Making in Continual Multi-Agent Coordination
Chang Yao, Youfang Lin, Shoucheng Song, Hao Wu, Yuqing Ma, Shang Han, Kai Lv

TL;DR
This paper introduces RPG, a novel method that leverages relation patterns and task-specific decision-makers to improve continual multi-agent reinforcement learning, effectively preventing forgetting and enabling zero-shot generalization.
Contribution
The paper proposes RPG, a new approach that uses relation patterns and hypernetworks to enhance task adaptation and mitigate catastrophic forgetting in Co-MARL.
Findings
RPG outperforms existing methods on SMAC and LBF benchmarks.
RPG effectively prevents catastrophic forgetting in continual learning.
RPG achieves zero-shot generalization to unseen tasks.
Abstract
Continual Multi-Agent Reinforcement Learning (Co-MARL) requires agents to address catastrophic forgetting issues while learning new coordination policies with the dynamics team. In this paper, we delve into the core of Co-MARL, namely Relation Patterns, which refer to agents' general understanding of interactions. In addition to generality, relation patterns exhibit task-specificity when mapped to different action spaces. To this end, we propose a novel method called General Relation Patterns-Guided Task-Specific Decision-Maker (RPG). In RPG, agents extract relation patterns from dynamic observation spaces using a relation capturer. These task-agnostic relation patterns are then mapped to different action spaces via a task-specific decision-maker generated by a conditional hypernetwork. To combat forgetting, we further introduce regularization items on both the relation capturer and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)
