From General Relation Patterns to Task-Specific Decision-Making in Continual Multi-Agent Coordination

Chang Yao; Youfang Lin; Shoucheng Song; Hao Wu; Yuqing Ma; Shang Han; Kai Lv

arXiv:2507.06004·cs.MA·July 9, 2025

From General Relation Patterns to Task-Specific Decision-Making in Continual Multi-Agent Coordination

Chang Yao, Youfang Lin, Shoucheng Song, Hao Wu, Yuqing Ma, Shang Han, Kai Lv

PDF

Open Access

TL;DR

This paper introduces RPG, a novel method that leverages relation patterns and task-specific decision-makers to improve continual multi-agent reinforcement learning, effectively preventing forgetting and enabling zero-shot generalization.

Contribution

The paper proposes RPG, a new approach that uses relation patterns and hypernetworks to enhance task adaptation and mitigate catastrophic forgetting in Co-MARL.

Findings

01

RPG outperforms existing methods on SMAC and LBF benchmarks.

02

RPG effectively prevents catastrophic forgetting in continual learning.

03

RPG achieves zero-shot generalization to unseen tasks.

Abstract

Continual Multi-Agent Reinforcement Learning (Co-MARL) requires agents to address catastrophic forgetting issues while learning new coordination policies with the dynamics team. In this paper, we delve into the core of Co-MARL, namely Relation Patterns, which refer to agents' general understanding of interactions. In addition to generality, relation patterns exhibit task-specificity when mapped to different action spaces. To this end, we propose a novel method called General Relation Patterns-Guided Task-Specific Decision-Maker (RPG). In RPG, agents extract relation patterns from dynamic observation spaces using a relation capturer. These task-agnostic relation patterns are then mapped to different action spaces via a task-specific decision-maker generated by a conditional hypernetwork. To combat forgetting, we further introduce regularization items on both the relation capturer and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)