Decentralizing Multi-Agent Reinforcement Learning with Temporal Causal Information

Jan Corazza; Hadi Partovi Aria; Hyohun Kim; Daniel Neider; Zhe Xu

arXiv:2506.07829·cs.LG·October 20, 2025

Decentralizing Multi-Agent Reinforcement Learning with Temporal Causal Information

Jan Corazza, Hadi Partovi Aria, Hyohun Kim, Daniel Neider, Zhe Xu

PDF

TL;DR

This paper explores how symbolic knowledge about temporal events can improve decentralized multi-agent reinforcement learning by ensuring policy compatibility and accelerating learning.

Contribution

It introduces formal tools for policy compatibility in DMARL and demonstrates that temporal symbolic knowledge speeds up learning.

Findings

01

Symbolic temporal knowledge enhances learning speed in DMARL.

02

Formal compatibility checks improve decentralized policy training.

03

Empirical results show faster convergence with symbolic information.

Abstract

Reinforcement learning (RL) algorithms can find an optimal policy for a single agent to accomplish a particular task. However, many real-world problems require multiple agents to collaborate in order to achieve a common goal. For example, a robot executing a task in a warehouse may require the assistance of a drone to retrieve items from high shelves. In Decentralized Multi-Agent RL (DMARL), agents learn independently and then combine their policies at execution time, but often must satisfy constraints on compatibility of local policies to ensure that they can achieve the global task when combined. In this paper, we study how providing high-level symbolic knowledge to agents can help address unique challenges of this setting, such as privacy constraints, communication limitations, and performance concerns. In particular, we extend the formal tools used to check the compatibility of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.