Puzzle it Out: Local-to-Global World Model for Offline Multi-Agent Reinforcement Learning

Sijia Li; Xinran Li; Shibo Chen; Jun Zhang

arXiv:2601.07463·cs.AI·March 1, 2026

Puzzle it Out: Local-to-Global World Model for Offline Multi-Agent Reinforcement Learning

Sijia Li, Xinran Li, Shibo Chen, Jun Zhang

PDF

Open Access

TL;DR

This paper introduces LOGO, a local-to-global world model for offline multi-agent reinforcement learning that improves data augmentation and policy generalization by leveraging local predictions and uncertainty-aware sampling.

Contribution

The paper proposes a novel local-to-global world model framework for offline MARL, enhancing prediction accuracy and data efficiency while reducing computational costs.

Findings

01

Outperforms state-of-the-art offline MARL methods on 8 benchmarks.

02

Effectively captures agent dependencies through local-to-global modeling.

03

Reduces computational overhead compared to ensemble-based approaches.

Abstract

Offline multi-agent reinforcement learning (MARL) aims to solve cooperative decision-making problems in multi-agent systems using pre-collected datasets. Existing offline MARL methods primarily constrain training within the dataset distribution, resulting in overly conservative policies that struggle to generalize beyond the support of the data. While model-based approaches offer a promising solution by expanding the original dataset with synthetic data generated from a learned world model, the high dimensionality, non-stationarity, and complexity of multi-agent systems make it challenging to accurately estimate the transitions and reward functions in offline MARL. Given the difficulty of directly modeling joint dynamics, we propose a local-to-global (LOGO) world model, a novel framework that leverages local predictions-which are easier to estimate-to infer global state dynamics, thus…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning