Puzzle it Out: Local-to-Global World Model for Offline Multi-Agent Reinforcement Learning
Sijia Li, Xinran Li, Shibo Chen, Jun Zhang

TL;DR
This paper introduces LOGO, a local-to-global world model for offline multi-agent reinforcement learning that improves data augmentation and policy generalization by leveraging local predictions and uncertainty-aware sampling.
Contribution
The paper proposes a novel local-to-global world model framework for offline MARL, enhancing prediction accuracy and data efficiency while reducing computational costs.
Findings
Outperforms state-of-the-art offline MARL methods on 8 benchmarks.
Effectively captures agent dependencies through local-to-global modeling.
Reduces computational overhead compared to ensemble-based approaches.
Abstract
Offline multi-agent reinforcement learning (MARL) aims to solve cooperative decision-making problems in multi-agent systems using pre-collected datasets. Existing offline MARL methods primarily constrain training within the dataset distribution, resulting in overly conservative policies that struggle to generalize beyond the support of the data. While model-based approaches offer a promising solution by expanding the original dataset with synthetic data generated from a learned world model, the high dimensionality, non-stationarity, and complexity of multi-agent systems make it challenging to accurately estimate the transitions and reward functions in offline MARL. Given the difficulty of directly modeling joint dynamics, we propose a local-to-global (LOGO) world model, a novel framework that leverages local predictions-which are easier to estimate-to infer global state dynamics, thus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
