GAWM: Global-Aware World Model for Multi-Agent Reinforcement Learning

Zifeng Shi; Meiqin Liu; Senlin Zhang; Ronghao Zheng; Shanling Dong,; Ping Wei

arXiv:2501.10116·cs.MA·January 20, 2025

GAWM: Global-Aware World Model for Multi-Agent Reinforcement Learning

Zifeng Shi, Meiqin Liu, Senlin Zhang, Ronghao Zheng, Shanling Dong,, Ping Wei

PDF

Open Access

TL;DR

GAWM introduces a global-aware world model using Transformer architecture to improve sample efficiency and stability in multi-agent reinforcement learning, enabling better performance in complex environments.

Contribution

The paper proposes GAWM, a novel model-based MARL method that enhances global state representation with a Transformer, improving convergence and stability in complex multi-agent settings.

Findings

01

GAWM outperforms existing methods in SMAC benchmarks.

02

Enhanced global state representation improves training stability.

03

Method achieves superior convergence in complex environments.

Abstract

In recent years, Model-based Multi-Agent Reinforcement Learning (MARL) has demonstrated significant advantages over model-free methods in terms of sample efficiency by using independent environment dynamics world models for data sample augmentation. However, without considering the limited sample size, these methods still lag behind model-free methods in terms of final convergence performance and stability. This is primarily due to the world model's insufficient and unstable representation of global states in partially observable environments. This limitation hampers the ability to ensure global consistency in the data samples and results in a time-varying and unstable distribution mismatch between the pseudo data samples generated by the world model and the real samples. This issue becomes particularly pronounced in more complex multi-agent environments. To address this challenge, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics