RMIO: A Model-Based MARL Framework for Scenarios with Observation Loss in Some Agents
Zifeng Shi, Meiqin Liu, Senlin Zhang, Ronghao Zheng, Shanling Dong

TL;DR
RMIO is a model-based multi-agent reinforcement learning framework that effectively reconstructs missing observations and enhances decision-making stability in scenarios with observation loss, outperforming existing methods.
Contribution
RMIO introduces a novel approach combining world model-based reconstruction, inter-agent information sharing, and limited communication to handle observation loss in MARL.
Findings
Outperforms state-of-the-art in SMAC and MaMuJoCo environments.
Achieves better asymptotic convergence and policy robustness.
Effectively handles observation loss scenarios.
Abstract
In recent years, model-based reinforcement learning (MBRL) has emerged as a solution to address sample complexity in multi-agent reinforcement learning (MARL) by modeling agent-environment dynamics to improve sample efficiency. However, most MBRL methods assume complete and continuous observations from each agent during the inference stage, which can be overly idealistic in practical applications. A novel model-based MARL approach called RMIO is introduced to address this limitation, specifically designed for scenarios where observation is lost in some agent. RMIO leverages the world model to reconstruct missing observations, and further reduces reconstruction errors through inter-agent information integration to ensure stable multi-agent decision-making. Secondly, unlike CTCE methods such as MAMBA, RMIO adopts the CTDE paradigm in standard environment, and enabling limited…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Semantic Web and Ontologies
