NetWorld: Communication-Based Diffusion World Model for Multi-Agent Reinforcement Learning in Wireless Networks
Kechen Meng, Rongpeng Li, Yansha Deng, Zhifeng Zhao, and Honggang Zhang

TL;DR
NetWorld introduces a diffusion-based world model for multi-agent reinforcement learning in wireless networks, enabling few-shot generalization, reducing online interactions, and improving scalability and efficiency in resource allocation tasks.
Contribution
The paper proposes a novel diffusion world model framework with a two-stage training process and a lightweight communication mechanism for scalable multi-agent learning in wireless networks.
Findings
Outperforms MARL baselines in three wireless network tasks.
Achieves higher sample efficiency and generalization.
Demonstrates scalability to large distributed networks.
Abstract
As wireless communication networks grow in scale and complexity, diverse resource allocation tasks become increasingly critical. Multi-Agent Reinforcement Learning (MARL) provides a promising solution for distributed control, yet it often requires costly real-world interactions and lacks generalization across diverse tasks. Meanwhile, recent advances in Diffusion Models (DMs) have demonstrated strong capabilities in modeling complex dynamics and supporting high-fidelity simulation. Motivated by these challenges and opportunities, we propose a Communication-based Diffusion World Model (NetWorld) to enable few-shot generalization across heterogeneous MARL tasks in wireless networks. To improve applicability to large-scale distributed networks, NetWorld adopts the Distributed Training with Decentralized Execution (DTDE) paradigm and is organized into a two-stage framework: (i) pre-training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Software-Defined Networks and 5G · Advanced MIMO Systems Optimization
