NetWorld: Communication-Based Diffusion World Model for Multi-Agent Reinforcement Learning in Wireless Networks

Kechen Meng; Rongpeng Li; Yansha Deng; Zhifeng Zhao; and Honggang Zhang

arXiv:2602.00558·cs.NI·February 3, 2026

NetWorld: Communication-Based Diffusion World Model for Multi-Agent Reinforcement Learning in Wireless Networks

Kechen Meng, Rongpeng Li, Yansha Deng, Zhifeng Zhao, and Honggang Zhang

PDF

Open Access

TL;DR

NetWorld introduces a diffusion-based world model for multi-agent reinforcement learning in wireless networks, enabling few-shot generalization, reducing online interactions, and improving scalability and efficiency in resource allocation tasks.

Contribution

The paper proposes a novel diffusion world model framework with a two-stage training process and a lightweight communication mechanism for scalable multi-agent learning in wireless networks.

Findings

01

Outperforms MARL baselines in three wireless network tasks.

02

Achieves higher sample efficiency and generalization.

03

Demonstrates scalability to large distributed networks.

Abstract

As wireless communication networks grow in scale and complexity, diverse resource allocation tasks become increasingly critical. Multi-Agent Reinforcement Learning (MARL) provides a promising solution for distributed control, yet it often requires costly real-world interactions and lacks generalization across diverse tasks. Meanwhile, recent advances in Diffusion Models (DMs) have demonstrated strong capabilities in modeling complex dynamics and supporting high-fidelity simulation. Motivated by these challenges and opportunities, we propose a Communication-based Diffusion World Model (NetWorld) to enable few-shot generalization across heterogeneous MARL tasks in wireless networks. To improve applicability to large-scale distributed networks, NetWorld adopts the Distributed Training with Decentralized Execution (DTDE) paradigm and is organized into a two-stage framework: (i) pre-training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Software-Defined Networks and 5G · Advanced MIMO Systems Optimization