Generative Learning Enhanced Intelligent Resource Management for Cell-Free Delay Deterministic Communications
Shuangbo Xiong, Cheng Zhang, Wen Wang, Wenwu Yu, and Yongming Huang

TL;DR
This paper introduces a novel offline pretraining framework using virtual CMDP modeling and evidence-aware Gaussian Mixture Models to enhance resource management in CF-MIMO systems, improving energy efficiency and delay compliance.
Contribution
It proposes a new pretraining approach for safe deep reinforcement learning in CF-MIMO, addressing data sparsity and distribution drift, with significant efficiency and performance gains.
Findings
Pretrained agent achieves twice the initial energy efficiency.
Maintains a 1% delay violation rate.
Converges to 4.7% higher energy efficiency with fewer exploration steps.
Abstract
Cell-free multiple-input multiple-output (CF-MIMO) architecture significantly enhances wireless network performance, offering a promising solution for delay-sensitive applications. This paper investigates the resource allocation problem in CF-MIMO systems, aiming to maximize energy efficiency (EE) while satisfying delay violation rate constraint. We design a Proximal Policy Optimization (PPO) with a primal-dual method to solve it. To address the low sample efficiency and safety risks caused by cold-start of the designed safe deep reinforcement learning (DRL) method, we propose a novel offline pretraining framework based on virtual constrained Markov decision process (CMDP) modeling. The virtual CMDP consists of reward and cost prediction module, initial-state distribution module and state transition module. Notably, we propose an evidence-aware conditional Gaussian Mixture Model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
