Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks
Shufan Wang, Guojun Xiong, Shichen Zhang, Huacheng Zeng, Jian Li,, Shivendra Panwar

TL;DR
This paper introduces a structured reinforcement learning approach for delay-optimized data transmission in dense mmWave networks, leveraging problem structure to develop a computationally efficient algorithm with proven regret bounds.
Contribution
It proposes a novel structured RL algorithm, mmDPT-TS, that exploits the problem's inherent structure to achieve near-optimal delay performance with practical computational efficiency.
Findings
mmDPT-TS outperforms existing methods in realistic network simulations.
The algorithm achieves (\u221a{T}) Bayesian regret, indicating strong theoretical guarantees.
A low-complexity index policy is developed for RMAB-F, guiding the RL approach.
Abstract
We study the data packet transmission problem (mmDPT) in dense cell-free millimeter wave (mmWave) networks, i.e., users sending data packet requests to access points (APs) via uplinks and APs transmitting requested data packets to users via downlinks. Our objective is to minimize the average delay in the system due to APs' limited service capacity and unreliable wireless channels between APs and users. This problem can be formulated as a restless multi-armed bandits problem with fairness constraint (RMAB-F). Since finding the optimal policy for RMAB-F is intractable, existing learning algorithms are computationally expensive and not suitable for practical dynamic dense mmWave networks. In this paper, we propose a structured reinforcement learning (RL) solution for mmDPT by exploiting the inherent structure encoded in RMAB-F. To achieve this, we first design a low-complexity and provably…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMillimeter-Wave Propagation and Modeling · Advanced MIMO Systems Optimization · Wireless Body Area Networks
Methodstravel james
