Structured Reinforcement Learning for Delay-Optimal Data Transmission in   Dense mmWave Networks

Shufan Wang; Guojun Xiong; Shichen Zhang; Huacheng Zeng; Jian Li,; Shivendra Panwar

arXiv:2404.16920·cs.NI·April 29, 2024

Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks

Shufan Wang, Guojun Xiong, Shichen Zhang, Huacheng Zeng, Jian Li,, Shivendra Panwar

PDF

Open Access

TL;DR

This paper introduces a structured reinforcement learning approach for delay-optimized data transmission in dense mmWave networks, leveraging problem structure to develop a computationally efficient algorithm with proven regret bounds.

Contribution

It proposes a novel structured RL algorithm, mmDPT-TS, that exploits the problem's inherent structure to achieve near-optimal delay performance with practical computational efficiency.

Findings

01

mmDPT-TS outperforms existing methods in realistic network simulations.

02

The algorithm achieves (\u221a{T}) Bayesian regret, indicating strong theoretical guarantees.

03

A low-complexity index policy is developed for RMAB-F, guiding the RL approach.

Abstract

We study the data packet transmission problem (mmDPT) in dense cell-free millimeter wave (mmWave) networks, i.e., users sending data packet requests to access points (APs) via uplinks and APs transmitting requested data packets to users via downlinks. Our objective is to minimize the average delay in the system due to APs' limited service capacity and unreliable wireless channels between APs and users. This problem can be formulated as a restless multi-armed bandits problem with fairness constraint (RMAB-F). Since finding the optimal policy for RMAB-F is intractable, existing learning algorithms are computationally expensive and not suitable for practical dynamic dense mmWave networks. In this paper, we propose a structured reinforcement learning (RL) solution for mmDPT by exploiting the inherent structure encoded in RMAB-F. To achieve this, we first design a low-complexity and provably…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMillimeter-Wave Propagation and Modeling · Advanced MIMO Systems Optimization · Wireless Body Area Networks

Methodstravel james