Reinforcement Learning-based Joint User Scheduling and Link Configuration in Millimeter-wave Networks
Yi Zhang, Robert W. Heath Jr

TL;DR
This paper introduces reinforcement learning algorithms for joint user scheduling and link configuration in mmWave networks, aiming to minimize system delay through dynamic, online decision-making.
Contribution
It develops two reinforcement learning solutions, DRL and MAB-based, for complex joint scheduling and configuration in mmWave networks, demonstrating their effectiveness.
Findings
DRL achieves better delay performance.
MAB-based method trains faster.
Both methods effectively reduce system delay.
Abstract
In this paper, we develop algorithms for joint user scheduling and three types of mmWave link configuration: relay selection, codebook optimization, and beam tracking in millimeter wave (mmWave) networks. Our goal is to design an online controller that dynamically schedules users and configures their links to minimize the system delay. To solve this complex scheduling problem, we model it as a dynamic decision-making process and develop two reinforcement learning-based solutions. The first solution is based on deep reinforcement learning (DRL), which leverages the proximal policy optimization to train a neural network-based solution. Due to the potential high sample complexity of DRL, we also propose an empirical multi-armed bandit (MAB)-based solution, which decomposes the decision-making process into a sequential of sub-actions and exploits classic maxweight scheduling and Thompson…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced MIMO Systems Optimization · Millimeter-Wave Propagation and Modeling · Cooperative Communication and Network Coding
