Reinforcement Learning-based Joint User Scheduling and Link Configuration in Millimeter-wave Networks

Yi Zhang; Robert W. Heath Jr

arXiv:2207.03526 · cs.NI · October 25, 2022

Open Access

TL;DR

This paper introduces reinforcement learning algorithms for joint user scheduling and link configuration in mmWave networks, aiming to minimize system delay through dynamic, online decision-making.

Contribution

It develops two reinforcement learning solutions for joint user scheduling and link configuration in mmWave networks, one based on deep reinforcement learning (DRL) and one based on multi-armed bandits (MAB), and reports that both reduce system delay.

Findings

01

The DRL-based method achieves better delay performance than the MAB-based method.

02

The MAB-based method trains faster than the DRL-based method.

03

Both methods effectively reduce system delay.

Abstract

In this paper, we develop algorithms for joint user scheduling and three types of link configuration in millimeter-wave (mmWave) networks: relay selection, codebook optimization, and beam tracking. Our goal is to design an online controller that dynamically schedules users and configures their links to minimize the system delay. To solve this complex scheduling problem, we model it as a dynamic decision-making process and develop two reinforcement learning-based solutions. The first solution is based on deep reinforcement learning (DRL) and uses proximal policy optimization (PPO) to train a neural network-based policy. Because DRL can have high sample complexity, we also propose an empirical multi-armed bandit (MAB)-based solution, which decomposes the decision-making process into a sequence of sub-actions and exploits classic max-weight scheduling and Thompson…
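The abstract is cut off before it explains how max-weight scheduling and Thompson sampling are combined, so the following is only a generic sketch of the two textbook building blocks, not the authors' algorithm. It assumes a Beta-Bernoulli model in which each candidate link configuration either succeeds or fails, and all names (`max_weight_user`, `BetaThompsonArm`, `run_demo`) and the success probabilities are hypothetical.

```python
import random


def max_weight_user(queues, rates):
    """Max-weight rule: pick the user with the largest queue length x rate product."""
    weights = [q * r for q, r in zip(queues, rates)]
    return max(range(len(weights)), key=weights.__getitem__)


class BetaThompsonArm:
    """Beta-Bernoulli posterior over the success probability of one configuration."""

    def __init__(self):
        self.successes = 1  # Beta(1, 1), i.e. a uniform prior
        self.failures = 1

    def sample(self, rng):
        # Draw one plausible success probability from the current posterior.
        return rng.betavariate(self.successes, self.failures)

    def update(self, success):
        if success:
            self.successes += 1
        else:
            self.failures += 1


def thompson_select(arms, rng):
    """Thompson sampling: sample every posterior, play the arm with the largest draw."""
    samples = [arm.sample(rng) for arm in arms]
    return max(range(len(arms)), key=samples.__getitem__)


def run_demo(steps=2000, seed=0):
    """Count how often each hypothetical configuration is chosen."""
    rng = random.Random(seed)
    true_success = [0.2, 0.5, 0.8]  # assumed values, for illustration only
    arms = [BetaThompsonArm() for _ in true_success]
    pulls = [0] * len(arms)
    for _ in range(steps):
        k = thompson_select(arms, rng)
        arms[k].update(rng.random() < true_success[k])
        pulls[k] += 1
    return pulls


if __name__ == "__main__":
    print(max_weight_user([3, 10, 1], [2.0, 0.5, 4.0]))
    print(run_demo())
```

In this sketch the max-weight rule favors users with long queues and good links, while Thompson sampling gradually concentrates its choices on the configuration with the highest observed success rate. How the paper orders or couples these sub-actions is not recoverable from the truncated abstract.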

Taxonomy

Topics: Advanced MIMO Systems Optimization · Millimeter-Wave Propagation and Modeling · Cooperative Communication and Network Coding