Deep Reinforcement Learning for mmWave Initial Beam Alignment
Daniel Tandler, Sebastian D\"orner, Marc Gauger, Stephan ten Brink

TL;DR
This paper explores the use of deep reinforcement learning for initial beam alignment in mmWave communications, showing that with proper modifications, it can perform competitively with existing methods in realistic scenarios.
Contribution
The study demonstrates that action space shaping significantly enhances deep reinforcement learning performance in mmWave beam alignment tasks.
Findings
Action space shaping improves RL performance
RL can match state-of-the-art methods in simulations
Modifications enable RL to handle realistic problem sizes
Abstract
We investigate the applicability of deep reinforcement learning algorithms to the adaptive initial access beam alignment problem for mmWave communications using the state-of-the-art proximal policy optimization algorithm as an example. In comparison to recent unsupervised learning based approaches developed to tackle this problem, deep reinforcement learning has the potential to address a new and wider range of applications, since, in principle, no (differentiable) model of the channel and/or the whole system is required for training, and only agent-environment interactions are necessary to learn an algorithm (be it online or using a recorded dataset). We show that, although the chosen off-the-shelf deep reinforcement learning agent fails to perform well when trained on realistic problem sizes, introducing action space shaping in the form of beamforming modules vastly improves the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMillimeter-Wave Propagation and Modeling · Energy Harvesting in Wireless Networks · Advanced MIMO Systems Optimization
