Robust Beam Codebooks for mmWave/THz Systems: Toward a Stochastic RL Approach
Anouar Nechi, Rainer Buchty, Mladen Berekovic, Saleh Mulhem

TL;DR
This paper proposes a robust multi-agent reinforcement learning framework for designing adaptive beam codebooks in mmWave/THz massive MIMO systems, improving performance under NLoS conditions and hardware impairments without relying on explicit channel knowledge.
Contribution
It introduces a novel RL-based approach for adaptive codebook design that is robust to practical challenges, outperforming traditional methods in complex propagation environments.
Findings
SAC outperforms DDPG and TD3 in robustness and beamforming gains.
The RL framework effectively handles hardware impairments and feedback noise.
Simulation results confirm improved stability and performance in NLoS scenarios.
Abstract
Millimeter-wave (mmWave) and terahertz (THz) massive MIMO systems often rely on predefined beamforming codebooks, which are usually suboptimal in Non-Line-of-Sight (NLoS) conditions and for hardware-limited transceivers. Reinforcement Learning (RL) enables adaptive, data-driven codebook design without explicit Channel State Information (CSI), but the robustness of such algorithms in practical conditions is underexplored. This paper introduces a robust multi-agent RL framework that learns beam codebooks directly from environmental feedback, eliminating the need for prior channel knowledge. Our method is well-suited for real-world deployments facing unpredictable propagation and hardware constraints. We conduct a comprehensive analysis of three off-policy algorithms, Deep Deterministic Policy Gradient (DDPG), Twin Delayed DDPG (TD3), and Soft Actor-Critic (SAC), evaluating their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMillimeter-Wave Propagation and Modeling · Advanced MIMO Systems Optimization · Advanced Wireless Communication Technologies
