Online Learning for Position-Aided Millimeter Wave Beam Training
Vutha Va, Takayuki Shimizu, Gaurav Bansal, Robert W. Heath Jr

TL;DR
This paper introduces online learning algorithms based on multi-armed bandit frameworks to efficiently perform beam training in millimeter wave communications, significantly reducing overhead and improving beam alignment accuracy.
Contribution
It develops novel online learning algorithms for beam pair selection and refinement using a multi-armed bandit approach with risk-aware features and optimistic optimization.
Findings
Achieves 1dB gain over exhaustive search within 100 steps
Uses only 30 beam pairs for training, reducing overhead
Effective for 16x16 antenna arrays in millimeter wave systems
Abstract
Accurate beam alignment is essential for beam-based millimeter wave communications. Conventional beam sweeping solutions often have large overhead, which is unacceptable for mobile applications like vehicle-to-everything. Learning-based solutions that leverage sensor data like position to identify good beam directions are one approach to reduce the overhead. Most existing solutions, though, are supervised-learning where the training data is collected beforehand. In this paper, we use a multi-armed bandit framework to develop online learning algorithms for beam pair selection and refinement. The beam pair selection algorithm learns coarse beam directions in some predefined beam codebook, e.g., in discrete angles separated by the 3dB beamwidths. The beam refinement fine-tunes the identified directions to match the peak of the power angular spectrum at that position. The beam pair…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMillimeter-Wave Propagation and Modeling · Microwave Engineering and Waveguides · Advanced MIMO Systems Optimization
