Multi-User mmWave Beam and Rate Adaptation via Combinatorial Satisficing Bandits
Emre \"Ozy{\i}ld{\i}r{\i}m, Bar{\i}\c{s} Yayc{\i}, Umut Eren Akturk, Cem Tekin

TL;DR
This paper introduces SAT-CTS, a novel learning policy for multi-user mmWave systems that efficiently adapts beams and rates to meet quality-of-service targets, with proven regret bounds and practical benefits.
Contribution
It presents the first finite-time regret bounds for combinatorial semi-bandits with satisficing objectives and proposes a lightweight, threshold-aware policy for beam and rate adaptation.
Findings
SAT-CTS reduces satisficing regret effectively.
It maintains competitive standard regret and fairness.
Experiments show improved throughput and fairness across users.
Abstract
We study downlink beam and rate adaptation in a multi-user mmWave MISO system where multiple base stations (BSs), each using analog beamforming from finite codebooks, serve multiple single-antenna user equipments (UEs) with a unique beam per UE and discrete data transmission rates. BSs learn about transmission success based on ACK/NACK feedback. To encode service goals, we introduce a satisficing throughput threshold and cast joint beam and rate adaptation as a combinatorial semi-bandit over beam-rate tuples. Within this framework, we propose SAT-CTS, a lightweight, threshold-aware policy that blends conservative confidence estimates with posterior sampling, steering learning toward meeting rather than merely maximizing. Our main theoretical contribution provides the first finite-time regret bounds for combinatorial semi-bandits with satisficing objective: when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
