Rate-Constrained Remote Contextual Bandits
Francesco Pase, Deniz Gündüz, Michele Zorzi

TL;DR
This paper studies a multi-agent contextual bandit problem in which a remote decision-maker must communicate the arms to be played over a rate-limited channel. It analyzes the fundamental limits of policy compression and proposes practical coding schemes that keep regret low under the rate constraint.
Contribution
Introduces a theoretical framework for rate-limited policy transmission in contextual multi-armed bandits (CMABs), characterizes its information-theoretic limits, and proposes practical compression schemes.
Findings
Identifies two rate regions, one resulting in linear regret and the other in sub-linear regret.
Derives the fundamental limits of lossy compression for the policy.
Proposes a practical coding scheme with numerical validation.
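
To make the notion of a policy's communication cost concrete, the sketch below computes the mutual information I(S; A) between the context S and the selected arm A for a tabular policy. Using mutual information as the rate measure is an assumption of this illustration (it is the standard quantity in lossy compression); the function name `policy_rate_bits`, the toy context distribution, and the two example policies are hypothetical and are not taken from the paper.

```python
import math


def policy_rate_bits(context_probs, policy):
    """Mutual information I(S; A) in bits between context S and arm A.

    context_probs[s] is P(S = s); policy[s][a] is P(A = a | S = s).
    Both are illustrative inputs, not quantities defined in the paper.
    """
    num_arms = len(policy[0])
    # Marginal arm distribution induced by the policy: P(A = a).
    arm_marginal = [
        sum(p_s * row[a] for p_s, row in zip(context_probs, policy))
        for a in range(num_arms)
    ]
    rate = 0.0
    for p_s, row in zip(context_probs, policy):
        for a, p_a_given_s in enumerate(row):
            # Terms with zero probability contribute nothing to the sum.
            if p_s > 0 and p_a_given_s > 0:
                rate += p_s * p_a_given_s * math.log2(p_a_given_s / arm_marginal[a])
    return rate


# Toy setting (assumed): two equiprobable contexts, two arms.
contexts = [0.5, 0.5]
deterministic = [[1.0, 0.0], [0.0, 1.0]]  # the arm reveals the context
context_blind = [[0.5, 0.5], [0.5, 0.5]]  # the arm ignores the context

print(policy_rate_bits(contexts, deterministic))  # 1.0
print(policy_rate_bits(contexts, context_blind))  # 0.0
```

In this toy case, a policy that fully depends on the context costs one bit per agent, while a policy that ignores the context costs nothing to convey. A rate constraint below the cost of the desired policy is what would force a lossy approximation of that policy.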
Abstract
We consider a rate-constrained contextual multi-armed bandit (RC-CMAB) problem, in which a group of agents is solving the same contextual multi-armed bandit (CMAB) problem. However, the contexts are observed by a remotely connected entity, i.e., the decision-maker, that updates the policy to maximize the returned rewards, and communicates the arms to be sampled by the agents to a controller over a rate-limited communication channel. This framework can be applied to personalized ad placement, whenever the content owner observes the website visitors, and hence has the context, but needs to transmit the ads to be shown to a controller that is in charge of placing the marketing content. Consequently, the RC-CMAB problem requires the study of lossy compression schemes for the policy to be employed whenever the constraint on the channel rate does not allow the…
Taxonomy
Topics: Advanced Bandit Algorithms Research · Cognitive Radio Networks and Spectrum Sensing · Energy Harvesting in Wireless Networks
