AdaLinUCB: Opportunistic Learning for Contextual Bandits
Xueying Guo, Xiaoxiao Wang, Xin Liu

TL;DR
This paper introduces AdaLinUCB, an adaptive algorithm for opportunistic contextual bandits with variable exploration costs, achieving lower regret and outperforming existing methods in fluctuating environments.
Contribution
The paper proposes AdaLinUCB, a novel adaptive algorithm for opportunistic contextual bandits that adjusts exploration based on environmental conditions, with proven regret bounds and empirical superiority.
Findings
AdaLinUCB achieves O((log T)^2) regret bound.
It outperforms traditional LinUCB in environments with large exploration cost fluctuations.
Empirical results show significant performance improvements over existing algorithms.
Abstract
In this paper, we propose and study opportunistic contextual bandits - a special case of contextual bandits where the exploration cost varies under different environmental conditions, such as network load or return variation in recommendations. When the exploration cost is low, so is the actual regret of pulling a sub-optimal arm (e.g., trying a suboptimal recommendation). Therefore, intuitively, we could explore more when the exploration cost is relatively low and exploit more when the exploration cost is relatively high. Inspired by this intuition, for opportunistic contextual bandits with Linear payoffs, we propose an Adaptive Upper-Confidence-Bound algorithm (AdaLinUCB) to adaptively balance the exploration-exploitation trade-off for opportunistic learning. We prove that AdaLinUCB achieves O((log T)^2) problem-dependent regret upper bound, which has a smaller coefficient than that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Machine Learning and Algorithms
