AdaLinUCB: Opportunistic Learning for Contextual Bandits

Xueying Guo; Xiaoxiao Wang; Xin Liu

arXiv:1902.07802·cs.LG·May 29, 2019·1 cites

AdaLinUCB: Opportunistic Learning for Contextual Bandits

Xueying Guo, Xiaoxiao Wang, Xin Liu

PDF

Open Access

TL;DR

This paper introduces AdaLinUCB, an adaptive algorithm for opportunistic contextual bandits with variable exploration costs, achieving lower regret and outperforming existing methods in fluctuating environments.

Contribution

The paper proposes AdaLinUCB, a novel adaptive algorithm for opportunistic contextual bandits that adjusts exploration based on environmental conditions, with proven regret bounds and empirical superiority.

Findings

01

AdaLinUCB achieves O((log T)^2) regret bound.

02

It outperforms traditional LinUCB in environments with large exploration cost fluctuations.

03

Empirical results show significant performance improvements over existing algorithms.

Abstract

In this paper, we propose and study opportunistic contextual bandits - a special case of contextual bandits where the exploration cost varies under different environmental conditions, such as network load or return variation in recommendations. When the exploration cost is low, so is the actual regret of pulling a sub-optimal arm (e.g., trying a suboptimal recommendation). Therefore, intuitively, we could explore more when the exploration cost is relatively low and exploit more when the exploration cost is relatively high. Inspired by this intuition, for opportunistic contextual bandits with Linear payoffs, we propose an Adaptive Upper-Confidence-Bound algorithm (AdaLinUCB) to adaptively balance the exploration-exploitation trade-off for opportunistic learning. We prove that AdaLinUCB achieves O((log T)^2) problem-dependent regret upper bound, which has a smaller coefficient than that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Machine Learning and Algorithms