Tracking Most Significant Shifts in Nonparametric Contextual Bandits

Joe Suk; Samory Kpotufe

arXiv:2307.05341 · stat.ML · November 21, 2023

TL;DR

This paper studies nonparametric contextual bandits whose Lipschitz mean reward functions change over time. It establishes the minimax dynamic regret rate in terms of the number of changes $L$ and the total variation $V$, and proposes a local, significance-based measure of change that allows better adaptivity.

Contribution

The paper introduces experienced significant shifts, a notion of change that counts only local and impactful changes in reward, enabling adaptive algorithms that need no prior knowledge of the change parameters $L$ or $V$.

Findings

01

Established minimax dynamic regret rates in nonparametric contextual bandits.

02

Proposed a locality-aware change measure called experienced significant shifts.

03

Showed that adaptive algorithms can achieve minimax rates using this new change measure.

Abstract

We study nonparametric contextual bandits where Lipschitz mean reward functions may change over time. We first establish the minimax dynamic regret rate in this less understood setting in terms of the number of changes $L$ and the total variation $V$, both of which capture all changes in distribution over the context space, and argue that state-of-the-art procedures are suboptimal in this setting. Next, we turn to the question of adaptivity in this setting, i.e., achieving the minimax rate without knowledge of $L$ or $V$. Importantly, we posit that the bandit problem, viewed locally at a given context $X_{t}$, should not be affected by reward changes in other parts of the context space $\mathcal{X}$. We therefore propose a notion of change, which we term experienced significant shifts, that better accounts for locality and thus counts considerably fewer changes than $L$ and $V$. Furthermore, similar to…
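To make the locality argument concrete, here is a minimal Python sketch on a toy instance. It rests on assumptions that are not taken from the paper: contexts live on a finite grid over [0, 1], $L$ is taken to be the number of rounds at which any mean reward function changes, and $V$ is taken to be the summed sup-norm difference between consecutive rounds. The helper `changes_at_observed_contexts` is a hypothetical, crude proxy for locality and is not the paper's definition of experienced significant shifts.

```python
import numpy as np

def global_change_measures(rewards):
    """Assumed global measures for rewards of shape (T, K, G):
    round t, arm k, grid context g. Not the paper's exact definitions."""
    diffs = np.abs(np.diff(rewards, axis=0))   # (T-1, K, G) round-to-round changes
    per_round = diffs.max(axis=(1, 2))         # sup over arms and contexts
    L = int(np.count_nonzero(per_round > 0))   # rounds where anything changed
    V = float(per_round.sum())                 # total variation in sup norm
    return L, V

def changes_at_observed_contexts(rewards, context_idx):
    """Hypothetical locality proxy: count rounds t at which the mean rewards
    at the observed context X_t differ from those of the previous round."""
    count = 0
    for t in range(1, rewards.shape[0]):
        g = context_idx[t]
        if np.any(rewards[t, :, g] != rewards[t - 1, :, g]):
            count += 1
    return count

# Toy instance: two arms with Lipschitz mean rewards on a grid over [0, 1].
grid = np.linspace(0.0, 1.0, 11)
T, K = 6, 2
rewards = np.empty((T, K, grid.size))
rewards[:, 0, :] = 0.5    # arm 0: constant reward
rewards[:, 1, :] = grid   # arm 1: reward equal to the context value

# From round 3 on, arm 0 changes only for contexts above 0.8.
rewards[3:, 0, :] += 0.5 * np.maximum(0.0, grid - 0.8)

# Observed contexts (grid indices) all lie in [0, 0.5], away from the change.
context_idx = np.array([0, 2, 4, 1, 3, 5])

L, V = global_change_measures(rewards)
local = changes_at_observed_contexts(rewards, context_idx)
print(f"L = {L}, V = {V:.3f}, changes at observed contexts = {local}")
```

In this toy instance the global measures register a change, while none of the observed contexts fall in the region where the rewards moved. That is the kind of gap the abstract argues a locality-aware notion of change should exploit.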

Videos

Tracking Most Significant Shifts in Nonparametric Contextual Bandits · SlidesLive

Taxonomy

Topics: Advanced Bandit Algorithms Research · Decision-Making and Behavioral Economics · Data Stream Mining Techniques