Revisiting Weighted Strategy for Non-stationary Parametric Bandits

Jing Wang; Peng Zhao; Zhi-Hua Zhou

arXiv:2303.02691·cs.LG·June 8, 2023·1 cites

Revisiting Weighted Strategy for Non-stationary Parametric Bandits

Jing Wang, Peng Zhao, Zhi-Hua Zhou

PDF

Open Access

TL;DR

This paper refines the analysis of weighted strategies for non-stationary parametric bandits, leading to simpler algorithms with improved regret bounds across various models like linear, generalized linear, and self-concordant bandits.

Contribution

It introduces a refined analysis framework that simplifies weighted algorithms and enhances regret bounds for multiple non-stationary parametric bandit models.

Findings

01

Simplified weighted bandit algorithms with competitive efficiency.

02

Improved regret bounds for generalized linear and self-concordant bandits.

03

Demonstrated effectiveness through theoretical analysis and bounds.

Abstract

Non-stationary parametric bandits have attracted much attention recently. There are three principled ways to deal with non-stationarity, including sliding-window, weighted, and restart strategies. As many non-stationary environments exhibit gradual drifting patterns, the weighted strategy is commonly adopted in real-world applications. However, previous theoretical studies show that its analysis is more involved and the algorithms are either computationally less efficient or statistically suboptimal. This paper revisits the weighted strategy for non-stationary parametric bandits. In linear bandits (LB), we discover that this undesirable feature is due to an inadequate regret analysis, which results in an overly complex algorithm design. We propose a refined analysis framework, which simplifies the derivation and importantly produces a simpler weight-based algorithm that is as efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Decision-Making and Behavioral Economics