Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits

Kuan-Ta Li; Ping-Chun Hsieh; Yu-Chih Huang

arXiv:2410.05734·cs.LG·October 10, 2024

Open Access

TL;DR

This paper introduces diminishing exploration, a minimalist exploration mechanism for piecewise-stationary multi-armed bandits that requires no prior knowledge of the number of change points $M$ and outperforms uniform exploration in simulations.

Contribution

It proposes diminishing exploration, a generic exploration mechanism that can be combined with existing change-detection-based algorithms and does not need to know the number of change points $M$.

Findings

01 Achieves near-optimal regret scaling.

02 Outperforms traditional uniform exploration in simulations.

03 Does not require knowledge of the number of change points $M$.

Abstract

The piecewise-stationary bandit problem is an important variant of the multi-armed bandit problem that further considers abrupt changes in the reward distributions. The main theme of the problem is the trade-off between exploration for detecting environment changes and exploitation of traditional bandit algorithms. While this problem has been extensively investigated, existing works either assume knowledge about the number of change points $M$ or require extremely high computational complexity. In this work, we revisit the piecewise-stationary bandit problem from a minimalist perspective. We propose a novel and generic exploration mechanism, called diminishing exploration, which eliminates the need for knowledge about $M$ and can be used in conjunction with an existing change detection-based algorithm to achieve near-optimal regret scaling. Simulation results show that despite oblivious…
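The idea of exploration that diminishes between detected changes can be illustrated with a short sketch. The abstract does not give the exact schedule, so everything below is an assumption made for illustration: the class name `DiminishingExplorer`, the tuning parameter `alpha`, and the rule that the number of forced exploration pulls since the last reset stays near `alpha * sqrt(steps)`. It is not the authors' algorithm, only one way such a mechanism could look.

```python
import math


class DiminishingExplorer:
    """Hypothetical forced-exploration scheduler (illustrative only).

    Assumption: the cumulative number of forced exploration pulls since
    the last reset is kept near alpha * sqrt(steps), so the exploration
    rate decays over time and no knowledge of M is needed.
    """

    def __init__(self, num_arms, alpha=1.0):
        self.num_arms = num_arms
        self.alpha = alpha
        self.reset()

    def reset(self):
        """Restart the schedule, e.g. when a change detector raises an alarm."""
        self.steps = 0
        self.explore_pulls = 0

    def next_forced_arm(self):
        """Return an arm to explore now, or None to defer to the base bandit."""
        self.steps += 1
        budget = self.alpha * math.sqrt(self.steps)
        if self.explore_pulls < budget:
            # Cycle through the arms round-robin so each one keeps being sampled.
            arm = self.explore_pulls % self.num_arms
            self.explore_pulls += 1
            return arm
        return None
```

In a full loop, one might call `next_forced_arm()` at every step, pull the returned arm when it is not `None`, otherwise let a standard bandit algorithm choose, and call `reset()` whenever the change detector fires. Under this assumed rule, exploration is dense right after a reset and becomes sparser as the environment stays stable.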

Taxonomy

Topics: Advanced Bandit Algorithms Research · Smart Grid Energy Management