Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits
Kuan-Ta Li, Ping-Chun Hsieh, Yu-Chih Huang

TL;DR
This paper introduces a minimalist exploration method for piecewise-stationary multi-armed bandits that does not require prior knowledge of change points and improves empirical regret performance.
Contribution
It proposes diminishing exploration, a new generic mechanism that enhances existing change detection algorithms without needing to know the number of change points.
Findings
Achieves near-optimal regret scaling.
Outperforms traditional uniform exploration in simulations.
Does not require knowledge of change points M.
Abstract
The piecewise-stationary bandit problem is an important variant of the multi-armed bandit problem that further considers abrupt changes in the reward distributions. The main theme of the problem is the trade-off between exploration for detecting environment changes and exploitation of traditional bandit algorithms. While this problem has been extensively investigated, existing works either assume knowledge about the number of change points or require extremely high computational complexity. In this work, we revisit the piecewise-stationary bandit problem from a minimalist perspective. We propose a novel and generic exploration mechanism, called diminishing exploration, which eliminates the need for knowledge about and can be used in conjunction with an existing change detection-based algorithm to achieve near-optimal regret scaling. Simulation results show that despite oblivious…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management
