Unifying Clustered and Non-stationary Bandits
Chuanhao Li, Qingyun Wu, Hongning Wang

TL;DR
This paper unifies the approaches to non-stationary bandits and online clustering of bandits through a test of homogeneity, enabling change detection and cluster identification within a single framework, supported by theoretical and empirical validation.
Contribution
It introduces a unified framework that combines change detection and clustering in bandit problems using a test of homogeneity, advancing the integration of these research areas.
Findings
Rigorous regret bounds established for the unified method.
Extensive experiments show improved adaptability across environments.
Framework effectively detects changes and identifies clusters in real-time.
Abstract
Non-stationary bandits and online clustering of bandits lift the restrictive assumptions in contextual bandits and provide solutions to many important real-world scenarios. Though the essence in solving these two problems overlaps considerably, they have been studied independently. In this paper, we connect these two strands of bandit research under the notion of test of homogeneity, which seamlessly addresses change detection for non-stationary bandit and cluster identification for online clustering of bandit in a unified solution framework. Rigorous regret analysis and extensive empirical evaluations demonstrate the value of our proposed solution, especially its flexibility in handling various environment assumptions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Mobile Crowdsensing and Crowdsourcing
