Revisiting Clustering of Neural Bandits: Selective Reinitialization for Mitigating Loss of Plasticity
Zhiyuan Su, Sunhao Dai, Xiao Zhang

TL;DR
This paper introduces Selective Reinitialization (SeRe), a framework that enhances neural bandit clustering algorithms by dynamically resetting underperforming units, thereby improving adaptability and reducing regret in non-stationary environments.
Contribution
The paper proposes SeRe, a novel method for neural bandits that selectively reinitializes units to prevent loss of plasticity, supported by theoretical regret guarantees and extensive real-world experiments.
Findings
SeRe improves adaptability in neural bandits.
SeRe reduces cumulative regret in non-stationary environments.
SeRe outperforms traditional CNB algorithms in experiments.
Abstract
Clustering of Bandits (CB) methods enhance sequential decision-making by grouping bandits into clusters based on similarity and incorporating cluster-level contextual information, demonstrating effectiveness and adaptability in applications like personalized streaming recommendations. However, when extending CB algorithms to their neural version (commonly referred to as Clustering of Neural Bandits, or CNB), they suffer from loss of plasticity, where neural network parameters become rigid and less adaptable over time, limiting their ability to adapt to non-stationary environments (e.g., dynamic user preferences in recommendation). To address this challenge, we propose Selective Reinitialization (SeRe), a novel bandit learning framework that dynamically preserves the adaptability of CNB algorithms in evolving environments. SeRe leverages a contribution utility metric to identify and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
