Meta Clustering of Neural Bandits
Yikun Ban, Yunzhe Qi, Tianxin Wei, Lihui Liu, Jingrui He

TL;DR
This paper introduces M-CNB, a meta-learning algorithm for clustering neural bandits that adapts to user heterogeneity and correlations, with proven guarantees and superior experimental performance.
Contribution
The paper proposes a novel meta-learner-based clustering algorithm for neural bandits with theoretical guarantees and improved empirical results over existing methods.
Findings
M-CNB outperforms state-of-the-art baselines in recommendation tasks.
The algorithm provides instance-dependent performance guarantees.
Effective in both recommendation and online classification scenarios.
Abstract
The contextual bandit has been identified as a powerful framework to formulate the recommendation process as a sequential decision-making process, where each item is regarded as an arm and the objective is to minimize the regret of rounds. In this paper, we study a new problem, Clustering of Neural Bandits, by extending previous work to the arbitrary reward function, to strike a balance between user heterogeneity and user correlations in the recommender system. To solve this problem, we propose a novel algorithm called M-CNB, which utilizes a meta-learner to represent and rapidly adapt to dynamic clusters, along with an informative Upper Confidence Bound (UCB)-based exploration strategy. We provide an instance-dependent performance guarantee for the proposed algorithm that withstands the adversarial context, and we further prove the guarantee is at least as good as state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Mental Health Research Topics · Machine Learning in Healthcare
