Meta Clustering of Neural Bandits

Yikun Ban; Yunzhe Qi; Tianxin Wei; Lihui Liu; Jingrui He

arXiv:2408.05586·cs.LG·September 30, 2024

Meta Clustering of Neural Bandits

Yikun Ban, Yunzhe Qi, Tianxin Wei, Lihui Liu, Jingrui He

PDF

Open Access

TL;DR

This paper introduces M-CNB, a meta-learning algorithm for clustering neural bandits that adapts to user heterogeneity and correlations, with proven guarantees and superior experimental performance.

Contribution

The paper proposes a novel meta-learner-based clustering algorithm for neural bandits with theoretical guarantees and improved empirical results over existing methods.

Findings

01

M-CNB outperforms state-of-the-art baselines in recommendation tasks.

02

The algorithm provides instance-dependent performance guarantees.

03

Effective in both recommendation and online classification scenarios.

Abstract

The contextual bandit has been identified as a powerful framework to formulate the recommendation process as a sequential decision-making process, where each item is regarded as an arm and the objective is to minimize the regret of $T$ rounds. In this paper, we study a new problem, Clustering of Neural Bandits, by extending previous work to the arbitrary reward function, to strike a balance between user heterogeneity and user correlations in the recommender system. To solve this problem, we propose a novel algorithm called M-CNB, which utilizes a meta-learner to represent and rapidly adapt to dynamic clusters, along with an informative Upper Confidence Bound (UCB)-based exploration strategy. We provide an instance-dependent performance guarantee for the proposed algorithm that withstands the adversarial context, and we further prove the guarantee is at least as good as state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Mental Health Research Topics · Machine Learning in Healthcare