Neural Combinatorial Clustered Bandits for Recommendation Systems
Baran Atalar, Carlee Joe-Wong

TL;DR
This paper introduces NeUClust, a neural network-based clustering algorithm for combinatorial bandits in recommendation systems, achieving improved regret and reward performance without requiring a known optimization oracle.
Contribution
It proposes a novel neural clustering approach for combinatorial bandits that eliminates the need for a known oracle and provides theoretical regret guarantees.
Findings
NeUClust outperforms existing algorithms in real-world recommendation datasets.
Achieves a regret bound of tenilde;O( ilde{d}\u221a{T}) with neural tangent kernel analysis.
Demonstrates effective learning of reward functions using neural networks in complex combinatorial settings.
Abstract
We consider the contextual combinatorial bandit setting where in each round, the learning agent, e.g., a recommender system, selects a subset of "arms," e.g., products, and observes rewards for both the individual base arms, which are a function of known features (called "context"), and the super arm (the subset of arms), which is a function of the base arm rewards. The agent's goal is to simultaneously learn the unknown reward functions and choose the highest-reward arms. For example, the "reward" may represent a user's probability of clicking on one of the recommended products. Conventional bandit models, however, employ restrictive reward function models in order to obtain performance guarantees. We make use of deep neural networks to estimate and learn the unknown reward functions and propose Neural UCB Clustering (NeUClust), which adopts a clustering approach to select the super…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques · Data Stream Mining Techniques
MethodsBalanced Selection
