Nonparametric Kernel Clustering with Bandit Feedback
Victor Thuot (MISTEA), Sebastian Vogt (LRR-TUM), Debarghya Ghoshdastidar (LRR-TUM), Nicolas Verzelen (MISTEA)

TL;DR
This paper presents a nonparametric kernel-based clustering algorithm with bandit feedback, capable of adaptively grouping items based on their distributions without assuming parametric forms, applicable to real-world data.
Contribution
It introduces a novel nonparametric clustering framework using kernel mean embeddings, along with the KABC algorithm that provides theoretical guarantees and adapts to unknown distribution differences.
Findings
Algorithm achieves instance-dependent guarantees.
Framework applicable to real-world datasets.
Provides theoretical correctness and sampling analysis.
Abstract
Clustering with bandit feedback refers to the problem of partitioning a set of items, where the clustering algorithm can sequentially query the items to receive noisy observations. The problem is formally posed as the task of partitioning the arms of an N-armed stochastic bandit according to their underlying distributions, grouping two arms together if and only if they share the same distribution, using samples collected sequentially and adaptively. This setting has gained attention in recent years due to its applicability in recommendation systems and crowdsourcing. Existing works on clustering with bandit feedback rely on a strong assumption that the underlying distributions are sub-Gaussian. As a consequence, the existing methods mainly cover settings with linearly-separable clusters, which has little practical relevance. We introduce a framework of ``nonparametric clustering with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Stochastic Gradient Optimization Techniques
