Online Coresets for Clustering with Bregman Divergences
Rachit Chhaya, Jayesh Choudhari, Anirban Dasgupta, Supratim Shit

TL;DR
This paper introduces online algorithms for creating small, lightweight coresets for clustering with Bregman divergences, enabling efficient, incremental clustering with small additive errors and scalable sizes.
Contribution
The paper presents the first online coreset algorithms for Bregman divergence clustering, including non-parametric variants with size independent of the number of clusters.
Findings
Coresets have small additive error similar to previous lightweight coresets.
Algorithms achieve update time $O(d)$ per incoming point.
Non-parametric coresets have size independent of the number of clusters.
Abstract
We present algorithms that create coresets in an online setting for clustering problems according to a wide subset of Bregman divergences. Notably, our coresets have a small additive error, similar in magnitude to the lightweight coresets Bachem et. al. 2018, and take update time for every incoming point where is dimension of the point. Our first algorithm gives online coresets of size for -clusterings according to any -similar Bregman divergence. We further extend this algorithm to show existence of a non-parametric coresets, where the coreset size is independent of , the number of clusters, for the same subclass of Bregman divergences. Our non-parametric coresets are larger by a factor of ( is number of points) and have similar (small) additive guarantee. At the same time our coresets also function as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFacility Location and Emergency Management · Statistical Methods and Inference · Domain Adaptation and Few-Shot Learning
MethodsCoresets
