Mini-Batch Kernel $k$-means
Ben Jourdan, Gregory Schwartzman

TL;DR
This paper introduces the first mini-batch kernel $k$-means algorithm that significantly accelerates clustering on large datasets with minimal quality loss, supported by theoretical guarantees and extensive experiments.
Contribution
It presents a novel mini-batch kernel $k$-means algorithm with proven convergence and approximation guarantees, enabling scalable kernel clustering.
Findings
Achieves 10-100x speedup over full kernel $k$-means.
Maintains clustering quality with minimal loss.
Provides theoretical analysis with convergence and approximation bounds.
Abstract
We present the first mini-batch kernel -means algorithm, offering an order of magnitude improvement in running time compared to the full batch algorithm. A single iteration of our algorithm takes time, significantly faster than the time required by the full batch kernel -means, where is the dataset size and is the batch size. Extensive experiments demonstrate that our algorithm consistently achieves a 10-100x speedup with minimal loss in quality, addressing the slow runtime that has limited kernel -means adoption in practice. We further complement these results with a theoretical analysis under an early stopping condition, proving that with a batch size of , the algorithm terminates in iterations with high probability, where bounds the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition
MethodsEarly Stopping
