A New Framework for Convex Clustering in Kernel Spaces: Finite Sample Bounds, Consistency and Performance Insights

Shubhayan Pan; Kushal Bose; Debolina Paul; Saptarshi Chakraborty; Swagatam Das

arXiv:2511.05159·stat.ML·May 15, 2026

A New Framework for Convex Clustering in Kernel Spaces: Finite Sample Bounds, Consistency and Performance Insights

Shubhayan Pan, Kushal Bose, Debolina Paul, Saptarshi Chakraborty, Swagatam Das

PDF

TL;DR

This paper introduces a kernelized convex clustering method that operates in RKHS, providing theoretical guarantees and demonstrating superior performance on complex datasets.

Contribution

It extends convex clustering to kernel spaces, enabling effective clustering of non-linear and non-convex data with proven convergence and finite sample bounds.

Findings

01

Kernelized convex clustering handles complex data distributions.

02

Theoretical convergence and finite sample bounds are established.

03

Experimental results show improved performance over existing methods.

Abstract

Convex clustering is a well-regarded clustering method, resembling the similar centroid-based approach of Lloyd's $k$ -means, without requiring a predefined cluster count. It starts with each data point as its centroid and iteratively merges them. Despite its advantages, this method can fail when dealing with data exhibiting linearly non-separable or non-convex structures. To mitigate the limitations, we propose a kernelized extension of the convex clustering method. This approach projects the data points into a Reproducing Kernel Hilbert Space (RKHS) using a feature map, enabling convex clustering in this transformed space. This kernelization not only allows for better handling of complex data distributions but also produces an embedding in a finite-dimensional vector space. We provide a comprehensive theoretical underpinning for our kernelized approach, proving algorithmic convergence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.