A Unified Framework for Clustering Constrained Data without Locality Property
Hu Ding, Jinhui Xu

TL;DR
This paper introduces a unified geometric framework called Peeling-and-Enclosing for high-dimensional constrained clustering that overcomes the loss of locality property, providing efficient near-linear time approximation algorithms.
Contribution
The paper presents a novel geometric framework and techniques for constrained clustering without locality, enabling efficient high-dimensional approximation algorithms.
Findings
Nearly linear time complexity $O(n(\log n)^{k+1}d)$
Produces $O((\log n)^{k})$ candidate solutions
Achieves $(1+\epsilon)$-approximation for constrained $k$-means and $k$-median
Abstract
In this paper, we consider a class of constrained clustering problems of points in , where could be rather high. A common feature of these problems is that their optimal clusterings no longer have the locality property (due to the additional constraints), which is a key property required by many algorithms for their unconstrained counterparts. To overcome the difficulty caused by the loss of locality, we present in this paper a unified framework, called {\em Peeling-and-Enclosing (PnE)}, to iteratively solve two variants of the constrained clustering problems, {\em constrained -means clustering} (-CMeans) and {\em constrained -median clustering} (-CMedian). Our framework is based on two standalone geometric techniques, called {\em Simplex Lemma} and {\em Weaker Simplex Lemma}, for -CMeans and -CMedian, respectively. The simplex lemma (or weaker…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Geometry and Mesh Generation · Facility Location and Emergency Management · Data Management and Algorithms
