Linear Time Algorithm for Projective Clustering
Hu Ding, Jinhui Xu

TL;DR
This paper introduces the first linear time algorithm for projective clustering in high-dimensional spaces, using uniform random sampling to efficiently handle outliers and achieve near-optimal solutions.
Contribution
It presents a novel uniform sampling approach for projective clustering, enabling linear time solutions and extending to regular and $L_{\tau}$-sense clustering.
Findings
Achieves linear time solutions for general, regular, and $L_{\tau}$-sense projective clustering.
Provides a PTAS for regular projective clustering under certain conditions.
Extends techniques to $L_{\tau}$$ projective clustering for any $1 \le \tau < \infty$.
Abstract
Projective clustering is a problem with both theoretical and practical importance and has received a great deal of attentions in recent years. Given a set of points in space, projective clustering is to find a set of lower dimensional -flats so that the average distance (or squared distance) from points in to their closest flats is minimized. Existing approaches for this problem are mainly based on adaptive/volume sampling or core-sets techniques which suffer from several limitations. In this paper, we present the first uniform random sampling based approach for this challenging problem and achieve linear time solutions for three cases, general projective clustering, regular projective clustering, and sense projective clustering. For the general projective clustering problem, we show that for any given small numbers $0<\gamma,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Optimization and Search Problems · Advanced Clustering Algorithms Research
