Approximation and Streaming Algorithms for Projective Clustering via Random Projections
Michael Kerber, Sharath Raghvendra

TL;DR
This paper introduces a dimension reduction technique using random projections to efficiently approximate projective clustering problems in high-dimensional data, enabling improved algorithms for streaming and large datasets.
Contribution
It presents a novel dimension reduction method that preserves clustering quality and applies it to develop new approximation and streaming algorithms for projective clustering.
Findings
Random projections provide $ ext{ extepsilon}$-approximations for projective clustering.
The subspace dimension is independent of the number of clusters.
The approach significantly reduces space complexity in streaming scenarios.
Abstract
Let be a set of points in . In the projective clustering problem, given and norm , we have to compute a set of -dimensional flats such that is minimized; here represents the (Euclidean) distance of to the closest flat in . We let denote the minimal value and interpret to be . When and and , the problem corresponds to the -median, -mean and the -center clustering problems respectively. For every , and , we show that the orthogonal projection of onto a randomly chosen flat of dimension will -approximate . This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Computational Geometry and Mesh Generation · Sparse and Compressive Sensing Techniques
