Dimensionality Reduction for Sum-of-Distances Metric
Zhili Feng, Praneeth Kacham, David P. Woodruff

TL;DR
This paper introduces a dimensionality reduction method that approximates the sum of distances to any shape in a low-dimensional subspace, enabling efficient data storage and computation for clustering and approximation problems.
Contribution
The authors present a novel algorithm that reduces data dimensionality for sum-of-distances problems, applicable to various clustering and subspace approximation tasks, with improved runtime and storage efficiency.
Findings
Achieves an $ ext{epsilon}$-approximation with a low-dimensional subspace of size $O(k^3/ ext{epsilon}^6)$.
Reduces data storage from $nnz(A)$ to $(n+d)k^3/ ext{epsilon}^6$, improving efficiency.
Provides faster algorithms for dense matrices and constructs small coresets for key problems.
Abstract
We give a dimensionality reduction procedure to approximate the sum of distances of a given set of points in to any "shape" that lies in a -dimensional subspace. Here, by "shape" we mean any set of points in . Our algorithm takes an input in the form of an matrix , where each row of denotes a data point, and outputs a subspace of dimension such that the projections of each of the points onto the subspace and the distances of each of the points to the subspace are sufficient to obtain an -approximation to the sum of distances to any arbitrary shape that lies in a -dimensional subspace of . These include important problems such as -median, -subspace approximation, and subspace clustering with . Dimensionality reduction reduces the data storage requirement to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Medical Image Segmentation Techniques · Sparse and Compressive Sensing Techniques
