Randomized Dimensionality Reduction for Euclidean Maximization and Diversity Measures
Jie Gao, Rajesh Jayaram, Benedikt Kolbe, Shay Sapir, Chris Schwiegelshohn, Sandeep Silwal, Erik Waingarten

TL;DR
This paper demonstrates that randomized dimensionality reduction preserves the quality of solutions for various Euclidean maximization and diversity problems, with the required dimension depending on the dataset's intrinsic doubling dimension.
Contribution
It establishes that a target dimension proportional to the dataset's doubling dimension suffices for approximate solution preservation, contrasting classical results that depend on dataset size.
Findings
Dimension reduction preserves near-optimal solutions effectively.
Target dimension of O(λ_X) is sufficient and necessary for certain problems.
Empirical results show solution quality and speedups in practice.
Abstract
Randomized dimensionality reduction is a widely-used algorithmic technique for speeding up large-scale Euclidean optimization problems. In this paper, we study dimension reduction for a variety of maximization problems, including max-matching, max-spanning tree, max TSP, as well as various measures for dataset diversity. For these problems, we show that the effect of dimension reduction is intimately tied to the \emph{doubling dimension} of the underlying dataset -- a quantity measuring intrinsic dimensionality of point sets. Specifically, we prove that a target dimension of suffices to approximately preserve the value of any near-optimal solution,which we also show is necessary for some of these problems. This is in contrast to classical dimension reduction results, whose dependence increases with the dataset size . We also provide empirical results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsBayesian Methods and Mixture Models
