Adaptive Metric Dimensionality Reduction

Lee-Ad Gottlieb; Aryeh Kontorovich; Robert Krauthgamer

arXiv:1302.2752·cs.LG·March 26, 2015

Adaptive Metric Dimensionality Reduction

Lee-Ad Gottlieb, Aryeh Kontorovich, Robert Krauthgamer

PDF

TL;DR

This paper introduces an adaptive, data-dependent approach to dimensionality reduction in metric spaces, providing theoretical bounds and an efficient algorithm analogous to PCA, to improve supervised learning tasks.

Contribution

It offers a new generalization bound for Lipschitz functions in nearly doubling metric spaces and proposes an efficient PCA-like algorithm for intrinsic dimension approximation.

Findings

01

Generalization bounds for Lipschitz functions in doubling metric spaces

02

An efficient algorithm approximating data's intrinsic dimension

03

Enhanced efficiency and generalization in supervised learning

Abstract

We study adaptive data-dependent dimensionality reduction in the context of supervised learning in general metric spaces. Our main statistical contribution is a generalization bound for Lipschitz functions in metric spaces that are doubling, or nearly doubling. On the algorithmic front, we describe an analogue of PCA for metric spaces: namely an efficient procedure that approximates the data's intrinsic dimension, which is often much lower than the ambient dimension. Our approach thus leverages the dual benefits of low dimensionality: (1) more efficient algorithms, e.g., for proximity search, and (2) more optimistic generalization bounds.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPrincipal Components Analysis