On Sparsity and Sub-Gaussianity in the Johnson-Lindenstrauss Lemma

Aur\'elien Garivier (UMPA-ENSL; MC2); Emmanuel Pilliat (UMPA-ENSL)

arXiv:2409.06275·math.ST·September 25, 2024

On Sparsity and Sub-Gaussianity in the Johnson-Lindenstrauss Lemma

Aur\'elien Garivier (UMPA-ENSL, MC2), Emmanuel Pilliat (UMPA-ENSL)

PDF

Open Access

TL;DR

This paper offers a straightforward proof of the Johnson-Lindenstrauss lemma for sub-Gaussian variables and explores the impact of sparsity on the projection dimension, balancing computational efficiency and accuracy.

Contribution

It provides a simple proof emphasizing sub-Gaussianity and extends the analysis to sparse projections, highlighting the trade-offs involved.

Findings

01

Sub-Gaussian variables suffice for Johnson-Lindenstrauss projections.

02

Sparsity in projections reduces computational cost.

03

Sparsity limits the achievable target dimension.

Abstract

We provide a simple proof of the Johnson-Lindenstrauss lemma for sub-Gaussian variables. We extend the analysis to identify how sparse projections can be, and what the cost of sparsity is on the target dimension.The Johnson-Lindenstrauss lemma is the theoretical core of the dimensionality reduction methods based on random projections. While its original formulation involves matrices with Gaussian entries, the computational cost of random projections can be drastically reduced by the use of simpler variables, especially if they vanish with a high probability. In this paper, we propose a simple and elementary analysis of random projections under classical assumptions that emphasizes the key role of sub-Gaussianity. Furthermore, we show how to extend it to sparse projections, emphasizing the limits induced by the sparsity of the data itself.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic processes and financial applications