On Sparsity and Sub-Gaussianity in the Johnson-Lindenstrauss Lemma
Aur\'elien Garivier (UMPA-ENSL, MC2), Emmanuel Pilliat (UMPA-ENSL)

TL;DR
This paper offers a straightforward proof of the Johnson-Lindenstrauss lemma for sub-Gaussian variables and explores the impact of sparsity on the projection dimension, balancing computational efficiency and accuracy.
Contribution
It provides a simple proof emphasizing sub-Gaussianity and extends the analysis to sparse projections, highlighting the trade-offs involved.
Findings
Sub-Gaussian variables suffice for Johnson-Lindenstrauss projections.
Sparsity in projections reduces computational cost.
Sparsity limits the achievable target dimension.
Abstract
We provide a simple proof of the Johnson-Lindenstrauss lemma for sub-Gaussian variables. We extend the analysis to identify how sparse projections can be, and what the cost of sparsity is on the target dimension.The Johnson-Lindenstrauss lemma is the theoretical core of the dimensionality reduction methods based on random projections. While its original formulation involves matrices with Gaussian entries, the computational cost of random projections can be drastically reduced by the use of simpler variables, especially if they vanish with a high probability. In this paper, we propose a simple and elementary analysis of random projections under classical assumptions that emphasizes the key role of sub-Gaussianity. Furthermore, we show how to extend it to sparse projections, emphasizing the limits induced by the sparsity of the data itself.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications
