Understanding Sparse JL for Feature Hashing
Meena Jagadeesan

TL;DR
This paper explores the use of higher sparsity levels in sparse Johnson-Lindenstrauss transforms for feature hashing, showing both theoretical advantages and empirical improvements in norm preservation over the sparsity 1 case.
Contribution
It provides a tight tradeoff analysis for sparse JL with general sparsity s, extending previous work and demonstrating benefits of s > 1 in feature vector applications.
Findings
Theoretical demonstration of improved norm preservation with s > 1.
Empirical evidence supporting the advantages of higher sparsity.
Generalization of previous tight tradeoff results for sparse JL.
Abstract
Feature hashing and other random projection schemes are commonly used to reduce the dimensionality of feature vectors. The goal is to efficiently project a high-dimensional feature vector living in into a much lower-dimensional space , while approximately preserving Euclidean norm. These schemes can be constructed using sparse random projections, for example using a sparse Johnson-Lindenstrauss (JL) transform. A line of work introduced by Weinberger et. al (ICML '09) analyzes the accuracy of sparse JL with sparsity 1 on feature vectors with small -to- norm ratio. Recently, Freksen, Kamma, and Larsen (NeurIPS '18) closed this line of work by proving a tight tradeoff between -to- norm ratio and accuracy for sparse JL with sparsity . In this paper, we demonstrate the benefits of using sparsity greater than …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques
