Understanding Sparse JL for Feature Hashing

Meena Jagadeesan

arXiv:1903.03605·stat.ML·March 27, 2020·6 cites

Understanding Sparse JL for Feature Hashing

Meena Jagadeesan

PDF

Open Access 1 Repo

TL;DR

This paper explores the use of higher sparsity levels in sparse Johnson-Lindenstrauss transforms for feature hashing, showing both theoretical advantages and empirical improvements in norm preservation over the sparsity 1 case.

Contribution

It provides a tight tradeoff analysis for sparse JL with general sparsity s, extending previous work and demonstrating benefits of s > 1 in feature vector applications.

Findings

01

Theoretical demonstration of improved norm preservation with s > 1.

02

Empirical evidence supporting the advantages of higher sparsity.

03

Generalization of previous tight tradeoff results for sparse JL.

Abstract

Feature hashing and other random projection schemes are commonly used to reduce the dimensionality of feature vectors. The goal is to efficiently project a high-dimensional feature vector living in $R^{n}$ into a much lower-dimensional space $R^{m}$ , while approximately preserving Euclidean norm. These schemes can be constructed using sparse random projections, for example using a sparse Johnson-Lindenstrauss (JL) transform. A line of work introduced by Weinberger et. al (ICML '09) analyzes the accuracy of sparse JL with sparsity 1 on feature vectors with small $ℓ_{\infty}$ -to- $ℓ_{2}$ norm ratio. Recently, Freksen, Kamma, and Larsen (NeurIPS '18) closed this line of work by proving a tight tradeoff between $ℓ_{\infty}$ -to- $ℓ_{2}$ norm ratio and accuracy for sparse JL with sparsity $1$ . In this paper, we demonstrate the benefits of using sparsity $s$ greater than $1$ …

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mjagadeesan/sparsejl-featurehashing
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques