Binary embeddings with structured hashed projections
Anna Choromanska, Krzysztof Choromanski, Mariusz Bojarski and, Tony Jebara, Sanjiv Kumar, Yann LeCun

TL;DR
This paper investigates the use of structured random projections for binary embeddings, providing theoretical guarantees for distance preservation and demonstrating their efficiency and effectiveness in neural networks and classifiers.
Contribution
It introduces the first theoretical analysis of structured matrices in nonlinear hashing, extending Johnson-Lindenstrauss results and confirming their practical utility.
Findings
Structured matrices preserve angular distances accurately.
Proven theoretical guarantees for structured projections in nonlinear settings.
Empirical validation shows effectiveness in neural networks and classifiers.
Abstract
We consider the hashing mechanism for constructing binary embeddings, that involves pseudo-random projections followed by nonlinear (sign function) mappings. The pseudo-random projection is described by a matrix, where not all entries are independent random variables but instead a fixed "budget of randomness" is distributed across the matrix. Such matrices can be efficiently stored in sub-quadratic or even linear space, provide reduction in randomness usage (i.e. number of required random values), and very often lead to computational speed ups. We prove several theoretical results showing that projections via various structured matrices followed by nonlinear mappings accurately preserve the angular distance between input high-dimensional vectors. To the best of our knowledge, these results are the first that give theoretical ground for the use of general structured matrices in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Face and Expression Recognition · Sparse and Compressive Sensing Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
