Efficient data hashing with structured binary embeddings
Krzysztof Choromanski

TL;DR
This paper introduces a novel binary hashing method using highly structured embedding matrices, achieving efficient data hashing with strong theoretical guarantees and reduced memory usage without compromising hash quality.
Contribution
It presents the first theoretical analysis of structured binary hashing mechanisms, including circulant and Toeplitz matrices, demonstrating their effectiveness and efficiency.
Findings
Efficient hashing with structured matrices like circulant and Toeplitz.
Strong theoretical guarantees for the proposed binary hashing method.
Reduced memory usage without loss of hash quality.
Abstract
We present here new mechanisms for hashing data via binary embeddings. Contrary to most of the techniques presented before, the embedding matrix of our mechanism is highly structured. That enables us to perform hashing more efficiently and use less memory. What is crucial and nonintuitive is the fact that imposing structured mechanism does not affect the quality of the produced hash. To the best of our knowledge, we are the first to give strong theoretical guarantees of the proposed binary hashing method by proving the efficiency of the mechanism for several classes of structured projection matrices. As a corollary, we obtain binary hashing mechanisms with strong concentration results for circulant and Topelitz matrices. Our approach is however much more general.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Image and Video Retrieval Techniques · Advanced Steganography and Watermarking Techniques
