Compression for Quadratic Similarity Queries: Finite Blocklength and Practical Schemes
Fabian Steiner, Steffen Dempfle, Amir Ingber, Tsachy Weissman

TL;DR
This paper investigates practical compression schemes for quadratic similarity queries, providing finite blocklength bounds, numerical evaluations, and implementation insights using spherical codes and lattices.
Contribution
It introduces a finite blocklength achievability bound based on shape-gain quantization and analyzes practical implementation with wrapped spherical codes and lattices.
Findings
Numerical results converge to asymptotic limits
Provides nonasymptotic bounds on compression performance
Demonstrates practical schemes using wrapped spherical codes
Abstract
We study the problem of compression for the purpose of similarity identification, where similarity is measured by the mean square Euclidean distance between vectors. While the asymptotical fundamental limits of the problem - the minimal compression rate and the error exponent - were found in a previous work, in this paper we focus on the nonasymptotic domain and on practical, implementable schemes. We first present a finite blocklength achievability bound based on shape-gain quantization: The gain (amplitude) of the vector is compressed via scalar quantization and the shape (the projection on the unit sphere) is quantized using a spherical code. The results are numerically evaluated and they converge to the asymptotic values as predicted by the error exponent. We then give a nonasymptotic lower bound on the performance of any compression scheme, and compare to the upper (achievability)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Data Compression Techniques · DNA and Biological Computing
