The space complexity of inner product filters
Rasmus Pagh, Johan Sivertsen

TL;DR
This paper investigates the minimal deterministic space complexity for inner product estimation between high-dimensional vectors, providing tight bounds and an improved upper bound for distinguishing inner products above or below certain thresholds.
Contribution
It establishes tight space complexity bounds for deterministic inner product estimation, improving previous bounds and handling the case where vectors are known or unknown.
Findings
Exact space bounds are characterized as $d \, \log_2(\frac{\sqrt{1-\beta}}{\varepsilon}) \pm \Theta(d)$ bits.
The upper bound is constructive and improves prior results by up to a factor of 2.
The lower bound applies even when one vector is known exactly, ensuring tightness of the bounds.
Abstract
Motivated by the problem of filtering candidate pairs in inner product similarity joins we study the following inner product estimation problem: Given parameters , and unit vectors consider the task of distinguishing between the cases and where is the inner product of vectors and . The goal is to distinguish these cases based on information on each vector encoded independently in a bit string of the shortest length possible. In contrast to much work on compressing vectors using randomized dimensionality reduction, we seek to solve the problem deterministically, with no probability of error. Inner product estimation can be solved in general via estimating with an additive error bounded by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
