The space complexity of inner product filters

Rasmus Pagh; Johan Sivertsen

arXiv:1909.10766·cs.DS·January 14, 2020

The space complexity of inner product filters

Rasmus Pagh, Johan Sivertsen

PDF

TL;DR

This paper investigates the minimal deterministic space complexity for inner product estimation between high-dimensional vectors, providing tight bounds and an improved upper bound for distinguishing inner products above or below certain thresholds.

Contribution

It establishes tight space complexity bounds for deterministic inner product estimation, improving previous bounds and handling the case where vectors are known or unknown.

Findings

01

Exact space bounds are characterized as $d \, \log_2(\frac{\sqrt{1-\beta}}{\varepsilon}) \pm \Theta(d)$ bits.

02

The upper bound is constructive and improves prior results by up to a factor of 2.

03

The lower bound applies even when one vector is known exactly, ensuring tightness of the bounds.

Abstract

Motivated by the problem of filtering candidate pairs in inner product similarity joins we study the following inner product estimation problem: Given parameters $d \in N$ , $α > β \geq 0$ and unit vectors $x, y \in R^{d}$ consider the task of distinguishing between the cases $⟨ x, y ⟩ \leq β$ and $⟨ x, y ⟩ \geq α$ where $⟨ x, y ⟩ = \sum_{i = 1}^{d} x_{i} y_{i}$ is the inner product of vectors $x$ and $y$ . The goal is to distinguish these cases based on information on each vector encoded independently in a bit string of the shortest length possible. In contrast to much work on compressing vectors using randomized dimensionality reduction, we seek to solve the problem deterministically, with no probability of error. Inner product estimation can be solved in general via estimating $⟨ x, y ⟩$ with an additive error bounded by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.