TL;DR
This paper introduces probabilistic polynomials for symmetric Boolean functions, enabling a subquadratic time algorithm for Hamming nearest neighbors in high dimensions, with applications to various similarity measures.
Contribution
It provides a constructive method for low-degree probabilistic polynomials for symmetric functions and applies this to develop the first subquadratic algorithm for Hamming nearest neighbors in high dimensions.
Findings
Probabilistic polynomials of degree O(√(n log(1/ε))) for symmetric Boolean functions.
First subquadratic time algorithm for batch Hamming distance in superlogarithmic dimensions.
Applications to maximum inner product, L1 closest pair, and Jaccard similarity.
Abstract
We show how to compute any symmetric Boolean function on variables over any field (as well as the integers) with a probabilistic polynomial of degree and error at most . The degree dependence on and is optimal, matching a lower bound of Razborov (1987) and Smolensky (1987) for the MAJORITY function. The proof is constructive: a low-degree polynomial can be efficiently sampled from the distribution. This polynomial construction is combined with other algebraic ideas to give the first subquadratic time algorithm for computing a (worst-case) batch of Hamming distances in superlogarithmic dimensions, exactly. To illustrate, let . Suppose we are given a database of vectors in and a collection of query vectors in the same dimension. For all ,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Probabilistic Polynomials and Hamming Nearest Neighbors· youtube
