Multi-Resolution Hashing for Fast Pairwise Summations
Moses Charikar, Paris Siminelakis

TL;DR
This paper introduces a novel hashing-based framework for efficiently approximating pairwise summations over large datasets, especially for functions depending on unknown query vectors, achieving sublinear query times.
Contribution
It develops a new method combining harmonic analysis and hashing techniques to enable fast, approximate pairwise summations for a broad class of functions, surpassing previous approaches.
Findings
Achieves sublinear query time for pairwise summation approximation.
Constructs efficient hashing schemes for log-convex functions of inner products.
Extends framework to vector functions and general Euclidean spaces.
Abstract
A basic computational primitive in the analysis of massive datasets is summing simple functions over a large number of objects. Modern applications pose an additional challenge in that such functions often depend on a parameter vector (query) that is unknown a priori. Given a set of points and a pairwise function , we study the problem of designing a data-structure that enables sublinear-time approximation of the summation for any query . By combining ideas from Harmonic Analysis (partitions of unity and approximation theory) with Hashing-Based-Estimators [Charikar, Siminelakis FOCS'17], we provide a general framework for designing such data structures through hashing that reaches far beyond what previous techniques allowed. A key design…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
