Sublinear Sketches for Approximate Nearest Neighbor and Kernel Density Estimation
Ved Danait, Srijan Das, Sujoy Bhore

TL;DR
This paper introduces new sublinear space and time sketching algorithms for approximate nearest neighbor search and kernel density estimation in dynamic data streams, with theoretical guarantees and practical validation.
Contribution
It presents the first sublinear sketches for ANN and A-KDE that balance memory, accuracy, and efficiency in streaming models, extending prior work on exact methods.
Findings
Achieves sublinear memory and query time for ANN in streaming models.
Provides the first sublinear sketch guarantee for A-KDE in the Sliding-Window model.
Experimental results show low error and lightweight sketches on real datasets.
Abstract
Approximate Nearest Neighbor (ANN) search and Approximate Kernel Density Estimation (A-KDE) are fundamental problems at the core of modern machine learning, with broad applications in data analysis, information systems, and large-scale decision making. In massive and dynamic data streams, a central challenge is to design compact sketches that preserve essential structural properties of the data while enabling efficient queries. In this work, we develop new sketching algorithms that achieve sublinear space and query time guarantees for both ANN and A-KDE for a dynamic stream of data. For ANN in the streaming model, under natural assumptions, we design a sublinear sketch that requires only memory by storing only a sublinear () fraction of the total inputs, where is a parameter of the LSH family, and . Our method supports…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
