Breaking the Curse of Dimensionality: On the Stability of Modern Vector Retrieval
Vihan Lakshman, Blaise Munyampirwa, Julian Shun, Benjamin Coleman

TL;DR
This paper investigates the stability of high-dimensional vector retrieval methods, demonstrating how certain metrics and conditions can mitigate the curse of dimensionality in practical applications.
Contribution
We extend stability theory to various vector retrieval settings, providing new conditions and insights to improve high-dimensional search robustness.
Findings
Chamfer distance preserves stability in multi-vector search
Large penalties can induce stability in filtered search
Novel stability conditions for sparse vector search
Abstract
Modern vector databases enable efficient retrieval over high-dimensional neural embeddings, powering applications from web search to retrieval-augmented generation. However, classical theory predicts such tasks should suffer from the curse of dimensionality, where distances between points become nearly indistinguishable, thereby crippling efficient nearest-neighbor search. We revisit this paradox through the lens of stability, the property that small perturbations to a query do not radically alter its nearest neighbors. Building on foundational results, we extend stability theory to three key retrieval settings widely used in practice: (i) multi-vector search, where we prove that the popular Chamfer distance metric preserves single-vector stability, while average pooling aggregation may destroy it; (ii) filtered vector search, where we show that sufficiently large penalties for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Information Retrieval and Search Behavior · Face and Expression Recognition
