From HNSW to Information-Theoretic Binarization: Rethinking the Architecture of Scalable Vector Search
Seyed Moein Abtahi, Majid Fekri, Tara Khani, Akramul Azim

TL;DR
This paper proposes an information-theoretic binary vector search architecture that offers comparable accuracy to traditional high-precision methods but with lower latency, cost, and better scalability for semantic search systems.
Contribution
It introduces a novel binarization and scoring framework that enables exhaustive binary search, eliminating accuracy loss and reducing operational costs compared to existing ANN architectures.
Findings
Comparable retrieval quality to full-precision systems
Significantly lower latency and higher throughput at scale
Enables serverless, cost-effective deployment models
Abstract
Modern semantic search and retrieval-augmented generation (RAG) systems rely predominantly on in-memory approximate nearest neighbor (ANN) indexes over high-precision floating-point vectors, resulting in escalating operational cost and inherent trade-offs between latency, throughput, and retrieval accuracy. This paper analyzes the architectural limitations of the dominant "HNSW + float32 + cosine similarity" stack and evaluates existing cost-reduction strategies, including storage disaggregation and lossy vector quantization, which inevitably sacrifice either performance or accuracy. We introduce and empirically evaluate an alternative information-theoretic architecture based on maximally informative binarization (MIB), efficient bitwise distance metrics, and an information-theoretic scoring (ITS) mechanism. Unlike conventional ANN systems, this approach enables exhaustive search over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Advanced Database Systems and Queries · Data Management and Algorithms
