Optimization of Latent-Space Compression using Game-Theoretic Techniques for Transformer-Based Vector Search
Kushagra Agrawal, Nisharg Nargund, Oishani Banerjee

TL;DR
This paper introduces a game-theoretic approach to optimize latent-space compression in transformer-based vector search, significantly improving similarity and utility while balancing efficiency, and can be integrated into existing retrieval systems.
Contribution
It presents a novel game-theoretic framework for latent-space compression that enhances semantic similarity and efficiency in vector search systems.
Findings
Higher average similarity (0.9981 vs. 0.5517) achieved
Improved utility scores (0.8873 vs. 0.5194)
Modest increase in query time
Abstract
Vector similarity search plays a pivotal role in modern information retrieval systems, especially when powered by transformer-based embeddings. However, the scalability and efficiency of such systems are often hindered by the high dimensionality of latent representations. In this paper, we propose a novel game-theoretic framework for optimizing latent-space compression to enhance both the efficiency and semantic utility of vector search. By modeling the compression strategy as a zero-sum game between retrieval accuracy and storage efficiency, we derive a latent transformation that preserves semantic similarity while reducing redundancy. We benchmark our method against FAISS, a widely-used vector search library, and demonstrate that our approach achieves a significantly higher average similarity (0.9981 vs. 0.5517) and utility (0.8873 vs. 0.5194), albeit with a modest increase in query…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
