Optimization of Latent-Space Compression using Game-Theoretic Techniques for Transformer-Based Vector Search

Kushagra Agrawal; Nisharg Nargund; Oishani Banerjee

arXiv:2508.18877·cs.IR·February 3, 2026

Optimization of Latent-Space Compression using Game-Theoretic Techniques for Transformer-Based Vector Search

Kushagra Agrawal, Nisharg Nargund, Oishani Banerjee

PDF

TL;DR

This paper introduces a game-theoretic approach to optimize latent-space compression in transformer-based vector search, significantly improving similarity and utility while balancing efficiency, and can be integrated into existing retrieval systems.

Contribution

It presents a novel game-theoretic framework for latent-space compression that enhances semantic similarity and efficiency in vector search systems.

Findings

01

Higher average similarity (0.9981 vs. 0.5517) achieved

02

Improved utility scores (0.8873 vs. 0.5194)

03

Modest increase in query time

Abstract

Vector similarity search plays a pivotal role in modern information retrieval systems, especially when powered by transformer-based embeddings. However, the scalability and efficiency of such systems are often hindered by the high dimensionality of latent representations. In this paper, we propose a novel game-theoretic framework for optimizing latent-space compression to enhance both the efficiency and semantic utility of vector search. By modeling the compression strategy as a zero-sum game between retrieval accuracy and storage efficiency, we derive a latent transformation that preserves semantic similarity while reducing redundancy. We benchmark our method against FAISS, a widely-used vector search library, and demonstrate that our approach achieves a significantly higher average similarity (0.9981 vs. 0.5517) and utility (0.8873 vs. 0.5194), albeit with a modest increase in query…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.