Scaling Laws for Embedding Dimension in Information Retrieval
Julian Killingback, Mahta Rafiee, Madine Manas, Hamed Zamani

TL;DR
This paper investigates how increasing embedding dimensions affects dense retrieval performance, revealing power-law scaling behavior and providing practical guidelines for model and embedding size selection.
Contribution
It offers a comprehensive analysis of the relationship between embedding dimension and retrieval performance, deriving scaling laws and highlighting diminishing returns and potential performance degradation.
Findings
Performance scales with embedding dimension following a power law.
Diminishing returns observed as embedding size increases.
Performance may degrade on less aligned tasks with larger embeddings.
Abstract
Dense retrieval, which encodes queries and documents into a single dense vector, has become the dominant neural retrieval approach due to its simplicity and compatibility with fast approximate nearest neighbor algorithms. As the tasks dense retrieval performs grow in complexity, the fundamental limitations of the underlying data structure and similarity metric -- namely vectors and inner-products -- become more apparent. Prior recent work has shown theoretical limitations inherent to single vectors and inner-products that are generally tied to the embedding dimension. Given the importance of embedding dimension for retrieval capacity, understanding how dense retrieval performance changes as embedding dimension is scaled is fundamental to building next generation retrieval models that balance effectiveness and efficiency. In this work, we conduct a comprehensive analysis of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Neural Networks and Applications · Image Retrieval and Classification Techniques
