TL;DR
This paper introduces RoarGraph, a novel graph index designed for efficient cross-modal approximate nearest neighbor search, especially for out-of-distribution queries, significantly improving search speed and accuracy.
Contribution
The paper presents RoarGraph, a projected bipartite graph that leverages query distribution insights to enhance cross-modal ANNS performance for OOD workloads.
Findings
RoarGraph achieves up to 3.56x faster search speed at 90% recall.
It effectively handles OOD queries by addressing their spatial deviation.
The approach outperforms existing methods on modern datasets.
Abstract
Approximate Nearest Neighbor Search (ANNS) is a fundamental and critical component in many applications, including recommendation systems and large language model-based applications. With the advancement of multimodal neural models, which transform data from different modalities into a shared high-dimensional space as feature vectors, cross-modal ANNS aims to use the data vector from one modality (e.g., texts) as the query to retrieve the most similar items from another (e.g., images or videos). However, there is an inherent distribution gap between embeddings from different modalities, and cross-modal queries become Out-of-Distribution (OOD) to the base data. Consequently, state-of-the-art ANNS approaches suffer poor performance for OOD workloads. In this paper, we quantitatively analyze the properties of the OOD workloads to gain an understanding of their ANNS efficiency. Unlike…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Balanced Selection
