TL;DR
This paper introduces a novel retrieval method that uses multiple query vectors and anomalous pattern detection to improve retrieval accuracy across various datasets.
Contribution
The work proposes a new retrieval approach that considers multiple queries simultaneously and leverages anomalous pattern detection to identify relevant vectors.
Findings
Larger query sets generally improve retrieval performance.
Performance gains are most significant when increasing from 1 to 8 queries.
The method is validated on image, text, and tabular datasets.
Abstract
A classical vector retrieval problem typically considers a \emph{single} query embedding vector as input and retrieves the most similar embedding vectors from a vector database. However, complex reasoning and retrieval tasks frequently require \emph{multiple query vectors}, rather than a single one. In this work, we propose a retrieval method that considers multiple query vectors simultaneously and retrieves the most relevant vectors from the database using concepts from anomalous pattern detection. Specifically, our approach leverages a set of query vectors (with ), and identifies the subset of vector dimensions within that standout (anomalous) from the rest of dimensions. Next, we scan the vector database to retrieve the set of vectors that are also anomalous across the previously identified vector dimensions and return them as our retrieved set of vectors. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
