With Argus Eyes: Assessing Retrieval Gaps via Uncertainty Scoring to Detect and Remedy Retrieval Blind Spots
Zeinab Sadat Taghavi, Ali Modarressi, Hinrich Schutze, Andreas Marfurt

TL;DR
This paper identifies and predicts retrieval blind spots in neural retrievers used in RAG systems, and introduces ARGUS to improve retrievability of low-score entities through targeted augmentation, enhancing system robustness.
Contribution
It reveals the existence of retrieval blind spots caused by training biases, proposes RPS for pre-index prediction, and introduces ARGUS for targeted document augmentation to address these blind spots.
Findings
Blind spots can be predicted from embedding geometry without retrieval evaluation.
ARGUS improves retriever performance by up to 4.5 nDCG points.
Preemptive blind spot mitigation enhances RAG system robustness.
Abstract
Reliable retrieval-augmented generation (RAG) systems depend fundamentally on the retriever's ability to find relevant information. We show that neural retrievers used in RAG systems have blind spots, which we define as the failure to retrieve entities that are relevant to the query, but have low similarity to the query embedding. We investigate the training-induced biases that cause such blind spot entities to be mapped to inaccessible parts of the embedding space, resulting in low retrievability. Using a large-scale dataset constructed from Wikidata relations and first paragraphs of Wikipedia, and our proposed Retrieval Probability Score (RPS), we show that blind spot risk in standard retrievers (e.g., CONTRIEVER, REASONIR) can be predicted pre-index from entity embedding geometry, avoiding expensive retrieval evaluations. To address these blind spots, we introduce ARGUS, a pipeline…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Topic Modeling · Advanced Graph Neural Networks
