ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge

Zeinab Sadat Taghavi; Ali Modarressi; Yunpu Ma; Hinrich Sch\"utze

arXiv:2506.14407·cs.CL·September 26, 2025

ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge

Zeinab Sadat Taghavi, Ali Modarressi, Yunpu Ma, Hinrich Sch\"utze

PDF

Open Access 1 Repo 1 Datasets

TL;DR

ImpliRet introduces a benchmark that evaluates document-side reasoning capabilities in retrieval systems, revealing significant challenges for current models in understanding implicit facts through complex relationships.

Contribution

The paper presents ImpliRet, a novel benchmark focusing on implicit fact retrieval in documents, shifting the reasoning challenge from query to document side, and evaluates various retrieval models on this task.

Findings

01

Current retrieval models perform poorly on implicit fact retrieval, with nDCG@10 only 14.91%.

02

Even long-context models like GPT-4-mini struggle, scoring only 55.54% with thirty documents.

03

Document-side reasoning remains a significant challenge for NLP retrieval systems.

Abstract

Retrieval systems are central to many NLP pipelines, but often rely on surface-level cues such as keyword overlap and lexical semantic similarity. To evaluate retrieval beyond these shallow signals, recent benchmarks introduce reasoning-heavy queries; however, they primarily shift the burden to query-side processing techniques -- like prompting or multi-hop retrieval -- that can help resolve complexity. In contrast, we present Impliret, a benchmark that shifts the reasoning challenge to document-side processing: The queries are simple, but relevance depends on facts stated implicitly in documents through temporal (e.g., resolving "two days ago"), arithmetic, and world knowledge relationships. We evaluate a range of sparse and dense retrievers, all of which struggle in this setting: the best nDCG@10 is only 14.91%. We also test whether long-context models can overcome this limitation.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zeinabtaghavi/impliret
pytorchOfficial

Datasets

zeinabTaghavi/ImpliRet
dataset· 88 dl
88 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsLayer Normalization · Dropout · Absolute Position Encodings · Dense Connections · Byte Pair Encoding · Softmax · Label Smoothing · Transformer · GPT-4