Entities as Retrieval Signals: A Systematic Study of Coverage, Supervision, and Evaluation in Entity-Oriented Ranking
Shubham Chatterjee

TL;DR
This paper investigates the limitations of entity-based retrieval, revealing that evaluation issues, not model failure, hinder performance, and emphasizes the need for better datasets and evaluation metrics.
Contribution
It systematically analyzes the gap between semantic relevance and discriminative entity signals, highlighting the importance of dataset quality and evaluation methods.
Findings
Entity signals cover only 19.7% of relevant documents.
Most neural rerankers do not significantly outperform baseline models.
Supervision strategies focus on semantic relatedness rather than discriminativeness.
Abstract
Entity-oriented retrieval assumes that relevant documents exhibit query-relevant entities, yet evaluations report conflicting results. We show this inconsistency stems not from model failure, but from evaluation. On TREC Robust04, we evaluate six neural rerankers and 437 unsupervised configurations against BM25. Across 443 systems, none improves MAP by more than 0.05 under open-world evaluation over the full candidate set, despite strong gains under entity-restricted settings. The best configuration matches the official Robust04 best system and outperforms most neural rerankers, indicating that the architecture is not the limiting factor. Instead, the bottleneck is the entity channel: even under idealized selection, entity signals cover only 19.7\% of relevant documents, and no method achieves both high coverage and discrimination. We explain this via a distinction between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
