PosIR: Position-Aware Heterogeneous Information Retrieval Benchmark
Ziyang Zeng, Dun Zhang, Yu Yan, Xu Sun, Cuiqiaoshu Pan, Yudong Zhou, Yuqing Yang

TL;DR
PosIR introduces a comprehensive benchmark to systematically diagnose position bias in information retrieval across multiple languages and domains, revealing pervasive biases in current models and providing insights into their internal mechanisms.
Contribution
This paper presents the first standardized benchmark, PosIR, for diagnosing position bias in diverse retrieval scenarios, with a novel length-controlled bucketing strategy to isolate positional effects.
Findings
Embedding models show poor performance on long documents in PosIR.
Position bias is widespread and increases with document length.
Gradient saliency analysis reveals distinct internal mechanisms for positional preferences.
Abstract
In real-world documents, the information relevant to a user query may reside anywhere from the beginning to the end. This makes position bias -- a systematic tendency of retrieval models to favor or neglect content based on its location -- a critical concern. Although recent studies have identified such bias, existing analyses focus predominantly on English, fail to disentangle document length from information position, and lack a standardized framework for systematic diagnosis. To address these limitations, we introduce PosIR (Position-Aware Information Retrieval), the first standardized benchmark designed to systematically diagnose position bias in diverse retrieval scenarios. PosIR comprises 310 datasets spanning 10 languages and 31 domains, with relevance tied to precise reference spans. At its methodological core, PosIR employs a length-controlled bucketing strategy that groups…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Information Retrieval and Search Behavior · Topic Modeling
