Joint Upper & Lower Bound Normalization for IR Evaluation
Shubhra Kanti Karmaker Santu, Dongji Feng

TL;DR
This paper introduces a new evaluation framework for IR metrics that incorporates query-specific lower-bound normalization, improving the discriminatory power and consistency of metrics like nDCG and MAP across datasets.
Contribution
The paper proposes a novel lower-bound normalization for IR metrics, providing a new perspective and demonstrating its benefits through empirical case-studies.
Findings
UL normalized metrics show higher discriminatory power.
UL normalization reduces significance of differences in uninformative queries.
UL normalized metrics demonstrate better consistency across datasets.
Abstract
In this paper, we present a novel perspective towards IR evaluation by proposing a new family of evaluation metrics where the existing popular metrics (e.g., nDCG, MAP) are customized by introducing a query-specific lower-bound (LB) normalization term. While original nDCG, MAP etc. metrics are normalized in terms of their upper bounds based on an ideal ranked list, a corresponding LB normalization for them has not yet been studied. Specifically, we introduce two different variants of the proposed LB normalization, where the lower bound is estimated from a randomized ranking of the corresponding documents present in the evaluation set. We next conducted two case-studies by instantiating the new framework for two popular IR evaluation metric (with two variants, e.g., DCG_UL_V1,2 and MSP_UL_V1,2 ) and then comparing against the traditional metric without the proposed LB normalization.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Advanced Image and Video Retrieval Techniques · Face and Expression Recognition
