A Family of Rank Similarity Measures based on Maximized Effectiveness Difference
Luchen Tan, Clarke L. A. Clarke

TL;DR
This paper introduces a family of rank similarity measures based on maximizing effectiveness differences, which can compare search result rankings without relevance judgments and reflect user behavior assumptions.
Contribution
The paper proposes and validates a new family of rank similarity measures derived from effectiveness measures, with solutions for standard metrics like nDCG, MAP, and ERR.
Findings
MED is a metric regardless of the effectiveness measure.
MED reveals meaningful differences between retrieval runs.
MED satisfies key desiderata for rank similarity measures.
Abstract
Rank similarity measures provide a method for quantifying differences between search engine results without the need for relevance judgments. For example, the providers of a search service might use such measures to estimate the impact of a proposed algorithmic change across a large number of queries - perhaps millions - identifying those queries where the impact is greatest. In this paper, we propose and validate a family of rank similarity measures, each derived from an associated effectiveness measure. Each member of the family is based on the maximization of effectiveness difference under this associated measure. Computing this maximized effectiveness difference (MED) requires the solution of an optimization problem that varies in difficulty, depending on the associated measure. We present solutions for several standard effectiveness measures, including nDCG, MAP, and ERR. Through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Advanced Image and Video Retrieval Techniques · Web Data Mining and Analysis
