Categorical, Ratio, and Professorial Data: The Case for Reciprocal Rank

Alistair Moffat

arXiv:2312.12672·cs.IR·December 21, 2023·21 cites

Categorical, Ratio, and Professorial Data: The Case for Reciprocal Rank

Alistair Moffat

PDF

Open Access

TL;DR

This paper advocates for using reciprocal rank as a ratio-scale measure for search engine results, enabling meaningful averaging and comparison of retrieval systems beyond traditional categorical methods.

Contribution

It supports the use of reciprocal rank as a valid ratio-scale metric for SERP evaluation, challenging restrictive conditions on effectiveness mappings.

Findings

01

Reciprocal rank can be justified as a ratio-scale measure.

02

Averaging SERP scores is valid under certain effectiveness mappings.

03

Supports broader use of numeric metrics in search evaluation.

Abstract

Search engine results pages are usually abstracted as binary relevance vectors and hence are categorical data, meaning that only a limited set of operations is permitted, most notably tabulation of occurrence frequencies, with determination of medians and averages not possible. To compare retrieval systems it is thus usual to make use of a categorical-to-numeric effectiveness mapping. A previous paper has argued that any desired categorical-to-numeric mapping may be used, provided only that there is an argued connection between each category of SERP and the score that is assigned to that category by the mapping. Further, once that plausible connection has been established, then the mapped values can be treated as real-valued observations on a ratio scale, allowing the computation of averages. This article is written in support of that point of view, and to respond to ongoing claims that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques · Semantic Web and Ontologies · Rough Sets and Fuzzy Logic

MethodsSparse Evolutionary Training