Response to Moffat's Comment on "Towards Meaningful Statements in IR Evaluation: Mapping Evaluation Measures to Interval Scales"
Marco Ferrante, Nicola Ferro, Norbert Fuhr

TL;DR
This paper defends the importance of measurement theory and the concept of meaningfulness in IR evaluation, emphasizing their role in producing valid, robust, and generalizable results, especially amid ongoing debates about scale types.
Contribution
It clarifies misconceptions about measurement scales, highlights the significance of meaningfulness, and advocates for a theoretical foundation to improve IR evaluation practices.
Findings
Evaluation measures' scale types impact statistical analysis.
Meaningfulness ensures invariance of experimental inferences.
Theoretical grounding can enhance robustness of IR evaluation.
Abstract
Moffat recently commented on our previous work. Our work focused on how laying the foundations of our evaluation methodology into the theory of measurement can improve our knowledge and understanding of the evaluation measures we use in IR and how it can shed light on the different types of scales adopted by our evaluation measures; we also provided evidence, through extensive experimentation, on the impact of the different types of scales on the statistical analyses, as well as on the impact of departing from their assumptions. Moreover, we investigated, for the first time in IR, the concept of meaningfulness, i.e. the invariance of the experimental statements and inferences you draw, and proposed it as a way to ensure more valid and generalizabile results. Moffat's comments build on: (i) misconceptions about the representational theory of measurement, such as what an interval scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Statistical and Computational Modeling · Neural Networks and Applications
MethodsNone
