Streamlining Evaluation with ir-measures
Sean MacAvaney, Craig Macdonald, Iadh Ounis

TL;DR
ir-measures is a new tool that simplifies the calculation of various information retrieval evaluation metrics by providing a unified interface to multiple tools, facilitating easier and more comprehensive system evaluation.
Contribution
It introduces a unified tool that automates and simplifies the calculation of diverse IR evaluation measures, including recent proposals, encouraging broader adoption.
Findings
Streamlines evaluation process for IR systems
Supports recent and traditional evaluation measures
Facilitates adoption of new metrics
Abstract
We present ir-measures, a new tool that makes it convenient to calculate a diverse set of evaluation measures used in information retrieval. Rather than implementing its own measure calculations, ir-measures provides a common interface to a handful of evaluation tools. The necessary tools are automatically invoked (potentially multiple times) to calculate all the desired metrics, simplifying the evaluation process for the user. The tool also makes it easier for researchers to use recently-proposed measures (such as those from the C/W/L framework) alongside traditional measures, potentially encouraging their adoption.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Recommender Systems and Techniques · Data Quality and Management
