Principled Multi-Aspect Evaluation Measures of Rankings

Maria Maistro; Lucas Chaves Lima; Jakob Grue Simonsen; Christina Lioma

arXiv:2212.00492·cs.IR·December 2, 2022

Principled Multi-Aspect Evaluation Measures of Rankings

Maria Maistro, Lucas Chaves Lima, Jakob Grue Simonsen, Christina Lioma

PDF

1 Repo

TL;DR

This paper introduces a new, theoretically sound multi-aspect evaluation measure for ranking systems that can handle any number and type of aspects, improving discrimination over existing methods.

Contribution

It proposes a principled evaluation method for multiple aspects of rankings that addresses limitations of previous approaches and is applicable to any number and type of aspects.

Findings

01

More discriminative than state-of-the-art methods

02

Handles any number and type of aspects

03

Overcomes theoretical anomalies of previous measures

Abstract

Information Retrieval evaluation has traditionally focused on defining principled ways of assessing the relevance of a ranked list of documents with respect to a query. Several methods extend this type of evaluation beyond relevance, making it possible to evaluate different aspects of a document ranking (e.g., relevance, usefulness, or credibility) using a single measure (multi-aspect evaluation). However, these methods either are (i) tailor-made for specific aspects and do not extend to other types or numbers of aspects, or (ii) have theoretical anomalies, e.g. assign maximum score to a ranking where all documents are labelled with the lowest grade with respect to all aspects (e.g., not relevant, not credible, etc.). We present a theoretically principled multi-aspect evaluation method that can be used for any number, and any type, of aspects. A thorough empirical evaluation using up…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lcschv/toma
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.