An Effectiveness Metric for Ordinal Classification: Formal Properties   and Experimental Results

Enrique Amig\'o; Julio Gonzalo; Stefano Mizzaro; Jorge; Carrillo-de-Albornoz

arXiv:2006.01245·cs.CL·February 22, 2022

An Effectiveness Metric for Ordinal Classification: Formal Properties and Experimental Results

Enrique Amig\'o, Julio Gonzalo, Stefano Mizzaro, Jorge, Carrillo-de-Albornoz

PDF

1 Repo

TL;DR

This paper introduces a new metric for ordinal classification that effectively captures class ordering information, grounded in Measurement and Information Theory, and demonstrates its advantages through theoretical analysis and experiments.

Contribution

It proposes the Closeness Evaluation Measure, a novel metric that unifies and generalizes existing evaluation metrics for ordinal classification tasks.

Findings

01

The new metric captures quality aspects from multiple traditional metrics.

02

It generalizes popular nominal and interval scale metrics.

03

Experimental results validate its effectiveness on synthetic and real data.

Abstract

In Ordinal Classification tasks, items have to be assigned to classes that have a relative ordering, such as positive, neutral, negative in sentiment analysis. Remarkably, the most popular evaluation metrics for ordinal classification tasks either ignore relevant information (for instance, precision/recall on each of the classes ignores their relative ordering) or assume additional information (for instance, Mean Average Error assumes absolute distances between classes). In this paper we propose a new metric for Ordinal Classification, Closeness Evaluation Measure, that is rooted on Measurement Theory and Information Theory. Our theoretical analysis and experimental results over both synthetic data and data from NLP shared tasks indicate that the proposed metric captures quality aspects from different traditional tasks simultaneously. In addition, it generalizes some popular…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EvALLTEAM/EvALLToolkit
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.