On Rank Graduation Metrics for High Dimensional Ordinal Data
Gennaro Auricchio, Adelaide Emma Bernardelli, Paolo Giudici, Giuseppe Toscani

TL;DR
This paper introduces a new family of Rank Graduation metrics for evaluating machine learning classification accuracy on high-dimensional ordinal data, addressing the challenge of lacking natural metrics.
Contribution
It develops a mathematical framework for comparing ordinal data using RGX metrics, linking them to existing metrics and demonstrating their effectiveness through extensive experiments.
Findings
RGX metrics effectively quantify explained variability in ordinal data
They demonstrate robustness and interpretability across diverse datasets
RGX metrics outperform traditional measures in certain classification scenarios
Abstract
Evaluating the reliability of machine learning classifications remains a fundamental challenge in Artificial Intelligence (AI), particularly when the target variable is multidimensional. Classification variables can be expressed by means of a categorical scale which, at best, is ordinal. Because ordinal data lack a natural metric structure in their underlying space, most conventional distance measures aimed at assessing the accuracy of machine learning classifications cannot be directly or meaningfully applied. In this paper, we develop a mathematical framework for comparing ordinal data based on a family of Rank Graduation \emph{metrics}. We demonstrate that these metrics can quantify the proportion of variability of the response explained by the predictions, in a similar manner as the predictive for continuous response variables. After establishing theoretical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Imbalanced Data Classification Techniques
