Good Classification Measures and How to Find Them
Martijn G\"osgens, Anton Zhiyanov, Alexey Tikhonov, Liudmila, Prokhorenkova

TL;DR
This paper systematically analyzes various classification performance measures, defining desirable properties, proving an impossibility theorem, and proposing a new family of measures including Matthews Correlation Coefficient and Symmetric Balanced Accuracy.
Contribution
It introduces a formal framework for evaluating classification measures, proves an impossibility theorem, and proposes a new family of measures satisfying most desirable properties.
Findings
Identified which measures satisfy which properties
Proved an impossibility theorem for measure properties
Proposed a new family of classification measures
Abstract
Several performance measures can be used for evaluating classification results: accuracy, F-measure, and many others. Can we say that some of them are better than others, or, ideally, choose one measure that is best in all situations? To answer this question, we conduct a systematic analysis of classification performance measures: we formally define a list of desirable properties and theoretically analyze which measures satisfy which properties. We also prove an impossibility theorem: some desirable properties cannot be simultaneously satisfied. Finally, we propose a new family of measures satisfying all desirable properties except one. This family includes the Matthews Correlation Coefficient and a so-called Symmetric Balanced Accuracy that was not previously used in classification literature. We believe that our systematic approach gives an important tool to practitioners for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsImbalanced Data Classification Techniques · Advanced Statistical Methods and Models · Face and Expression Recognition
