Consistent Classification Algorithms for Multi-class Non-Decomposable Performance Metrics
Harish G. Ramaswamy, Harikrishna Narasimhan, Shivani Agarwal

TL;DR
This paper develops a unified framework and efficient algorithms for learning classifiers optimized for multi-class non-decomposable performance metrics, such as macro F-measure and G-mean, ensuring consistency and polynomial runtime.
Contribution
It introduces a unified analysis framework and a polynomial-time, consistent learning algorithm for a broad class of multi-class non-decomposable metrics, extending binary case results.
Findings
Optimal classifiers can be obtained via cost-sensitive classification.
Proposed algorithm is consistent for concave multi-class metrics.
Algorithm runs in polynomial time relative to the number of classes.
Abstract
We study consistency of learning algorithms for a multi-class performance metric that is a non-decomposable function of the confusion matrix of a classifier and cannot be expressed as a sum of losses on individual data points; examples of such performance metrics include the macro F-measure popular in information retrieval and the G-mean metric used in class-imbalanced problems. While there has been much work in recent years in understanding the consistency properties of learning algorithms for `binary' non-decomposable metrics, little is known either about the form of the optimal classifier for a general multi-class non-decomposable metric, or about how these learning algorithms generalize to the multi-class case. In this paper, we provide a unified framework for analysing a multi-class non-decomposable performance metric, where the problem of finding the optimal classifier for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Imbalanced Data Classification Techniques · Machine Learning and Data Classification
