Learning with Multiclass AUC: Theory and Algorithms
Zhiyong Yang, Qianqian Xu, Shilong Bao, Xiaochun Cao, Qingming Huang

TL;DR
This paper introduces a novel framework for optimizing multiclass AUC metrics, addressing class imbalance issues and providing theoretical guarantees, with practical acceleration methods and validation on real datasets.
Contribution
It proposes the first comprehensive approach for multiclass AUC optimization, including theoretical analysis, scalable algorithms, and empirical validation.
Findings
The framework asymptotically reaches the Bayes optimal scoring function.
It provides imbalance-aware generalization error bounds.
Experimental results show improved performance on real-world datasets.
Abstract
The Area under the ROC curve (AUC) is a well-known ranking metric for problems such as imbalanced learning and recommender systems. The vast majority of existing AUC-optimization-based machine learning methods only focus on binary-class cases, while leaving the multiclass cases unconsidered. In this paper, we start an early trial to consider the problem of learning multiclass scoring functions via optimizing multiclass AUC metrics. Our foundation is based on the M metric, which is a well-known multiclass extension of AUC. We first pay a revisit to this metric, showing that it could eliminate the imbalance issue from the minority class pairs. Motivated by this, we propose an empirical surrogate risk minimization framework to approximately optimize the M metric. Theoretically, we show that: (i) optimizing most of the popular differentiable surrogate losses suffices to reach the Bayes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Anomaly Detection Techniques and Applications · Advanced Statistical Process Monitoring
