Towards Understanding Generalization of Macro-AUC in Multi-label Learning
Guoqiang Wu, Chongxuan Li, Yilong Yin

TL;DR
This paper investigates the theoretical generalization properties of Macro-AUC in multi-label learning, highlighting the impact of label imbalance and proposing new bounds and algorithms.
Contribution
It characterizes the generalization bounds of algorithms based on surrogate losses for Macro-AUC, emphasizing the role of label imbalance and introducing a new concentration inequality.
Findings
Univariate loss algorithms are more sensitive to label imbalance.
Pairwise and reweighted loss algorithms perform better under imbalance.
Empirical results support the theoretical analysis.
Abstract
Macro-AUC is the arithmetic mean of the class-wise AUCs in multi-label learning and is commonly used in practice. However, its theoretical understanding is far lacking. Toward solving it, we characterize the generalization properties of various learning algorithms based on the corresponding surrogate losses w.r.t. Macro-AUC. We theoretically identify a critical factor of the dataset affecting the generalization bounds: \emph{the label-wise class imbalance}. Our results on the imbalance-aware error bounds show that the widely-used univariate loss-based algorithm is more sensitive to the label-wise class imbalance than the proposed pairwise and reweighted loss-based ones, which probably implies its worse performance. Moreover, empirical results on various datasets corroborate our theory findings. To establish it, technically, we propose a new (and more general) McDiarmid-type…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Imbalanced Data Classification Techniques · Face and Expression Recognition
