Stochastic AUC Maximization with Deep Neural Networks
Mingrui Liu, Zhuoning Yuan, Yiming Ying, Tianbao Yang

TL;DR
This paper extends stochastic AUC maximization to deep neural networks, proposing new algorithms under the Polyak-ojasiewicz condition that improve convergence and effectiveness for imbalanced data classification.
Contribution
It introduces a novel non-convex min-max formulation for deep neural network-based stochastic AUC maximization and develops faster algorithms leveraging the PL condition.
Findings
Proposed algorithms achieve faster convergence rates.
Experimental results validate the effectiveness of the methods.
Algorithms outperform existing approaches in imbalanced data tasks.
Abstract
Stochastic AUC maximization has garnered an increasing interest due to better fit to imbalanced data classification. However, existing works are limited to stochastic AUC maximization with a linear predictive model, which restricts its predictive power when dealing with extremely complex data. In this paper, we consider stochastic AUC maximization problem with a deep neural network as the predictive model. Building on the saddle point reformulation of a surrogated loss of AUC, the problem can be cast into a {\it non-convex concave} min-max problem. The main contribution made in this paper is to make stochastic AUC maximization more practical for deep neural networks and big data with theoretical insights as well. In particular, we propose to explore Polyak-\L{}ojasiewicz (PL) condition that has been proved and observed in deep learning, which enables us to develop new stochastic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques
