Directly and Efficiently Optimizing Prediction Error and AUC of Linear Classifiers
Hiva Ghanbari, Katya Scheinberg

TL;DR
This paper introduces a gradient-based optimization method for directly maximizing prediction accuracy and AUC of linear classifiers, leveraging closed-form expressions under normality assumptions to improve efficiency and effectiveness.
Contribution
It derives closed-form expressions for expected error and AUC for linear predictors under normal data, enabling direct and efficient optimization.
Findings
The method achieves high-quality solutions with low computational complexity.
It performs well even with non-normal data due to empirical moment approximation.
The approach outperforms traditional surrogate-based optimization methods.
Abstract
The predictive quality of machine learning models is typically measured in terms of their (approximate) expected prediction error or the so-called Area Under the Curve (AUC) for a particular data distribution. However, when the models are constructed by the means of empirical risk minimization, surrogate functions such as the logistic loss are optimized instead. This is done because the empirical approximations of the expected error and AUC functions are nonconvex and nonsmooth, and more importantly have zero derivative almost everywhere. In this work, we show that in the case of linear predictors, and under the assumption that the data has normal distribution, the expected error and the expected AUC are not only smooth, but have closed form expressions, which depend on the first and second moments of the normal distribution. Hence, we derive derivatives of these two functions and use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Face and Expression Recognition
MethodsLogistic Regression
