PACMAN: PAC-style bounds accounting for the Mismatch between Accuracy and Negative log-loss
Matias Vera, Leonardo Rey Vega, Pablo Piantanida

TL;DR
This paper introduces PACMAN, a PAC-style framework that accounts for the mismatch between training loss (negative log-loss) and testing metric (accuracy) in classification, providing new bounds on generalization error.
Contribution
It develops a novel point-wise PAC analysis that explicitly considers the loss mismatch using likelihood ratios and information-theoretic bounds.
Findings
Derived PAC bounds that incorporate loss mismatch.
Comparison shows improved insights over traditional generalization bounds.
Provides a theoretical foundation for understanding accuracy and loss discrepancy.
Abstract
The ultimate performance of machine learning algorithms for classification tasks is usually measured in terms of the empirical error probability (or accuracy) based on a testing dataset. Whereas, these algorithms are optimized through the minimization of a typically different--more convenient--loss function based on a training set. For classification tasks, this loss function is often the negative log-loss that leads to the well-known cross-entropy risk which is typically better behaved (from a numerical perspective) than the error probability. Conventional studies on the generalization error do not usually take into account the underlying mismatch between losses at training and testing phases. In this work, we introduce an analysis based on point-wise PAC approach over the generalization gap considering the mismatch of testing based on the accuracy metric and training on the negative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Fault Detection and Control Systems
