Improving Generalization via Uncertainty Driven Perturbations
Matteo Pagliardini, Gilberto Manunza, Martin Jaggi, Michael I. Jordan,, Tatjana Chavdarova

TL;DR
This paper introduces uncertainty-driven perturbations (UDP) to enhance model generalization by increasing decision boundary margins, reducing simplicity bias, and improving robustness in both linear and nonlinear models.
Contribution
The paper proposes a novel UDP method that maximizes model uncertainty to improve margins and generalization, extending max-margin guarantees to linear models and demonstrating empirical benefits for nonlinear models.
Findings
UDP guarantees maximum margin on linear models.
UDP increases margins on challenging datasets.
UDP reduces simplicity bias and improves feature learning.
Abstract
Recently Shah et al., 2020 pointed out the pitfalls of the simplicity bias - the tendency of gradient-based algorithms to learn simple models - which include the model's high sensitivity to small input perturbations, as well as sub-optimal margins. In particular, while Stochastic Gradient Descent yields max-margin boundary on linear models, such guarantee does not extend to non-linear models. To mitigate the simplicity bias, we consider uncertainty-driven perturbations (UDP) of the training data points, obtained iteratively by following the direction that maximizes the model's estimated uncertainty. The uncertainty estimate does not rely on the input's label and it is highest at the decision boundary, and - unlike loss-driven perturbations - it allows for using a larger range of values for the perturbation magnitude. Furthermore, as real-world datasets have non-isotropic distances…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms
