Improving Generalization via Uncertainty Driven Perturbations

Matteo Pagliardini; Gilberto Manunza; Martin Jaggi; Michael I. Jordan,; Tatjana Chavdarova

arXiv:2202.05737·cs.LG·March 2, 2022

Improving Generalization via Uncertainty Driven Perturbations

Matteo Pagliardini, Gilberto Manunza, Martin Jaggi, Michael I. Jordan,, Tatjana Chavdarova

PDF

Open Access

TL;DR

This paper introduces uncertainty-driven perturbations (UDP) to enhance model generalization by increasing decision boundary margins, reducing simplicity bias, and improving robustness in both linear and nonlinear models.

Contribution

The paper proposes a novel UDP method that maximizes model uncertainty to improve margins and generalization, extending max-margin guarantees to linear models and demonstrating empirical benefits for nonlinear models.

Findings

01

UDP guarantees maximum margin on linear models.

02

UDP increases margins on challenging datasets.

03

UDP reduces simplicity bias and improves feature learning.

Abstract

Recently Shah et al., 2020 pointed out the pitfalls of the simplicity bias - the tendency of gradient-based algorithms to learn simple models - which include the model's high sensitivity to small input perturbations, as well as sub-optimal margins. In particular, while Stochastic Gradient Descent yields max-margin boundary on linear models, such guarantee does not extend to non-linear models. To mitigate the simplicity bias, we consider uncertainty-driven perturbations (UDP) of the training data points, obtained iteratively by following the direction that maximizes the model's estimated uncertainty. The uncertainty estimate does not rely on the input's label and it is highest at the decision boundary, and - unlike loss-driven perturbations - it allows for using a larger range of values for the perturbation magnitude. Furthermore, as real-world datasets have non-isotropic distances…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms