Enhancing Sharpness-Aware Optimization Through Variance Suppression

Bingcong Li; Georgios B. Giannakis

arXiv:2309.15639·cs.LG·December 25, 2023·2 cites

Enhancing Sharpness-Aware Optimization Through Variance Suppression

Bingcong Li, Georgios B. Giannakis

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces VaSSO, a variance suppression technique that stabilizes sharpness-aware optimization, improving generalization and robustness of deep neural networks beyond existing methods like SAM.

Contribution

It proposes a novel variance suppression approach to enhance the stability and effectiveness of sharpness-aware minimization in deep learning.

Findings

01

VaSSO improves model generalization over SAM.

02

VaSSO enhances robustness against label noise.

03

Experiments show numerical improvements in image classification and translation.

Abstract

Sharpness-aware minimization (SAM) has well documented merits in enhancing generalization of deep neural networks, even without sizable data augmentation. Embracing the geometry of the loss function, where neighborhoods of 'flat minima' heighten generalization ability, SAM seeks 'flat valleys' by minimizing the maximum loss caused by an adversary perturbing parameters within the neighborhood. Although critical to account for sharpness of the loss function, such an 'over-friendly adversary' can curtail the outmost level of generalization. The novel approach of this contribution fosters stabilization of adversaries through variance suppression (VaSSO) to avoid such friendliness. VaSSO's provable stability safeguards its numerical improvement over SAM in model-agnostic tasks, including image classification and machine translation. In addition, experiments confirm that VaSSO endows SAM with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bingcongli/vasso
pytorchOfficial

Videos

Enhancing Sharpness-Aware Optimization Through Variance Suppression· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Machine Learning and Data Classification

MethodsSegment Anything Model