Towards Understanding the Role of Sharpness-Aware Minimization Algorithms for Out-of-Distribution Generalization
Samuel Schapiro, Han Zhao

TL;DR
This paper investigates the effectiveness of Sharpness-Aware Minimization (SAM) algorithms for out-of-distribution generalization, providing empirical comparisons, theoretical bounds, and insights into their performance in various OOD settings.
Contribution
It offers a comprehensive comparison of SAM variants for OOD generalization, introduces theoretical bounds, and explores SAM's role in gradual domain adaptation.
Findings
Original SAM outperforms Adam baseline by 4.76% in zero-shot OOD
Strongest SAM variants outperform Adam by 8.01% in zero-shot OOD
SAM outperforms Adam by 0.82% on average in GDA setting
Abstract
Recently, sharpness-aware minimization (SAM) has emerged as a promising method to improve generalization by minimizing sharpness, which is known to correlate well with generalization ability. Since the original proposal of SAM, many variants of SAM have been proposed to improve its accuracy and efficiency, but comparisons have mainly been restricted to the i.i.d. setting. In this paper we study SAM for out-of-distribution (OOD) generalization. First, we perform a comprehensive comparison of eight SAM variants on zero-shot OOD generalization, finding that the original SAM outperforms the Adam baseline by and the strongest SAM variants outperform the Adam baseline by on average. We then provide an OOD generalization bound in terms of sharpness for this setting. Next, we extend our study of SAM to the related setting of gradual domain adaptation (GDA), another form of OOD…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Reservoir Engineering and Simulation Methods · Medical Image Segmentation Techniques
MethodsSharpness-Aware Minimization · Segment Anything Model · Adam
