Model Generalization: A Sharpness Aware Optimization Perspective

Jozef Marus Coldenhoff; Chengkun Li; Yurui Zhu

arXiv:2208.06915·cs.LG·August 16, 2022·1 cites

Model Generalization: A Sharpness Aware Optimization Perspective

Jozef Marus Coldenhoff, Chengkun Li, Yurui Zhu

PDF

Open Access 1 Repo

TL;DR

This paper investigates how sharpness-aware optimization methods like SAM and ASAM enhance model generalization, demonstrating their effectiveness through experiments and highlighting areas for further research.

Contribution

The paper proposes three experiments to validate the generalization benefits of sharpness-aware optimization techniques, especially focusing on ASAM's performance on un-normalized data.

Findings

01

Sharpness-aware methods improve model generalization.

02

ASAM enhances performance on un-normalized data.

03

Further research needed for ASAM's robustness.

Abstract

Sharpness-Aware Minimization (SAM) and adaptive sharpness-aware minimization (ASAM) aim to improve the model generalization. And in this project, we proposed three experiments to valid their generalization from the sharpness aware perspective. And our experiments show that sharpness aware-based optimization techniques could help to provide models with strong generalization ability. Our experiments also show that ASAM could improve the generalization performance on un-normalized data, but further research is needed to confirm this.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CharlieLeee/Sharpness-Aware-Generalization
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Machine Learning and Data Classification

MethodsAttentive Walk-Aggregating Graph Neural Network · Sharpness-Aware Minimization