X-SAM: Boosting Sharpness-Aware Minimization with Dominant-Eigenvector Gradient Correction
Hongru Duan, Yongle Chen, Lei Guan

TL;DR
X-SAM enhances Sharpness-Aware Minimization by correcting gradients along the dominant eigenvector of the Hessian, leading to better regularization, convergence, and generalization in training deep models.
Contribution
We introduce X-SAM, a novel method that explicitly aligns gradient updates with the Hessian's dominant eigenvector to improve SAM's effectiveness.
Findings
X-SAM achieves faster convergence in training.
X-SAM improves model generalization performance.
Theoretical analysis confirms X-SAM's convergence properties.
Abstract
Sharpness-Aware Minimization (SAM) aims to improve generalization by minimizing a worst-case perturbed loss over a small neighborhood of model parameters. However, during training, its optimization behavior does not always align with theoretical expectations, since both sharp and flat regions may yield a small perturbed loss. In such cases, the gradient may still point toward sharp regions, failing to achieve the intended effect of SAM. To address this issue, we investigate SAM from a spectral and geometric perspective: specifically, we utilize the angle between the gradient and the leading eigenvector of the Hessian as a measure of sharpness. Our analysis illustrates that when this angle is less than or equal to ninety degrees, the effect of SAM's sharpness regularization can be weakened. Furthermore, we propose an explicit eigenvector-aligned SAM (X-SAM), which corrects the gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques
