Monge SAM: Robust Reparameterization-Invariant Sharpness-Aware Minimization Based on Loss Geometry
Albert Kj{\o}ller Jacobsen, Georgios Arvanitidis

TL;DR
Monge SAM introduces a reparametrization-invariant sharpness-aware minimization method based on loss geometry, improving robustness and generalization in deep neural networks.
Contribution
It proposes a novel reparametrization-invariant version of SAM using a Riemannian metric, enhancing robustness and theoretical understanding.
Findings
M-SAM is computationally as efficient as SAM.
M-SAM interpolates between SAM and gradient descent.
Empirical results show improved robustness to hyperparameters.
Abstract
Recent studies on deep neural networks show that flat minima of the loss landscape correlate with improved generalization. Sharpness-aware minimization (SAM) efficiently finds flat regions by updating the parameters according to the gradient at an adversarial perturbation. The perturbation depends on the Euclidean metric, making SAM non-invariant under reparametrizations, which blurs sharpness and generalization. We propose Monge SAM (M-SAM), a reparametrization invariant version of SAM by considering a Riemannian metric in the parameter space induced naturally by the loss surface. Compared to previous approaches, M-SAM works under any modeling choice, relies only on mild assumptions while being as computationally efficient as SAM. We theoretically argue that M-SAM varies between SAM and gradient descent (GD), which increases robustness to hyperparameter selection and reduces attraction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
MethodsSharpness-Aware Minimization · Segment Anything Model
