Meta Curvature-Aware Minimization for Domain Generalization
Ziyang Chen, Yiwen Ye, Feilong Tang, Yongsheng Pan, and Yong Xia

TL;DR
This paper introduces Meta Curvature-Aware Minimization (MeCAM), a novel training method that encourages models to converge to flatter minima, thereby improving domain generalization performance across multiple benchmark datasets.
Contribution
The paper proposes a new curvature metric and an algorithm, MeCAM, that jointly minimizes training loss, SAM surrogate gap, and meta-learning surrogate gap for better domain generalization.
Findings
MeCAM outperforms existing DG methods on five benchmark datasets.
Theoretical analysis shows improved generalization error and convergence rate.
Empirical results demonstrate superior model robustness and accuracy.
Abstract
Domain generalization (DG) aims to enhance the ability of models trained on source domains to generalize effectively to unseen domains. Recently, Sharpness-Aware Minimization (SAM) has shown promise in this area by reducing the sharpness of the loss landscape to obtain more generalized models. However, SAM and its variants sometimes fail to guide the model toward a flat minimum, and their training processes exhibit limitations, hindering further improvements in model generalization. In this paper, we first propose an improved model training process aimed at encouraging the model to converge to a flat minima. To achieve this, we design a curvature metric that has a minimal effect when the model is far from convergence but becomes increasingly influential in indicating the curvature of the minima as the model approaches a local minimum. Then we derive a novel algorithm from this metric,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Analysis Techniques
MethodsSharpness-Aware Minimization · Segment Anything Model
