Flatness-Aware Minimization for Domain Generalization
Xingxuan Zhang, Renzhe Xu, Han Yu, Yancheng Dong, Pengfei Tian, Peng, Cu

TL;DR
This paper introduces FAD, a flatness-aware optimization method for domain generalization that improves model robustness by optimizing loss landscape flatness, supported by theoretical analysis and superior experimental results.
Contribution
Proposes FAD, a novel flatness-aware optimization approach for domain generalization, with theoretical analysis and demonstrated effectiveness over existing methods.
Findings
FAD outperforms baseline methods on various DG datasets.
FAD finds flatter minima compared to other flatness-aware optimizers.
Theoretical bounds on FAD's generalization error are established.
Abstract
Domain generalization (DG) seeks to learn robust models that generalize well under unknown distribution shifts. As a critical aspect of DG, optimizer selection has not been explored in depth. Currently, most DG methods follow the widely used benchmark, DomainBed, and utilize Adam as the default optimizer for all datasets. However, we reveal that Adam is not necessarily the optimal choice for the majority of current DG methods and datasets. Based on the perspective of loss landscape flatness, we propose a novel approach, Flatness-Aware Minimization for Domain Generalization (FAD), which can efficiently optimize both zeroth-order and first-order flatness simultaneously for DG. We provide theoretical analyses of the FAD's out-of-distribution (OOD) generalization error and convergence. Our experimental results demonstrate the superiority of FAD on various DG datasets. Additionally, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Neonatal and fetal brain pathology · Multimodal Machine Learning Applications
MethodsAdam
