Gradient-Guided Annealing for Domain Generalization
Aristotelis Ballas, Christos Diou

TL;DR
This paper introduces Gradient-Guided Annealing (GGA), a novel training method that aligns gradients across domains during early training to improve model robustness and generalization to unseen data distributions.
Contribution
The paper proposes GGA, a new algorithm that enhances domain generalization by iteratively annealing model parameters to align gradients across domains, leading to better robustness.
Findings
GGA achieves state-of-the-art results on five benchmarks.
GGA improves existing domain generalization algorithms when combined.
GGA enhances robustness against domain shifts.
Abstract
Domain Generalization (DG) research has gained considerable traction as of late, since the ability to generalize to unseen data distributions is a requirement that eludes even state-of-the-art training algorithms. In this paper we observe that the initial iterations of model training play a key role in domain generalization effectiveness, since the loss landscape may be significantly different across the training and test distributions, contrary to the case of i.i.d. data. Conflicts between gradients of the loss components of each domain lead the optimization procedure to undesirable local minima that do not capture the domain-invariant features of the target classes. We propose alleviating domain conflicts in model optimization, by iteratively annealing the parameters of a model in the early stages of training and searching for points where gradients align between domains. By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Text and Document Classification Technologies
MethodsSparse Evolutionary Training · ALIGN
