Moment Alignment: Unifying Gradient and Hessian Matching for Domain Generalization
Yuen Chen, Haozhe Si, Guojun Zhang, Han Zhao

TL;DR
This paper introduces a theoretical framework for domain generalization based on moment alignment, unifying existing approaches and proposing a computationally efficient algorithm that improves generalization across unseen domains.
Contribution
The paper develops a new theory of moment alignment for domain generalization, unifies previous methods, and introduces CMA, a closed-form algorithm that enhances efficiency and effectiveness.
Findings
CMA outperforms ERM and state-of-the-art methods in experiments.
Moment alignment unifies invariant risk minimization, gradient, and Hessian matching.
Theoretical bounds relate domain alignment to transfer measure and generalization.
Abstract
Domain generalization (DG) seeks to develop models that generalize well to unseen target domains, addressing the prevalent issue of distribution shifts in real-world applications. One line of research in DG focuses on aligning domain-level gradients and Hessians to enhance generalization. However, existing methods are computationally inefficient and the underlying principles of these approaches are not well understood. In this paper, we develop the theory of moment alignment for DG. Grounded in \textit{transfer measure}, a principled framework for quantifying generalizability between two domains, we first extend the definition of transfer measure to domain generalization that includes multiple source domains and establish a target error bound. Then, we prove that aligning derivatives across domains improves transfer measure both when the feature extractor induces an invariant optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Advanced Graph Neural Networks
