Domain Adversarial Training: A Game Perspective
David Acuna, Marc T Law, Guojun Zhang, Sanja Fidler

TL;DR
This paper reinterprets domain-adversarial training as a game, identifies convergence issues with gradient descent, and proposes high-order ODE solvers like Runge-Kutta to improve stability and transfer performance.
Contribution
It introduces a game-theoretic analysis of domain-adversarial training and proposes high-order ODE solvers as a drop-in replacement for gradient descent to enhance convergence and performance.
Findings
Up to 3.5% accuracy improvement in domain adaptation tasks
High-order ODE solvers provide more stable training
Faster convergence with fewer training iterations
Abstract
The dominant line of work in domain adaptation has focused on learning invariant representations using domain-adversarial training. In this paper, we interpret this approach from a game theoretical perspective. Defining optimal solutions in domain-adversarial training as a local Nash equilibrium, we show that gradient descent in domain-adversarial training can violate the asymptotic convergence guarantees of the optimizer, oftentimes hindering the transfer performance. Our analysis leads us to replace gradient descent with high-order ODE solvers (i.e., Runge-Kutta), for which we derive asymptotic convergence guarantees. This family of optimizers is significantly more stable and allows more aggressive learning rates, leading to high performance gains when used as a drop-in replacement over standard optimizers. Our experiments show that in conjunction with state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM
