Mind the (optimality) Gap: A Gap-Aware Learning Rate Scheduler for Adversarial Nets
Hussein Hazimeh, Natalia Ponomareva

TL;DR
This paper introduces a novel adaptive learning rate scheduler for adversarial nets that maintains a balanced training process, reduces tuning efforts, and improves model quality in tasks like image generation and domain adaptation.
Contribution
The paper proposes a gap-aware learning rate scheduler that dynamically adjusts the adversary's learning rate based on the loss of an ideal adversarial network, enhancing training stability and performance.
Findings
Scheduler reduces training divergence in adversarial nets.
Requires less hyperparameter tuning, saving time and resources.
Achieves up to 27% improvement in image quality and 3% in domain accuracy.
Abstract
Adversarial nets have proved to be powerful in various domains including generative modeling (GANs), transfer learning, and fairness. However, successfully training adversarial nets using first-order methods remains a major challenge. Typically, careful choices of the learning rates are needed to maintain the delicate balance between the competing networks. In this paper, we design a novel learning rate scheduler that dynamically adapts the learning rate of the adversary to maintain the right balance. The scheduler is driven by the fact that the loss of an ideal adversarial net is a constant known a priori. The scheduler is thus designed to keep the loss of the optimized adversarial net close to that of an ideal network. We run large-scale experiments to study the effectiveness of the scheduler on two popular applications: GANs for image generation and adversarial nets for domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
MethodsTest
