Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach
Simiao Zuo, Chen Liang, Haoming Jiang, Xiaodong Liu, Pengcheng He,, Jianfeng Gao, Weizhu Chen, Tuo Zhao

TL;DR
This paper introduces SALT, a novel Stackelberg game formulation for adversarial regularization in deep learning, using unrolled optimization to improve model generalization in NLP tasks.
Contribution
It proposes a leader-follower Stackelberg game approach for adversarial regularization, capturing strategic interactions with unrolled optimization, outperforming existing methods.
Findings
SALT outperforms existing adversarial regularization methods on NLP tasks.
The Stackelberg formulation improves model fitting to unperturbed data.
Unrolled optimization effectively captures the leader's strategic advantage.
Abstract
Adversarial regularization has been shown to improve the generalization performance of deep learning models in various natural language processing tasks. Existing works usually formulate the method as a zero-sum game, which is solved by alternating gradient descent/ascent algorithms. Such a formulation treats the adversarial and the defending players equally, which is undesirable because only the defending player contributes to the generalization performance. To address this issue, we propose Stackelberg Adversarial Regularization (SALT), which formulates adversarial regularization as a Stackelberg game. This formulation induces a competition between a leader and a follower, where the follower generates perturbations, and the leader trains the model subject to the perturbations. Different from conventional approaches, in SALT, the leader is in an advantageous position. When the leader…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
