Achieve Optimal Adversarial Accuracy for Adversarial Deep Learning using Stackelberg Game
Xiao-Shan Gao, Shuang Liu, Lijia Yu

TL;DR
This paper formulates adversarial deep learning as sequential Stackelberg games, proving the existence of equilibria and demonstrating that the resulting DNNs achieve optimal adversarial accuracy within their structure.
Contribution
It introduces a sequential game-theoretic framework for adversarial deep learning, establishing equilibrium existence and optimal robustness for DNNs.
Findings
Existence of Stackelberg equilibria for DNN-based adversarial games.
Equilibrium DNNs achieve the highest adversarial accuracy among similar models.
Trade-offs between robustness and accuracy are analyzed from a game theory perspective.
Abstract
Adversarial deep learning is to train robust DNNs against adversarial attacks, which is one of the major research focuses of deep learning. Game theory has been used to answer some of the basic questions about adversarial deep learning such as the existence of a classifier with optimal robustness and the existence of optimal adversarial samples for a given class of classifiers. In most previous work, adversarial deep learning was formulated as a simultaneous game and the strategy spaces are assumed to be certain probability distributions in order for the Nash equilibrium to exist. But, this assumption is not applicable to the practical situation. In this paper, we give answers to these basic questions for the practical case where the classifiers are DNNs with a given structure, by formulating the adversarial deep learning as sequential games. The existence of Stackelberg equilibria for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
