TL;DR
This paper introduces FRAT, an innovative algorithm that models and computes optimal randomized strategies for both defenders and attackers in adversarial neural network settings, ensuring convergence to equilibrium and demonstrating practical effectiveness.
Contribution
FRAT is the first algorithm to efficiently find mixed Nash equilibria in a continuous-time adversarial game with randomized strategies, using a novel flow on probability distributions.
Findings
FRAT converges to a mixed Nash equilibrium in theory.
Experimental results show FRAT's efficiency on CIFAR datasets.
FRAT effectively balances defense and attack strategies in adversarial settings.
Abstract
The vulnerability of deep neural network models to adversarial example attacks is a practical challenge in many artificial intelligence applications. A recent line of work shows that the use of randomization in adversarial training is the key to find optimal strategies against adversarial example attacks. However, in a fully randomized setting where both the defender and the attacker can use randomized strategies, there are no efficient algorithm for finding such an optimal strategy. To fill the gap, we propose the first algorithm of its kind, called FRAT, which models the problem with a new infinite-dimensional continuous-time flow on probability distribution spaces. FRAT maintains a lightweight mixture of models for the defender, with flexibility to efficiently update mixing weights and model parameters at each iteration. Furthermore, FRAT utilizes lightweight sampling subroutines to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
