Chaos Theory and Adversarial Robustness
Jonathan S. Kent

TL;DR
This paper introduces a Chaos Theory-inspired metric called the susceptibility ratio to analyze and quantify neural network robustness against adversarial attacks, revealing depth-related vulnerabilities and enabling efficient robustness estimation.
Contribution
It presents a novel susceptibility ratio metric, analyzes its relation to model depth and robustness, and offers a practical method for approximating robustness radii in large models.
Findings
Susceptibility to attacks increases with model depth.
The susceptibility ratio correlates with post-attack accuracy.
Efficient approximation of robustness radii for large models is possible.
Abstract
Neural networks, being susceptible to adversarial attacks, should face a strict level of scrutiny before being deployed in critical or adversarial applications. This paper uses ideas from Chaos Theory to explain, analyze, and quantify the degree to which neural networks are susceptible to or robust against adversarial attacks. To this end, we present a new metric, the "susceptibility ratio," given by , which captures how greatly a model's output will be changed by perturbations to a given input. Our results show that susceptibility to attack grows significantly with the depth of the model, which has safety implications for the design of neural networks for production environments. We provide experimental evidence of the relationship between and the post-attack accuracy of classification models, as well as a discussion of its application to tasks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Machine Learning in Materials Science
