Vulnerability of Deep Learning

Richard Kenway

arXiv:1803.06111·stat.ML·March 19, 2018

Vulnerability of Deep Learning

Richard Kenway

PDF

Open Access

TL;DR

This paper introduces a Renormalisation Group framework to evaluate and improve the robustness of deep learning networks against small input perturbations and adversarial attacks.

Contribution

It proposes a Monte Carlo RG-based numerical scheme to identify sensitive directions and relevant operators in trained deep networks, assessing their vulnerability.

Findings

01

Identifies input sensitivities linked to RG fixed points

02

Provides a method to detect and analyze adversarial vulnerabilities

03

Enables testing of network robustness against perturbations

Abstract

The Renormalisation Group (RG) provides a framework in which it is possible to assess whether a deep-learning network is sensitive to small changes in the input data and hence prone to error, or susceptible to adversarial attack. Distinct classification outputs are associated with different RG fixed points and sensitivity to small changes in the input data is due to the presence of relevant operators at a fixed point. A numerical scheme, based on Monte Carlo RG ideas, is proposed for identifying the existence of relevant operators and the corresponding directions of greatest sensitivity in the input data. Thus, a trained deep-learning network may be tested for its robustness and, if it is vulnerable to attack, dangerous perturbations of the input data identified.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning