Harnessing the Vulnerability of Latent Layers in Adversarially Trained   Models

Mayank Singh; Abhishek Sinha; Nupur Kumari; Harshitha Machiraju,; Balaji Krishnamurthy; Vineeth N Balasubramanian

arXiv:1905.05186·cs.LG·June 27, 2019·23 cites

Harnessing the Vulnerability of Latent Layers in Adversarially Trained Models

Mayank Singh, Abhishek Sinha, Nupur Kumari, Harshitha Machiraju,, Balaji Krishnamurthy, Vineeth N Balasubramanian

PDF

Open Access 1 Repo

TL;DR

This paper investigates the vulnerability of latent layers in adversarially trained neural networks, revealing their susceptibility to attacks and proposing a new training method to enhance robustness at the feature level.

Contribution

It introduces Latent Adversarial Training (LAT), a novel fine-tuning approach targeting latent layer robustness, and Latent Attack (LA), a new adversarial example construction algorithm.

Findings

01

LAT improves adversarial accuracy against PGD attacks

02

Latent layers are more vulnerable than input layers in robust models

03

Achieves state-of-the-art results on MNIST, CIFAR-10, CIFAR-100 datasets

Abstract

Neural networks are vulnerable to adversarial attacks -- small visually imperceptible crafted noise which when added to the input drastically changes the output. The most effective method of defending against these adversarial attacks is to use the methodology of adversarial training. We analyze the adversarially trained robust models to study their vulnerability against adversarial attacks at the level of the latent layers. Our analysis reveals that contrary to the input layer which is robust to adversarial attack, the latent layer of these robust models are highly susceptible to adversarial perturbations of small magnitude. Leveraging this information, we introduce a new technique Latent Adversarial Training (LAT) which comprises of fine-tuning the adversarially trained models to ensure the robustness at the feature layers. We also propose Latent Attack (LA), a novel algorithm for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

msingh27/LAT_adversarial_robustness
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications