Built-in Vulnerabilities to Imperceptible Adversarial Perturbations

Thomas Tanay; Jerone T. A. Andrews; Lewis D. Griffin

arXiv:1806.07409·cs.CV·May 9, 2019·6 cites

Built-in Vulnerabilities to Imperceptible Adversarial Perturbations

Thomas Tanay, Jerone T. A. Andrews, Lewis D. Griffin

PDF

Open Access

TL;DR

This paper demonstrates that making neural networks more vulnerable to imperceptible adversarial attacks is surprisingly straightforward, revealing new insights into the nature of adversarial examples and potential security risks.

Contribution

It introduces a generic tilting attack that injects vulnerabilities into pre-trained models and demonstrates how minimal poisoning can induce backdoor vulnerabilities.

Findings

01

Vulnerabilities can be injected without affecting test performance.

02

A tilting attack increases sensitivity to low-variance data components.

03

Low poisoning rates can induce backdoor vulnerabilities in state-of-the-art networks.

Abstract

Designing models that are robust to small adversarial perturbations of their inputs has proven remarkably difficult. In this work we show that the reverse problem---making models more vulnerable---is surprisingly easy. After presenting some proofs of concept on MNIST, we introduce a generic tilting attack that injects vulnerabilities into the linear layers of pre-trained networks by increasing their sensitivity to components of low variance in the training data without affecting their performance on test data. We illustrate this attack on a multilayer perceptron trained on SVHN and use it to design a stand-alone adversarial module which we call a steganogram decoder. Finally, we show on CIFAR-10 that a poisoning attack with a poisoning rate as low as 0.1% can induce vulnerabilities to chosen imperceptible backdoor signals in state-of-the-art networks. Beyond their practical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications