Efficient Defenses Against Adversarial Attacks

Valentina Zantedeschi; Maria-Irina Nicolae; Ambrish Rawat

arXiv:1707.06728·cs.LG·August 31, 2017

Efficient Defenses Against Adversarial Attacks

Valentina Zantedeschi, Maria-Irina Nicolae, Ambrish Rawat

PDF

TL;DR

This paper introduces a new, easy-to-implement defense mechanism for deep neural networks that enhances robustness against adversarial attacks without significant training overhead.

Contribution

The proposed method improves DNN robustness by reinforcing model structure, outperforming existing defenses, and maintaining efficiency and accuracy on clean data.

Findings

01

Effective against multiple attack types

02

Outperforms state-of-the-art defenses

03

Minimal training overhead

Abstract

Following the recent adoption of deep neural networks (DNN) accross a wide range of applications, adversarial attacks against these models have proven to be an indisputable threat. Adversarial samples are crafted with a deliberate intention of undermining a system. In the case of DNNs, the lack of better understanding of their working has prevented the development of efficient defenses. In this paper, we propose a new defense method based on practical observations which is easy to integrate into models and performs better than state-of-the-art defenses. Our proposed solution is meant to reinforce the structure of a DNN, making its prediction more stable and less likely to be fooled by adversarial samples. We conduct an extensive experimental study proving the efficiency of our method against multiple attacks, comparing it to numerous defenses, both in white-box and black-box setups.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.