Efficient Defenses Against Adversarial Attacks
Valentina Zantedeschi, Maria-Irina Nicolae, Ambrish Rawat

TL;DR
This paper introduces a new, easy-to-implement defense mechanism for deep neural networks that enhances robustness against adversarial attacks without significant training overhead.
Contribution
The proposed method improves DNN robustness by reinforcing model structure, outperforming existing defenses, and maintaining efficiency and accuracy on clean data.
Findings
Effective against multiple attack types
Outperforms state-of-the-art defenses
Minimal training overhead
Abstract
Following the recent adoption of deep neural networks (DNN) accross a wide range of applications, adversarial attacks against these models have proven to be an indisputable threat. Adversarial samples are crafted with a deliberate intention of undermining a system. In the case of DNNs, the lack of better understanding of their working has prevented the development of efficient defenses. In this paper, we propose a new defense method based on practical observations which is easy to integrate into models and performs better than state-of-the-art defenses. Our proposed solution is meant to reinforce the structure of a DNN, making its prediction more stable and less likely to be fooled by adversarial samples. We conduct an extensive experimental study proving the efficiency of our method against multiple attacks, comparing it to numerous defenses, both in white-box and black-box setups.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
