SAD: Saliency-based Defenses Against Adversarial Examples
Richard Tran, David Patrick, Michael Geyer, Amanda Fernandez

TL;DR
This paper introduces a saliency-based defense mechanism against adversarial attacks on neural networks, focusing on preserving important image regions while reducing perturbations to improve robustness.
Contribution
The work presents a novel saliency-guided data cleaning method that targets important image regions to defend against adversarial attacks more effectively.
Findings
Significant improvements in saliency metrics compared to existing defenses.
Effective reduction of adversarial perturbations while preserving key image features.
Enhanced robustness of models against multiple attack methods.
Abstract
With the rise in popularity of machine and deep learning models, there is an increased focus on their vulnerability to malicious inputs. These adversarial examples drift model predictions away from the original intent of the network and are a growing concern in practical security. In order to combat these attacks, neural networks can leverage traditional image processing approaches or state-of-the-art defensive models to reduce perturbations in the data. Defensive approaches that take a global approach to noise reduction are effective against adversarial attacks, however their lossy approach often distorts important data within the image. In this work, we propose a visual saliency based approach to cleaning data affected by an adversarial attack. Our model leverages the salient regions of an adversarial image in order to provide a targeted countermeasure while comparatively reducing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications
