Does Saliency-Based Training bring Robustness for Deep Neural Networks   in Image Classification?

Ali Karkehabadi

arXiv:2306.16581·cs.CV·June 30, 2023

Does Saliency-Based Training bring Robustness for Deep Neural Networks in Image Classification?

Ali Karkehabadi

PDF

Open Access

TL;DR

This paper investigates whether saliency-guided training improves the robustness of deep neural networks in image classification, finding that despite better visual explanations, such models are more vulnerable to adversarial attacks.

Contribution

The study evaluates the robustness of saliency-trained models against adversarial examples, revealing limitations despite improved interpretability.

Findings

01

Saliency-guided models have lower adversarial robustness.

02

Visual explanations do not necessarily correlate with model robustness.

03

Saliency training does not enhance resistance to adversarial attacks.

Abstract

Deep Neural Networks are powerful tools to understand complex patterns and making decisions. However, their black-box nature impedes a complete understanding of their inner workings. While online saliency-guided training methods try to highlight the prominent features in the model's output to alleviate this problem, it is still ambiguous if the visually explainable features align with robustness of the model against adversarial examples. In this paper, we investigate the saliency trained model's vulnerability to adversarial examples methods. Models are trained using an online saliency-guided training method and evaluated against popular algorithms of adversarial examples. We quantify the robustness and conclude that despite the well-explained visualizations in the model's output, the salient models suffer from the lower performance against adversarial examples attacks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)

MethodsALIGN