Adversarial Attacks against Deep Saliency Models
Zhaohui Che, Ali Borji, Guangtao Zhai, Suiyi Ling, Guodong Guo,, Patrick Le Callet

TL;DR
This paper introduces a novel sparse feature-space adversarial attack method against deep saliency models, revealing vulnerabilities and raising security concerns in high-level vision tasks.
Contribution
It is the first to propose a feature-space attack requiring limited model information, producing sparser, more subtle adversarial perturbations than traditional methods.
Findings
Deeper hidden layers yield higher attack success rates.
Sparser perturbations are generated by layers with larger receptive fields.
Different loss functions and layers produce diverse adversarial patterns.
Abstract
Currently, a plethora of saliency models based on deep neural networks have led great breakthroughs in many complex high-level vision tasks (e.g. scene description, object detection). The robustness of these models, however, has not yet been studied. In this paper, we propose a sparse feature-space adversarial attack method against deep saliency models for the first time. The proposed attack only requires a part of the model information, and is able to generate a sparser and more insidious adversarial perturbation, compared to traditional image-space attacks. These adversarial perturbations are so subtle that a human observer cannot notice their presences, but the model outputs will be revolutionized. This phenomenon raises security threats to deep saliency models in practical applications. We also explore some intriguing properties of the feature-space attack, e.g. 1) the hidden layers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Visual Attention and Saliency Detection
