Uncertainty-based Detection of Adversarial Attacks in Semantic Segmentation
Kira Maag, Asja Fischer

TL;DR
This paper introduces an uncertainty-based detection method for adversarial attacks in semantic segmentation that leverages output entropy differences without modifying the model, effective across various attack types.
Contribution
It presents a lightweight, post-processing approach using entropy-based uncertainty to detect adversarial attacks in semantic segmentation, filling a research gap in this area.
Findings
Effective detection across multiple attack types
No model modification required
Utilizes output entropy differences
Abstract
State-of-the-art deep neural networks have proven to be highly powerful in a broad range of tasks, including semantic image segmentation. However, these networks are vulnerable against adversarial attacks, i.e., non-perceptible perturbations added to the input image causing incorrect predictions, which is hazardous in safety-critical applications like automated driving. Adversarial examples and defense strategies are well studied for the image classification task, while there has been limited research in the context of semantic segmentation. First works however show that the segmentation outcome can be severely distorted by adversarial attacks. In this work, we introduce an uncertainty-based approach for the detection of adversarial attacks in semantic segmentation. We observe that uncertainty as for example captured by the entropy of the output distribution behaves differently on clean…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
