DARCCC: Detecting Adversaries by Reconstruction from Class Conditional Capsules
Nicholas Frosst, Sara Sabour, Geoffrey Hinton

TL;DR
This paper introduces a capsule-based method for detecting adversarial images by using reconstruction errors, demonstrating high effectiveness across datasets and resilience against certain attacks.
Contribution
The paper proposes a novel adversarial detection technique using capsule models' reconstruction errors, extending to CNNs and analyzing robustness against white-box attacks.
Findings
Reconstruction error effectively detects adversarial images in multiple datasets.
The technique works well with CNNs trained for reconstruction from hidden layers.
White-box attacks can fool detection but require the adversarial image to resemble the target class.
Abstract
We present a simple technique that allows capsule models to detect adversarial images. In addition to being trained to classify images, the capsule model is trained to reconstruct the images from the pose parameters and identity of the correct top-level capsule. Adversarial images do not look like a typical member of the predicted class and they have much larger reconstruction errors when the reconstruction is produced from the top-level capsule for that class. We show that setting a threshold on the distance between the input image and its reconstruction from the winning capsule is very effective at detecting adversarial images for three different datasets. The same technique works quite well for CNNs that have been trained to reconstruct the image from all or part of the last hidden layer before the softmax. We then explore a stronger, white-box attack that takes the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Bacillus and Francisella bacterial research
