Defending Adversarial Attacks via Semantic Feature Manipulation
Shuo Wang, Tianle Chen, Surya Nepal, Carsten Rudolph, Marthie Grobler,, Shangyu Chen

TL;DR
This paper introduces FM-Defense, a semantic feature manipulation method using a combo-variational autoencoder to detect and purify adversarial examples effectively across multiple datasets.
Contribution
It proposes a novel, attack-agnostic defense mechanism leveraging semantic feature disentanglement for adversarial detection and purification.
Findings
Detects nearly 100% of adversarial examples from various attacks.
Achieves over 99% purification accuracy on suspicious instances.
Demonstrates effectiveness across three datasets.
Abstract
Machine learning models have demonstrated vulnerability to adversarial attacks, more specifically misclassification of adversarial examples. In this paper, we propose a one-off and attack-agnostic Feature Manipulation (FM)-Defense to detect and purify adversarial examples in an interpretable and efficient manner. The intuition is that the classification result of a normal image is generally resistant to non-significant intrinsic feature changes, e.g., varying thickness of handwritten digits. In contrast, adversarial examples are sensitive to such changes since the perturbation lacks transferability. To enable manipulation of features, a combo-variational autoencoder is applied to learn disentangled latent codes that reveal semantic features. The resistance to classification change over the morphs, derived by varying and reconstructing latent codes, is used to detect suspicious inputs.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications
MethodsSolana Customer Service Number +1-833-534-1729
