Memory Defense: More Robust Classification via a Memory-Masking Autoencoder
Eashan Adhikarla (1), Dan Luo (1), Brian D. Davison (1) ((1) Lehigh, University)

TL;DR
Memory Defense introduces a memory-masking autoencoder to create class-specific latent representations, significantly improving robustness of classifiers against adversarial attacks on datasets like Fashion-MNIST and CIFAR-10.
Contribution
The paper presents a novel memory-masking autoencoder framework that enhances classifier robustness by learning independent class-specific latent representations.
Findings
Outperforms existing defenses against four attack types
Demonstrates improved robustness on Fashion-MNIST and CIFAR-10
Validates effectiveness through extensive experiments
Abstract
Many deep neural networks are susceptible to minute perturbations of images that have been carefully crafted to cause misclassification. Ideally, a robust classifier would be immune to small variations in input images, and a number of defensive approaches have been created as a result. One method would be to discern a latent representation which could ignore small changes to the input. However, typical autoencoders easily mingle inter-class latent representations when there are strong similarities between classes, making it harder for a decoder to accurately project the image back to the original high-dimensional space. We propose a novel framework, Memory Defense, an augmented classifier with a memory-masking autoencoder to counter this challenge. By masking other classes, the autoencoder learns class-specific independent latent representations. We test the model's robustness against…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
