Toward Robust Image Classification
Basemah Alshemali, Alta Graham, Jugal Kalita

TL;DR
This paper presents a model combining dropout randomization and preprocessing for adversarial image detection, achieving high accuracy on MNIST against various attack methods, enhancing robustness of neural network classifiers.
Contribution
The paper introduces a novel adversarial detection model using dropout and preprocessing, outperforming recent methods on MNIST dataset.
Findings
Achieved 97% adversarial detection accuracy.
Maintained 99% classification accuracy after discarding adversarial images.
Outperformed recent similar techniques in detection accuracy.
Abstract
Neural networks are frequently used for image classification, but can be vulnerable to misclassification caused by adversarial images. Attempts to make neural network image classification more robust have included variations on preprocessing (cropping, applying noise, blurring), adversarial training, and dropout randomization. In this paper, we implemented a model for adversarial detection based on a combination of two of these techniques: dropout randomization with preprocessing applied to images within a given Bayesian uncertainty. We evaluated our model on the MNIST dataset, using adversarial images generated using Fast Gradient Sign Method (FGSM), Jacobian-based Saliency Map Attack (JSMA) and Basic Iterative Method (BIM) attacks. Our model achieved an average adversarial image detection accuracy of 97%, with an average image classification accuracy, after discarding images flagged…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Anomaly Detection Techniques and Applications
MethodsDropout
