Robust Image Classification in the Presence of Out-of-Distribution and Adversarial Samples Using Attractors in Neural Networks
Nasrin Alipour, Seyyed Ali SeyyedSalehi

TL;DR
This paper introduces a neural network approach that uses training samples as attractors to improve robustness against out-of-distribution and adversarial samples, maintaining high accuracy even under severe perturbations.
Contribution
The study proposes a novel neural network method that enhances robustness to OOD and adversarial samples by training with attractors, outperforming existing methods under high perturbation levels.
Findings
Achieves 87.13% accuracy on highly perturbed MNIST adversarial examples.
Distinguishes OOD samples from MNIST with over 98% accuracy.
Maintains high robustness with minimal performance drop under severe attacks.
Abstract
The proper handling of out-of-distribution (OOD) samples in deep classifiers is a critical concern for ensuring the suitability of deep neural networks in safety-critical systems. Existing approaches developed for robust OOD detection in the presence of adversarial attacks lose their performance by increasing the perturbation levels. This study proposes a method for robust classification in the presence of OOD samples and adversarial attacks with high perturbation levels. The proposed approach utilizes a fully connected neural network that is trained to use training samples as its attractors, enhancing its robustness. This network has the ability to classify inputs and identify OOD samples as well. To evaluate this method, the network is trained on the MNIST dataset, and its performance is tested on adversarial examples. The results indicate that the network maintains its performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Digital Media Forensic Detection
