A Perturbation Resistant Transformation and Classification System for Deep Neural Networks
Nathaniel Dean, Dilip Sarkar

TL;DR
This paper introduces a novel attack-agnostic transformation and ensemble classification system that enhances the robustness of deep neural networks against various adversarial attacks with minimal accuracy loss on clean data.
Contribution
It presents a new multi-pronged system combining feature kernel-based input transformations and ensemble voting, improving adversarial robustness of neural networks.
Findings
Improves robustness against white-box attacks on CIFAR10
Maintains high accuracy on clean images
Augments adversarially trained networks effectively
Abstract
Deep convolutional neural networks accurately classify a diverse range of natural images, but may be easily deceived when designed, imperceptible perturbations are embedded in the images. In this paper, we design a multi-pronged training, input transformation, and image ensemble system that is attack agnostic and not easily estimated. Our system incorporates two novel features. The first is a transformation layer that computes feature level polynomial kernels from class-level training data samples and iteratively updates input image copies at inference time based on their feature kernel differences to create an ensemble of transformed inputs. The second is a classification system that incorporates the prediction of the undefended network with a hard vote on the ensemble of filtered images. Our evaluations on the CIFAR10 dataset show our system improves the robustness of an undefended…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
