Protecting Feed-Forward Networks from Adversarial Attacks Using Predictive Coding
Ehsan Ganjidoost, Jeff Orchard

TL;DR
This paper proposes using predictive coding networks as an auxiliary preprocessing step to defend feed-forward neural networks against adversarial attacks, significantly improving robustness on MNIST and CIFAR10 datasets.
Contribution
Introducing a novel adversarial defense method by integrating predictive coding networks into feed-forward models without altering the primary architecture.
Findings
82% robustness improvement on MNIST
65% robustness improvement on CIFAR10
Effective countermeasure using small training subset
Abstract
An adversarial example is a modified input image designed to cause a Machine Learning (ML) model to make a mistake; these perturbations are often invisible or subtle to human observers and highlight vulnerabilities in a model's ability to generalize from its training data. Several adversarial attacks can create such examples, each with a different perspective, effectiveness, and perceptibility of changes. Conversely, defending against such adversarial attacks improves the robustness of ML models in image processing and other domains of deep learning. Most defence mechanisms require either a level of model awareness, changes to the model, or access to a comprehensive set of adversarial examples during training, which is impractical. Another option is to use an auxiliary model in a preprocessing manner without changing the primary model. This study presents a practical and effective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSecurity in Wireless Sensor Networks · Adversarial Robustness in Machine Learning
MethodsSparse Evolutionary Training
