Protecting Feed-Forward Networks from Adversarial Attacks Using   Predictive Coding

Ehsan Ganjidoost; Jeff Orchard

arXiv:2411.00222·cs.CR·November 4, 2024

Protecting Feed-Forward Networks from Adversarial Attacks Using Predictive Coding

Ehsan Ganjidoost, Jeff Orchard

PDF

Open Access

TL;DR

This paper proposes using predictive coding networks as an auxiliary preprocessing step to defend feed-forward neural networks against adversarial attacks, significantly improving robustness on MNIST and CIFAR10 datasets.

Contribution

Introducing a novel adversarial defense method by integrating predictive coding networks into feed-forward models without altering the primary architecture.

Findings

01

82% robustness improvement on MNIST

02

65% robustness improvement on CIFAR10

03

Effective countermeasure using small training subset

Abstract

An adversarial example is a modified input image designed to cause a Machine Learning (ML) model to make a mistake; these perturbations are often invisible or subtle to human observers and highlight vulnerabilities in a model's ability to generalize from its training data. Several adversarial attacks can create such examples, each with a different perspective, effectiveness, and perceptibility of changes. Conversely, defending against such adversarial attacks improves the robustness of ML models in image processing and other domains of deep learning. Most defence mechanisms require either a level of model awareness, changes to the model, or access to a comprehensive set of adversarial examples during training, which is impractical. Another option is to use an auxiliary model in a preprocessing manner without changing the primary model. This study presents a practical and effective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSecurity in Wireless Sensor Networks · Adversarial Robustness in Machine Learning

MethodsSparse Evolutionary Training