Towards adversarial robustness with 01 loss neural networks

Yunzhe Xue; Meiyan Xie; Usman Roshan

arXiv:2008.09148·cs.LG·August 24, 2020

Towards adversarial robustness with 01 loss neural networks

Yunzhe Xue, Meiyan Xie, Usman Roshan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a single hidden layer neural network using 01 loss trained with stochastic coordinate descent, demonstrating superior robustness against adversarial attacks on CIFAR10 compared to traditional models.

Contribution

The paper proposes a novel 01 loss neural network trained with stochastic coordinate descent that enhances adversarial robustness, outperforming standard sigmoid and binarized networks in attack resistance.

Findings

01

01 loss network has the highest adversarial distortion margins.

02

01 loss network is more resistant to black box attacks across various thresholds.

03

Compared to simple convolutional models, the 01 loss network shows comparable robustness.

Abstract

Motivated by the general robustness properties of the 01 loss we propose a single hidden layer 01 loss neural network trained with stochastic coordinate descent as a defense against adversarial attacks in machine learning. One measure of a model's robustness is the minimum distortion required to make the input adversarial. This can be approximated with the Boundary Attack (Brendel et. al. 2018) and HopSkipJump (Chen et. al. 2019) methods. We compare the minimum distortion of the 01 loss network to the binarized neural network and the standard sigmoid activation network with cross-entropy loss all trained with and without Gaussian noise on the CIFAR10 benchmark binary classification between classes 0 and 1. Both with and without noise training we find our 01 loss network to have the largest adversarial distortion of the three models by non-trivial margins. To further validate these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zero-one-loss/mlp01
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Anomaly Detection Techniques and Applications

MethodsSigmoid Activation