Robust and Information-theoretically Safe Bias Classifier against   Adversarial Attacks

Lijia Yu; Xiao-Shan Gao

arXiv:2111.04404·cs.LG·February 15, 2022·1 cites

Robust and Information-theoretically Safe Bias Classifier against Adversarial Attacks

Lijia Yu, Xiao-Shan Gao

PDF

Open Access

TL;DR

This paper introduces a bias classifier based on the bias part of a DNN with ReLU activations, which is inherently resistant to gradient-based adversarial attacks, and enhances its safety with a random perturbation.

Contribution

It proposes the concept of an information-theoretically safe classifier and provides a training method to achieve robustness against gradient attacks.

Findings

01

Bias classifier is more robust than similar-sized DNNs against attacks.

02

Adding a random first-degree part makes the classifier safe from gradient-based attacks.

03

The concept of information-theoretically safe classifier is introduced for the first time.

Abstract

In this paper, the bias classifier is introduced, that is, the bias part of a DNN with Relu as the activation function is used as a classifier. The work is motivated by the fact that the bias part is a piecewise constant function with zero gradient and hence cannot be directly attacked by gradient-based methods to generate adversaries, such as FGSM. The existence of the bias classifier is proved and an effective training method for the bias classifier is given. It is proved that by adding a proper random first-degree part to the bias classifier, an information-theoretically safe classifier against the original-model gradient attack is obtained in the sense that the attack will generate a totally random attacking direction. This seems to be the first time that the concept of information-theoretically safe classifier is proposed. Several attack methods for the bias classifier are proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research