Defending Adversarial Examples via DNN Bottleneck Reinforcement

Wenqing Liu; Miaojing Shi; Teddy Furon; Li Li

arXiv:2008.05230·cs.CV·August 13, 2020

Defending Adversarial Examples via DNN Bottleneck Reinforcement

Wenqing Liu, Miaojing Shi, Teddy Furon, Li Li

PDF

TL;DR

This paper introduces a novel DNN bottleneck reinforcement method that enhances robustness against adversarial attacks by reinforcing the information bottleneck through joint auto-encoder training and frequency steering, without altering the classifier structure.

Contribution

It proposes a new defense mechanism that maintains the classifier structure intact and uses frequency-based reinforcement to improve adversarial robustness.

Findings

01

Effective against various adversarial attacks on MNIST, CIFAR-10, and ImageNet.

02

No need for pre-processing head or classifier modification.

03

Outperforms existing defense methods in experiments.

Abstract

This paper presents a DNN bottleneck reinforcement scheme to alleviate the vulnerability of Deep Neural Networks (DNN) against adversarial attacks. Typical DNN classifiers encode the input image into a compressed latent representation more suitable for inference. This information bottleneck makes a trade-off between the image-specific structure and class-specific information in an image. By reinforcing the former while maintaining the latter, any redundant information, be it adversarial or not, should be removed from the latent representation. Hence, this paper proposes to jointly train an auto-encoder (AE) sharing the same encoding weights with the visual classifier. In order to reinforce the information bottleneck, we introduce the multi-scale low-pass objective and multi-scale high-frequency communication for better frequency steering in the network. Unlike existing approaches, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.