Enhancing Adversarial Defense by k-Winners-Take-All

Chang Xiao; Peilin Zhong; Changxi Zheng

arXiv:1905.10510·cs.LG·October 30, 2019·6 cites

Enhancing Adversarial Defense by k-Winners-Take-All

Chang Xiao, Peilin Zhong, Changxi Zheng

PDF

Open Access 1 Repo

TL;DR

This paper introduces k-Winners-Take-All activation functions to neural networks, which improve robustness against gradient-based adversarial attacks by intentionally invalidating gradients at certain input points, with minimal training overhead.

Contribution

The paper presents a novel activation function, k-WTA, that enhances adversarial robustness by disrupting gradient-based attacks while maintaining training efficiency.

Findings

01

k-WTA activation increases adversarial robustness across various network architectures.

02

Discontinuities in k-WTA prevent effective gradient-based adversarial search.

03

k-WTA networks outperform traditional networks under white-box attacks.

Abstract

We propose a simple change to existing neural network structures for better defending against gradient-based adversarial attacks. Instead of using popular activation functions (such as ReLU), we advocate the use of k-Winners-Take-All (k-WTA) activation, a C0 discontinuous function that purposely invalidates the neural network model's gradient at densely distributed input data points. The proposed k-WTA activation can be readily used in nearly all existing networks and training methods with no significant overhead. Our proposal is theoretically rationalized. We analyze why the discontinuities in k-WTA networks can largely prevent gradient-based search of adversarial examples and why they at the same time remain innocuous to the network training. This understanding is also empirically backed. We test k-WTA activation on various network structures optimized by a training method, be it…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

a554b554/kWTA-Activation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications