Defending against Adversarial Attack towards Deep Neural Networks via   Collaborative Multi-task Training

Derek Wang; Chaoran Li; Sheng Wen; Surya Nepal; Yang Xiang

arXiv:1803.05123·cs.LG·July 27, 2020·6 cites

Defending against Adversarial Attack towards Deep Neural Networks via Collaborative Multi-task Training

Derek Wang, Chaoran Li, Sheng Wen, Surya Nepal, Yang Xiang

PDF

Open Access

TL;DR

This paper introduces a collaborative multi-task training framework to defend deep neural networks against various adversarial attacks, improving robustness and detection capabilities across different attack types.

Contribution

The proposed method uniquely combines label encoding, adversarial training, and a detection mechanism within a collaborative architecture to defend against diverse adversarial attacks.

Findings

01

Achieved up to 96.3% accuracy on black-box adversarial examples

02

Detected up to 98.7% of high-confidence adversarial examples

03

Minimal accuracy decrease (2.1%) on benign CIFAR10 samples

Abstract

Deep neural networks (DNNs) are known to be vulnerable to adversarial examples which contain human-imperceptible perturbations. A series of defending methods, either proactive defence or reactive defence, have been proposed in the recent years. However, most of the methods can only handle specific attacks. For example, proactive defending methods are invalid against grey-box or white-box attacks, while reactive defending methods are challenged by low-distortion adversarial examples or transferring adversarial examples. This becomes a critical problem since a defender usually does not have the type of the attack as a priori knowledge. Moreover, existing two-pronged defences (e.g., MagNet), which take advantages of both proactive and reactive methods, have been reported as broken under transferring attacks. To address this problem, this paper proposed a novel defensive framework based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning