Defending against Adversarial Attack towards Deep Neural Networks via Collaborative Multi-task Training
Derek Wang, Chaoran Li, Sheng Wen, Surya Nepal, Yang Xiang

TL;DR
This paper introduces a collaborative multi-task training framework to defend deep neural networks against various adversarial attacks, improving robustness and detection capabilities across different attack types.
Contribution
The proposed method uniquely combines label encoding, adversarial training, and a detection mechanism within a collaborative architecture to defend against diverse adversarial attacks.
Findings
Achieved up to 96.3% accuracy on black-box adversarial examples
Detected up to 98.7% of high-confidence adversarial examples
Minimal accuracy decrease (2.1%) on benign CIFAR10 samples
Abstract
Deep neural networks (DNNs) are known to be vulnerable to adversarial examples which contain human-imperceptible perturbations. A series of defending methods, either proactive defence or reactive defence, have been proposed in the recent years. However, most of the methods can only handle specific attacks. For example, proactive defending methods are invalid against grey-box or white-box attacks, while reactive defending methods are challenged by low-distortion adversarial examples or transferring adversarial examples. This becomes a critical problem since a defender usually does not have the type of the attack as a priori knowledge. Moreover, existing two-pronged defences (e.g., MagNet), which take advantages of both proactive and reactive methods, have been reported as broken under transferring attacks. To address this problem, this paper proposed a novel defensive framework based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
