Towards Characterizing Adversarial Defects of Deep Learning Software   from the Lens of Uncertainty

Xiyue Zhang; Xiaofei Xie; Lei Ma; Xiaoning Du; Qiang Hu; Yang Liu,; Jianjun Zhao; Meng Sun

arXiv:2004.11573·cs.SE·April 27, 2020·5 cites

Towards Characterizing Adversarial Defects of Deep Learning Software from the Lens of Uncertainty

Xiyue Zhang, Xiaofei Xie, Lei Ma, Xiaoning Du, Qiang Hu, Yang Liu,, Jianjun Zhao, Meng Sun

PDF

Open Access

TL;DR

This paper systematically studies the relationship between adversarial examples and uncertainty in deep learning, proposing methods to generate diverse AEs and BEs, revealing gaps in current defenses and emphasizing the need for more comprehensive testing.

Contribution

It introduces an uncertainty-based characterization of adversarial defects, and proposes an automated testing approach to generate diverse, uncommon adversarial and benign examples.

Findings

01

Uncertainty metrics can differentiate benign and adversarial examples.

02

Existing methods miss many uncertainty patterns in AEs and BEs.

03

Generated uncommon data reduces defense success rate by 35%.

Abstract

Over the past decade, deep learning (DL) has been successfully applied to many industrial domain-specific tasks. However, the current state-of-the-art DL software still suffers from quality issues, which raises great concern especially in the context of safety- and security-critical scenarios. Adversarial examples (AEs) represent a typical and important type of defects needed to be urgently addressed, on which a DL software makes incorrect decisions. Such defects occur through either intentional attack or physical-world noise perceived by input sensors, potentially hindering further industry deployment. The intrinsic uncertainty nature of deep learning decisions can be a fundamental reason for its incorrect behavior. Although some testing, adversarial attack and defense techniques have been recently proposed, it still lacks a systematic study to uncover the relationship between AEs and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Advanced Malware Detection Techniques