Towards Characterizing Adversarial Defects of Deep Learning Software from the Lens of Uncertainty
Xiyue Zhang, Xiaofei Xie, Lei Ma, Xiaoning Du, Qiang Hu, Yang Liu,, Jianjun Zhao, Meng Sun

TL;DR
This paper systematically studies the relationship between adversarial examples and uncertainty in deep learning, proposing methods to generate diverse AEs and BEs, revealing gaps in current defenses and emphasizing the need for more comprehensive testing.
Contribution
It introduces an uncertainty-based characterization of adversarial defects, and proposes an automated testing approach to generate diverse, uncommon adversarial and benign examples.
Findings
Uncertainty metrics can differentiate benign and adversarial examples.
Existing methods miss many uncertainty patterns in AEs and BEs.
Generated uncommon data reduces defense success rate by 35%.
Abstract
Over the past decade, deep learning (DL) has been successfully applied to many industrial domain-specific tasks. However, the current state-of-the-art DL software still suffers from quality issues, which raises great concern especially in the context of safety- and security-critical scenarios. Adversarial examples (AEs) represent a typical and important type of defects needed to be urgently addressed, on which a DL software makes incorrect decisions. Such defects occur through either intentional attack or physical-world noise perceived by input sensors, potentially hindering further industry deployment. The intrinsic uncertainty nature of deep learning decisions can be a fundamental reason for its incorrect behavior. Although some testing, adversarial attack and defense techniques have been recently proposed, it still lacks a systematic study to uncover the relationship between AEs and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Advanced Malware Detection Techniques
