TL;DR
This paper introduces Surprise Adequacy (SADL), a new testing criterion for deep learning systems that measures how surprising an input is relative to training data, improving robustness against adversarial examples.
Contribution
The paper proposes a novel surprise-based adequacy criterion for DL testing, addressing limitations of existing coverage metrics and guiding more effective testing.
Findings
Surprise sampling improves classification accuracy against adversarial examples by up to 77.5%.
SADL effectively guides testing to identify subtle model behaviors.
Empirical results across various DL systems demonstrate the method's effectiveness.
Abstract
Deep Learning (DL) systems are rapidly being adopted in safety and security critical domains, urgently calling for ways to test their correctness and robustness. Testing of DL systems has traditionally relied on manual collection and labelling of data. Recently, a number of coverage criteria based on neuron activation values have been proposed. These criteria essentially count the number of neurons whose activation during the execution of a DL system satisfied certain properties, such as being above predefined thresholds. However, existing coverage criteria are not sufficiently fine grained to capture subtle behaviours exhibited by DL systems. Moreover, evaluations have focused on showing correlation between adversarial examples and proposed criteria rather than evaluating and guiding their use for actual testing of DL systems. We propose a novel test adequacy criterion for testing of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
