A Review and Refinement of Surprise Adequacy

Michael Weiss; Rwiddhi Chakraborty; Paolo Tonella

arXiv:2103.05939·cs.LG·March 11, 2021

TL;DR

This paper reviews and refines the computation of Surprise Adequacy (SA) for Deep Learning testing, introducing optimized algorithms that significantly reduce evaluation time and analyzing SA's effectiveness and sensitivity in out-of-distribution detection.

Contribution

It presents a performance-optimized implementation of SA, refined variants for faster evaluation, and an empirical study on MNIST highlighting SA's capabilities and sensitivity issues.

Findings

01 Refined SA variants are substantially faster with comparable results.

02 Optimized implementation reduces evaluation time by up to 97%.

03 SA can be highly sensitive to non-determinism in DNN training.

Abstract

Surprise Adequacy (SA) is one of the emerging and most promising adequacy criteria for Deep Learning (DL) testing. As an adequacy criterion, it has been used to assess the strength of DL test suites. It has also been used to find inputs to a Deep Neural Network (DNN) which were not sufficiently represented in the training data, or to select samples for DNN retraining. However, computation of the SA metric for a test suite can be prohibitively expensive, as it involves a quadratic number of distance calculations. Hence, we developed and released a performance-optimized, but functionally equivalent, implementation of SA, reducing the evaluation time by up to 97%. We also propose refined variants of the SA computation algorithm, aiming to further increase the evaluation speed. We then performed an empirical study on MNIST, focused on the out-of-distribution detection…
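
To make the quadratic cost concrete, the following is a minimal sketch of distance-based SA as it is commonly defined in the literature: the distance from a test activation to its nearest same-class training activation, divided by the distance from that neighbor to the nearest other-class training activation. This is an illustration under assumptions, not the authors' released implementation; the function name `dsa_scores` and its parameters are hypothetical, and the abstract does not specify which SA variant or which optimizations the paper uses.

```python
import numpy as np

def dsa_scores(train_acts, train_labels, test_acts, test_preds):
    """Illustrative distance-based surprise score per test input.

    train_acts / test_acts: 2-D arrays of activation vectors (one row each).
    train_labels: class label of each training row.
    test_preds: class predicted by the DNN for each test row.
    """
    scores = np.empty(len(test_acts))
    for i, (act, pred) in enumerate(zip(test_acts, test_preds)):
        same = train_acts[train_labels == pred]
        other = train_acts[train_labels != pred]
        # One distance per same-class training sample: this is the part
        # that makes the total cost grow with (test size x train size).
        d_same = np.linalg.norm(same - act, axis=1)
        nearest = same[np.argmin(d_same)]
        dist_a = d_same.min()
        # Distance from that nearest neighbor to the closest sample
        # of any other class.
        dist_b = np.linalg.norm(other - nearest, axis=1).min()
        scores[i] = dist_a / dist_b
    return scores

# Tiny hand-made example (values are made up for illustration).
train_acts = np.array([[0.0, 0.0], [1.0, 0.0], [4.0, 0.0]])
train_labels = np.array([0, 0, 1])
test_acts = np.array([[0.0, 1.0], [4.0, 2.0]])
test_preds = np.array([0, 1])
scores = dsa_scores(train_acts, train_labels, test_acts, test_preds)
```

Each test input is compared against every training activation, so the number of distance computations scales with the product of the two set sizes. The inner comparisons are vectorized with NumPy here, which is one generic way to reduce wall-clock time without changing the result.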
