TEASMA: A Practical Methodology for Test Adequacy Assessment of Deep Neural Networks
Amin Abbasishahkoo, Mahboubeh Dadkhah, Lionel Briand, Dayi, Lin

TL;DR
TEASMA is a practical methodology that predicts the fault detection rate of test sets for DNNs, enabling better assessment of test adequacy and trustworthiness before deployment.
Contribution
It introduces a comprehensive approach to model and predict DNN test set fault detection ability using existing adequacy metrics, validated through extensive empirical evaluation.
Findings
Strong linear correlation between predicted and actual FDR with R^2 > 0.90.
Low RMSE of 9% indicates high prediction accuracy.
MS metric showed superior predictive performance over DSC and IDC.
Abstract
Successful deployment of Deep Neural Networks (DNNs) requires their validation with an adequate test set to ensure a sufficient degree of confidence in test outcomes. Although well-established test adequacy assessment techniques have been proposed for DNNs, we still need to investigate their application within a comprehensive methodology for accurately predicting the fault detection ability of test sets and thus assessing their adequacy. In this paper, we propose and evaluate TEASMA, a comprehensive and practical methodology designed to accurately assess the adequacy of test sets for DNNs. In practice, TEASMA allows engineers to decide whether they can trust high-accuracy test results and thus validate the DNN before its deployment. Based on a DNN model's training set, TEASMA provides a procedure to build accurate DNN-specific prediction models of the Fault Detection Rate (FDR) of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Software Reliability and Analysis Research · Software Testing and Debugging Techniques
