A Probabilistic Framework for Mutation Testing in Deep Neural Networks
Florian Tambon, Foutse Khomh, Giuliano Antoniol

TL;DR
This paper introduces a probabilistic mutation testing framework for deep neural networks that addresses the inconsistency caused by stochastic training, providing more reliable fault detection in DL models.
Contribution
It proposes a novel probabilistic approach to mutation testing in deep learning that improves consistency and reliability over existing methods.
Findings
PMT offers more consistent mutation detection across test runs.
The method balances approximation error and computational cost effectively.
Experimental results validate the effectiveness of PMT on multiple models and mutation operators.
Abstract
Context: Mutation Testing (MT) is an important tool in traditional Software Engineering (SE) white-box testing. It aims to artificially inject faults in a system to evaluate a test suite's capability to detect them, assuming that the test suite defects finding capability will then translate to real faults. If MT has long been used in SE, it is only recently that it started gaining the attention of the Deep Learning (DL) community, with researchers adapting it to improve the testability of DL models and improve the trustworthiness of DL systems. Objective: If several techniques have been proposed for MT, most of them neglected the stochasticity inherent to DL resulting from the training phase. Even the latest MT approaches in DL, which propose to tackle MT through a statistical approach, might give inconsistent results. Indeed, as their statistic is based on a fixed set of sampled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Adversarial Robustness in Machine Learning · Software Engineering Research
MethodsTest
