A Probabilistic Framework for Mutation Testing in Deep Neural Networks

Florian Tambon; Foutse Khomh; Giuliano Antoniol

arXiv:2208.06018·cs.SE·January 16, 2023·1 cites

A Probabilistic Framework for Mutation Testing in Deep Neural Networks

Florian Tambon, Foutse Khomh, Giuliano Antoniol

PDF

Open Access 1 Repo

TL;DR

This paper introduces a probabilistic mutation testing framework for deep neural networks that addresses the inconsistency caused by stochastic training, providing more reliable fault detection in DL models.

Contribution

It proposes a novel probabilistic approach to mutation testing in deep learning that improves consistency and reliability over existing methods.

Findings

01

PMT offers more consistent mutation detection across test runs.

02

The method balances approximation error and computational cost effectively.

03

Experimental results validate the effectiveness of PMT on multiple models and mutation operators.

Abstract

Context: Mutation Testing (MT) is an important tool in traditional Software Engineering (SE) white-box testing. It aims to artificially inject faults in a system to evaluate a test suite's capability to detect them, assuming that the test suite defects finding capability will then translate to real faults. If MT has long been used in SE, it is only recently that it started gaining the attention of the Deep Learning (DL) community, with researchers adapting it to improve the testability of DL models and improve the trustworthiness of DL systems. Objective: If several techniques have been proposed for MT, most of them neglected the stochasticity inherent to DL resulting from the training phase. Even the latest MT approaches in DL, which propose to tackle MT through a statistical approach, might give inconsistent results. Indeed, as their statistic is based on a fixed set of sampled…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

flowss/pmt
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Adversarial Robustness in Machine Learning · Software Engineering Research

MethodsTest