New Formulation of DNN Statistical Mutation Killing for Ensuring Monotonicity: A Technical Report

Jinhan Kim; Nargiz Humbatova; Gunel Jahangirova; Shin Yoo; Paolo Tonella

arXiv:2507.11199·cs.SE·July 16, 2025

New Formulation of DNN Statistical Mutation Killing for Ensuring Monotonicity: A Technical Report

Jinhan Kim, Nargiz Humbatova, Gunel Jahangirova, Shin Yoo, Paolo Tonella

PDF

Open Access

TL;DR

This paper introduces a new Fisher exact test-based formulation for DNN mutation testing that maintains statistical rigor and guarantees monotonicity, addressing limitations of previous methods.

Contribution

It proposes a novel mutation killing criterion for DNNs that ensures monotonicity while preserving statistical testing validity.

Findings

01

Ensures monotonicity in mutation testing results.

02

Maintains statistical rigor with Fisher exact test.

03

Addresses limitations of previous approaches.

Abstract

Mutation testing has emerged as a powerful technique for evaluating the effectiveness of test suites for Deep Neural Networks. Among existing approaches, the statistical mutant killing criterion of DeepCrime has leveraged statistical testing to determine whether a mutant significantly differs from the original model. However, it suffers from a critical limitation: it violates the monotonicity property, meaning that expanding a test set may result in previously killed mutants no longer being classified as killed. In this technical report, we propose a new formulation of statistical mutant killing based on Fisher exact test that preserves the statistical rigour of it while ensuring monotonicity.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCancer Genomics and Diagnostics