DeepMutants: Training neural bug detectors with contextual mutations

Cedric Richter; Heike Wehrheim

arXiv:2107.06657·cs.SE·July 15, 2021

DeepMutants: Training neural bug detectors with contextual mutations

Cedric Richter, Heike Wehrheim

PDF

Open Access

TL;DR

DeepMutants introduces a context-aware mutation technique using language models to generate realistic bugs, significantly improving the training of neural bug detectors for real-world code analysis.

Contribution

The paper presents a novel contextual mutation operator leveraging language models to create more realistic training data for bug detection models.

Findings

01

Context-aware mutants better mimic real bugs.

02

Improved bug detector performance on real-world code.

03

Language model-based mutations outperform traditional methods.

Abstract

Learning-based bug detectors promise to find bugs in large code bases by exploiting natural hints such as names of variables and functions or comments. Still, existing techniques tend to underperform when presented with realistic bugs. We believe bug detector learning to currently suffer from a lack of realistic defective training examples. In fact, real world bugs are scarce which has driven existing methods to train on artificially created and mostly unrealistic mutants. In this work, we propose a novel contextual mutation operator which incorporates knowledge about the mutation context to dynamically inject natural and more realistic faults into code. Our approach employs a masked language model to produce a context-dependent distribution over feasible token replacements. The evaluation shows that sampling from a language model does not only produce mutants which more accurately…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Software Reliability and Analysis Research