LLMorpheus: Mutation Testing using Large Language Models

Frank Tip; Jonathan Bell; Max Schaefer

arXiv:2404.09952·cs.SE·March 10, 2025·1 cites

LLMorpheus: Mutation Testing using Large Language Models

Frank Tip, Jonathan Bell, Max Schaefer

PDF

Open Access 1 Repo

TL;DR

This paper introduces LLMorpheus, a mutation testing approach using large language models to generate realistic mutants in JavaScript, addressing limitations of traditional fixed-operator methods.

Contribution

The paper presents a novel mutation testing technique leveraging LLMs to generate more realistic mutants, implemented in a new tool called LLMorpheus for JavaScript.

Findings

01

LLMorpheus produces mutants resembling real-world bugs.

02

It outperforms traditional mutation tools like StrykerJS in generating diverse mutants.

03

The approach is practical in terms of runtime and cost.

Abstract

In mutation testing, the quality of a test suite is evaluated by introducing faults into a program and determining whether the program's tests detect them. Most existing approaches for mutation testing involve the application of a fixed set of mutation operators, e.g., replacing a "+" with a "-", or removing a function's body. However, certain types of real-world bugs cannot easily be simulated by such approaches, limiting their effectiveness. This paper presents a technique for mutation testing where placeholders are introduced at designated locations in a program's source code and where a Large Language Model (LLM) is prompted to ask what they could be replaced with. The technique is implemented in LLMorpheus, a mutation testing tool for JavaScript, and evaluated on 13 subject packages, considering several variations on the prompting strategy, and using several LLMs. We find…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

githubnext/llmorpheus
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Advanced Malware Detection Techniques