Test vs Mutant: Adversarial LLM Agents for Robust Unit Test Generation

Pengyu Chang; Yixiong Fang; Silin Chen; Yuling Shi; Beijun Shen; Xiaodong Gu

arXiv:2602.08146·cs.SE·February 11, 2026

Test vs Mutant: Adversarial LLM Agents for Robust Unit Test Generation

Pengyu Chang, Yixiong Fang, Silin Chen, Yuling Shi, Beijun Shen, Xiaodong Gu

PDF

Open Access

TL;DR

This paper introduces AdverTest, an adversarial framework with two interacting agents that co-evolve to generate more robust and comprehensive unit tests for software, significantly improving bug detection and coverage.

Contribution

The paper presents a novel adversarial approach for LLM-based test generation, enhancing robustness and bug detection beyond existing methods.

Findings

01

Improves fault detection rates by 8.56% over existing LLM methods

02

Achieves 63.30% higher fault detection than EvoSuite

03

Enhances line and branch coverage in testing

Abstract

Software testing is a critical, yet resource-intensive phase of the software development lifecycle. Over the years, various automated tools have been developed to aid in this process. Search-based approaches typically achieve high coverage but produce tests with low readability, whereas large language model (LLM)-based methods generate more human-readable tests but often suffer from low coverage and compilability. While the majority of research efforts have focused on improving test coverage and readability, little attention has been paid to enhancing the robustness of bug detection, particularly in exposing corner cases and vulnerable execution paths. To address this gap, we propose AdverTest, a novel adversarial framework for LLM-powered test case generation. AdverTest comprises two interacting agents: a test case generation agent (T) and a mutant generation agent (M). These agents…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Software Engineering Techniques and Practices