EvoGPT: Leveraging LLM-Driven Seed Diversity to Improve Search-Based Test Suite Generation
Lior Broide, Roni Stern, Argaman Mordoch

TL;DR
EvoGPT combines Large Language Models with Search-Based Software Testing and evolutionary algorithms to generate diverse, high-quality test suites, significantly improving code coverage and mutation scores on benchmark datasets.
Contribution
This paper introduces EvoGPT, a novel hybrid system that explicitly enforces diversity in LLM-driven test generation, enhancing the effectiveness of automated unit testing.
Findings
EvoGPT achieves 10% higher coverage than baselines.
Diversity enforcement improves test suite quality.
EvoGPT outperforms LLM-only and traditional SBST methods.
Abstract
Search-Based Software Testing (SBST) is a well-established approach for automated unit test generation, yet it often suffers from premature convergence and limited diversity in the generated test suites. Recently, Large Language Models (LLMs) have emerged as an alternative technique for unit test generation. We present EvoGPT, a hybrid test generation system that integrates LLM-based test generation with SBST-based test suite optimization. EvoGPT uses LLMs to generate an initial population of test suites, and uses an Evolutionary Algorithm (EA) to further optimize this test suite population. A distinguishing feature of EvoGPT is its explicit enforcement of diversity, achieved through the use of multiple temperatures and prompt instructions during test generation. In addition, each LLM-generated test is refined using a generation-repair loop and coverage-guided assertion generation. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · VLSI and Analog Circuit Testing · Engineering and Test Systems
