Reflective Unit Test Generation for Precise Type Error Detection with Large Language Models

Chen Yang; Ziqi Wang; Yanjie Jiang; Lin Yang; Yuteng Zheng; Jianyi Zhou; Junjie Chen

arXiv:2507.02318·cs.SE·October 3, 2025·2 cites

Reflective Unit Test Generation for Precise Type Error Detection with Large Language Models

Chen Yang, Ziqi Wang, Yanjie Jiang, Lin Yang, Yuteng Zheng, Jianyi Zhou, Junjie Chen

PDF

Open Access

TL;DR

RTED is a novel type-aware test generation method that combines constraint analysis and reflection to detect Python type errors more accurately, reducing false positives and discovering new errors in real-world code.

Contribution

RTED introduces a new approach integrating type constraint analysis with reflective validation for precise Python type error detection.

Findings

01

Detects 22-29 more errors than existing techniques

02

Improves precision by up to 245.9%

03

Discovered 12 new errors in open-source projects

Abstract

Type errors in Python often lead to runtime failures, posing significant challenges to software reliability and developer productivity. Existing static analysis tools aim to detect such errors without execution but frequently suffer from high false positive rates. Recently, unit test generation techniques offer great promise in achieving high test coverage, but they often struggle to produce bug-revealing tests without tailored guidance. To address these limitations, we present RTED, a novel type-aware test generation technique for automatically detecting Python type errors. Specifically, RTED combines step-by-step type constraint analysis with reflective validation to guide the test generation process and effectively suppress false positives. We evaluated RTED on two widely-used benchmarks, BugsInPy and TypeBugs. Experimental results show that RTED can detect 22-29 more benchmarked…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational Technology and Assessment