ATTest: Agent-Driven Tensor Testing for Deep Learning Library Modules

Zhengyu Zhan, Ye Shang, Jiawei Liu, Chunrong Fang, Quanjun Zhang, and Zhenyu Chen

arXiv:2602.13987 · cs.SE · February 17, 2026

TL;DR

ATTest is an agent-driven framework that generates module-level unit tests for deep learning libraries. It targets the implicit tensor constraints that defeat search-based and LLM-based generators, and it reports higher branch coverage than state-of-the-art baselines such as PynguinML on PyTorch and TensorFlow.

Contribution

A seven-stage pipeline that combines constraint extraction with an iterative generation-validation-repair loop, designed to keep test generation stable and to mitigate context-window saturation.
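The generation-validation-repair idea can be illustrated with a small sketch. This is not ATTest's implementation: the names `generate_validate_repair`, `validate_by_exec`, and `LoopResult` are hypothetical, the generator and repairer are plain callables standing in for whatever agent produces and fixes tests, and validation is assumed to mean "the candidate runs without raising".

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class LoopResult:
    test_source: Optional[str]  # candidate that passed validation, or None
    attempts: int               # number of validation rounds performed
    errors: List[str] = field(default_factory=list)  # one message per failed round


def validate_by_exec(source: str) -> Optional[str]:
    """Run a candidate test in a fresh namespace; return an error message or None."""
    try:
        exec(source, {})
    except Exception as exc:  # any failure is fed back to the repair step
        return f"{type(exc).__name__}: {exc}"
    return None


def generate_validate_repair(
    generate: Callable[[], str],
    validate: Callable[[str], Optional[str]],
    repair: Callable[[str, str], str],
    max_rounds: int = 3,
) -> LoopResult:
    """Hypothetical loop: generate once, then validate and repair up to max_rounds times."""
    candidate = generate()
    errors: List[str] = []
    for attempt in range(1, max_rounds + 1):
        error = validate(candidate)
        if error is None:
            return LoopResult(candidate, attempt, errors)
        errors.append(error)
        candidate = repair(candidate, error)
    return LoopResult(None, max_rounds, errors)


# Toy run: the first candidate violates a shape constraint, the repair fixes it.
first_draft = (
    "shape_a, shape_b = (2, 3), (4, 5)\n"
    "assert shape_a[1] == shape_b[0], 'inner dimensions must match'\n"
)
result = generate_validate_repair(
    generate=lambda: first_draft,
    validate=validate_by_exec,
    repair=lambda src, err: src.replace("(4, 5)", "(3, 5)"),
)
print(result.attempts, result.errors)
```

Bounding the loop with `max_rounds` is one simple way to keep a failing candidate from consuming unbounded effort; how ATTest bounds or schedules its own loop is not stated on this page.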

Findings

01. Achieves an average branch coverage of 55.60% on PyTorch

02. Achieves an average branch coverage of 54.77% on TensorFlow

03. Significantly outperforms state-of-the-art baselines such as PynguinML

Abstract

The unit testing of Deep Learning (DL) libraries is challenging due to complex numerical semantics and implicit tensor constraints. Traditional Search-Based Software Testing (SBST) often suffers from semantic blindness, failing to satisfy the constraints of high-dimensional tensors, whereas Large Language Models (LLMs) struggle with cross-file context and unstable code modifications. This paper proposes ATTest, an agent-driven tensor testing framework for module-level unit test generation. ATTest orchestrates a seven-stage pipeline, which encompasses constraint extraction and an iterative "generation-validation-repair" loop, to maintain testing stability and mitigate context-window saturation. An evaluation on PyTorch and TensorFlow demonstrates that ATTest significantly outperforms state-of-the-art baselines such as PynguinML, achieving an average branch coverage of 55.60% and 54.77%,…
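To make "implicit tensor constraints" concrete, here is a minimal sketch using one well-known constraint: a 2-D matrix product requires the inner dimensions of its operands to match. The example is pure Python and does not use PyTorch, TensorFlow, or any part of ATTest; the helper names (`matmul_result_shape`, `blind_pair`, `constrained_pair`, `valid_fraction`) are hypothetical. It compares shape pairs sampled without knowledge of the constraint against pairs sampled with it.

```python
import random
from typing import Callable, Tuple

Shape = Tuple[int, int]


def matmul_result_shape(a: Shape, b: Shape) -> Shape:
    """Shape of a 2-D matrix product; raises ValueError if inner dimensions differ."""
    if a[1] != b[0]:
        raise ValueError(f"inner dimensions differ: {a[1]} vs {b[0]}")
    return (a[0], b[1])


def blind_pair(rng: random.Random, max_dim: int = 8) -> Tuple[Shape, Shape]:
    """Sample four independent dimensions, ignoring the constraint."""
    dims = [rng.randint(1, max_dim) for _ in range(4)]
    return (dims[0], dims[1]), (dims[2], dims[3])


def constrained_pair(rng: random.Random, max_dim: int = 8) -> Tuple[Shape, Shape]:
    """Sample m, k, n and reuse k so the inner dimensions always agree."""
    m, k, n = (rng.randint(1, max_dim) for _ in range(3))
    return (m, k), (k, n)


def valid_fraction(
    sampler: Callable[[random.Random], Tuple[Shape, Shape]],
    trials: int = 200,
    seed: int = 0,
) -> float:
    """Fraction of sampled shape pairs that satisfy the matmul constraint."""
    rng = random.Random(seed)
    ok = 0
    for _ in range(trials):
        a, b = sampler(rng)
        try:
            matmul_result_shape(a, b)
            ok += 1
        except ValueError:
            pass
    return ok / trials


print("blind:", valid_fraction(blind_pair))
print("constraint-aware:", valid_fraction(constrained_pair))
```

The constraint-aware sampler is valid by construction, while the blind sampler only succeeds when two independent draws happen to coincide. Real DL APIs stack many such constraints (rank, dtype, broadcasting), which is the kind of difficulty the abstract attributes to semantically blind search.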

Taxonomy

Topics: Software Testing and Debugging Techniques · Topic Modeling · Machine Learning in Materials Science