May the Feedback Be with You! Unlocking the Power of Feedback-Driven Deep Learning Framework Fuzzing via LLMs

Shaoyu Yang; Chunrong Fang; Haifeng Lin; Xiang Chen; Jia Liu; Zhenyu Chen

arXiv:2506.17642·cs.SE·January 21, 2026

May the Feedback Be with You! Unlocking the Power of Feedback-Driven Deep Learning Framework Fuzzing via LLMs

Shaoyu Yang, Chunrong Fang, Haifeng Lin, Xiang Chen, Jia Liu, Zhenyu Chen

PDF

Open Access 1 Repo

TL;DR

FUEL leverages feedback from fuzz testing DL frameworks using LLMs to generate more effective tests, improving coverage and bug detection, including high-severity vulnerabilities.

Contribution

This work introduces FUEL, a novel framework utilizing two LLMs for analyzing feedback and guiding test generation in DL framework fuzzing, with automated test repair and enhanced diversity.

Findings

01

Improves code coverage of PyTorch by 4.48% and TensorFlow by 9.14%.

02

Detects 104 previously unknown bugs, with 93 confirmed as new.

03

Identifies 14 vulnerabilities, 7 with high severity.

Abstract

Deep Learning (DL) frameworks have served as fundamental components in DL systems over the last decade. However, bugs in DL frameworks could lead to catastrophic consequences in critical scenarios. A simple yet effective way to find bugs in DL frameworks is fuzz testing (Fuzzing). Existing approaches focus on test generation, leaving execution results with high semantic value (e.g., coverage information, bug reports, and exception logs) in the wild, which can serve as multiple types of feedback. To fill this gap, we propose FUEL to effectively utilize the feedback information, which comprises two Large Language Models (LLMs): analysis LLM and generation LLM. Specifically, analysis LLM infers analysis summaries from feedback information, while the generation LLM creates tests guided by these summaries. Furthermore, based on multiple feedback guidance, we design two additional components:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nju-ise/fuel
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Adversarial Robustness in Machine Learning · Software Engineering Research