Large Language Models are Edge-Case Fuzzers: Testing Deep Learning Libraries via FuzzGPT
Yinlin Deng, Chunqiu Steven Xia, Chenyuan Yang, Shizhuo Dylan Zhang,, Shujing Yang, Lingming Zhang

TL;DR
FuzzGPT leverages large language models to generate unusual, edge-case programs for fuzzing deep learning libraries, significantly improving bug detection over previous methods.
Contribution
This paper introduces FuzzGPT, a novel approach that automates the synthesis of unusual programs for fuzzing DL libraries using LLMs, enhancing bug-finding capabilities.
Findings
FuzzGPT detects 76 bugs in PyTorch and TensorFlow.
49 of these bugs are previously unknown.
11 high-priority bugs or security vulnerabilities identified.
Abstract
Deep Learning (DL) library bugs affect downstream DL applications, emphasizing the need for reliable systems. Generating valid input programs for fuzzing DL libraries is challenging due to the need for satisfying both language syntax/semantics and constraints for constructing valid computational graphs. Recently, the TitanFuzz work demonstrates that modern Large Language Models (LLMs) can be directly leveraged to implicitly learn all the constraints to generate valid DL programs for fuzzing. However, LLMs tend to generate ordinary programs following similar patterns seen in their massive training corpora, while fuzzing favors unusual inputs that cover edge cases or are unlikely to be manually produced. To fill this gap, this paper proposes FuzzGPT, the first technique to prime LLMs to synthesize unusual programs for fuzzing. FuzzGPT is built on the well-known hypothesis that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Software Engineering Research · Topic Modeling
