The Seeds of the FUTURE Sprout from History: Fuzzing for Unveiling Vulnerabilities in Prospective Deep-Learning Libraries
Zhiyuan Li, Jingzheng Wu, Xiang Ling, Tianyue Luo, Zhiqing Rui, Yanjun, Wu

TL;DR
This paper introduces FUTURE, a universal fuzzing framework that uses historical bug data and fine-tuned LLMs to effectively identify vulnerabilities in new and existing deep learning libraries, improving security and API coverage.
Contribution
FUTURE is the first fuzzing framework designed specifically for new DL libraries, leveraging historical bugs and LLMs for targeted bug detection and security enhancement.
Findings
Detected 148 bugs, including 142 new ones, across 452 APIs.
Successfully identified 7 bugs in PyTorch, improving existing library security.
Outperformed existing fuzzers in bug detection, API coverage, and code validity.
Abstract
The widespread application of large language models (LLMs) underscores the importance of deep learning (DL) technologies that rely on foundational DL libraries such as PyTorch and TensorFlow. Despite their robust features, these libraries face challenges with scalability and adaptation to rapid advancements in the LLM community. In response, tech giants like Apple and Huawei are developing their own DL libraries to enhance performance, increase scalability, and safeguard intellectual property. Ensuring the security of these libraries is crucial, with fuzzing being a vital solution. However, existing fuzzing frameworks struggle with target flexibility, effectively testing bug-prone API sequences, and leveraging the limited available information in new libraries. To address these limitations, we propose FUTURE, the first universal fuzzing framework tailored for newly introduced and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Ethics and Social Impacts of AI
