The Seeds of the FUTURE Sprout from History: Fuzzing for Unveiling   Vulnerabilities in Prospective Deep-Learning Libraries

Zhiyuan Li; Jingzheng Wu; Xiang Ling; Tianyue Luo; Zhiqing Rui; Yanjun; Wu

arXiv:2412.01317·cs.SE·December 12, 2024

The Seeds of the FUTURE Sprout from History: Fuzzing for Unveiling Vulnerabilities in Prospective Deep-Learning Libraries

Zhiyuan Li, Jingzheng Wu, Xiang Ling, Tianyue Luo, Zhiqing Rui, Yanjun, Wu

PDF

Open Access 1 Repo

TL;DR

This paper introduces FUTURE, a universal fuzzing framework that uses historical bug data and fine-tuned LLMs to effectively identify vulnerabilities in new and existing deep learning libraries, improving security and API coverage.

Contribution

FUTURE is the first fuzzing framework designed specifically for new DL libraries, leveraging historical bugs and LLMs for targeted bug detection and security enhancement.

Findings

01

Detected 148 bugs, including 142 new ones, across 452 APIs.

02

Successfully identified 7 bugs in PyTorch, improving existing library security.

03

Outperformed existing fuzzers in bug detection, API coverage, and code validity.

Abstract

The widespread application of large language models (LLMs) underscores the importance of deep learning (DL) technologies that rely on foundational DL libraries such as PyTorch and TensorFlow. Despite their robust features, these libraries face challenges with scalability and adaptation to rapid advancements in the LLM community. In response, tech giants like Apple and Huawei are developing their own DL libraries to enhance performance, increase scalability, and safeguard intellectual property. Ensuring the security of these libraries is crucial, with fuzzing being a vital solution. However, existing fuzzing frameworks struggle with target flexibility, effectively testing bug-prone API sequences, and leveraging the limited available information in new libraries. To address these limitations, we propose FUTURE, the first universal fuzzing framework tailored for newly introduced and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Redmept1on/FUTURE
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Ethics and Social Impacts of AI