Harnessing LLMs for Document-Guided Fuzzing of OpenCV Library

Bin Duan; Tarek Mahmud; Meiru Che; Yan Yan; Naipeng Dong; Dan Dongseong Kim; Guowei Yang

arXiv:2507.14558·cs.SE·July 22, 2025

Harnessing LLMs for Document-Guided Fuzzing of OpenCV Library

Bin Duan, Tarek Mahmud, Meiru Che, Yan Yan, Naipeng Dong, Dan Dongseong Kim, Guowei Yang

PDF

TL;DR

This paper presents VISTAFUZZ, a novel approach that leverages large language models to parse API documentation and systematically generate test inputs, effectively detecting new bugs in the OpenCV library.

Contribution

VISTAFUZZ introduces a document-guided fuzzing method using LLMs to improve bug detection in large software libraries like OpenCV.

Findings

01

Detected 17 new bugs in OpenCV APIs

02

Confirmed 10 bugs, with 5 fixed

03

Demonstrated effectiveness of LLM-guided fuzzing

Abstract

The combination of computer vision and artificial intelligence is fundamentally transforming a broad spectrum of industries by enabling machines to interpret and act upon visual data with high levels of accuracy. As the biggest and by far the most popular open-source computer vision library, OpenCV library provides an extensive suite of programming functions supporting real-time computer vision. Bugs in the OpenCV library can affect the downstream computer vision applications, and it is critical to ensure the reliability of the OpenCV library. This paper introduces VISTAFUZZ, a novel technique for harnessing large language models (LLMs) for document-guided fuzzing of the OpenCV library. VISTAFUZZ utilizes LLMs to parse API documentation and obtain standardized API information. Based on this standardized information, VISTAFUZZ extracts constraints on individual input parameters and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.