Large Language Model assisted Hybrid Fuzzing

Ruijie Meng; Gregory J. Duck; Abhik Roychoudhury

arXiv:2412.15931·cs.SE·May 6, 2026

Large Language Model assisted Hybrid Fuzzing

Ruijie Meng, Gregory J. Duck, Abhik Roychoudhury

PDF

TL;DR

This paper introduces HyllFuzz, a hybrid fuzzing approach that leverages Large Language Models to improve code coverage and bug detection efficiency in software testing.

Contribution

It develops an LLM-empowered concolic execution method that enhances hybrid fuzzing by generating inputs to reach complex program branches more effectively.

Findings

01

HyllFuzz covers 31.43% to 59.48% more code branches than state-of-the-art tools.

02

LLM-based concolic execution is 3 to 19 times faster than traditional methods.

03

HyllFuzz discovered seven previously unknown bugs in real-world software.

Abstract

Greybox fuzzing is one of the most popular methods for detecting software vulnerabilities, which conducts a biased random search within the program input space. To enhance its effectiveness in achieving deep coverage of program behaviors, greybox fuzzing is often combined with concolic execution, which performs a path-sensitive search over the domain of program inputs. In hybrid fuzzing, conventional greybox fuzzing is followed by concolic execution in an iterative loop, where reachability roadblocks encountered by greybox fuzzing are tackled by concolic execution. However, such hybrid fuzzing still suffers from difficulties conventionally faced by concolic execution, such as the need for environment modeling and system call support. In this work, we explore the potential of developing "smart" concolic execution empowered by Large Language Models (LLMs), leveraging their knowledge of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.