Large Language Models Based JSON Parser Fuzzing for Bug Discovery and   Behavioral Analysis

Zhiyuan Zhong; Zhezhen Cao; Zhanwei Zhang

arXiv:2410.21806·cs.SE·October 31, 2024

Large Language Models Based JSON Parser Fuzzing for Bug Discovery and Behavioral Analysis

Zhiyuan Zhong, Zhezhen Cao, Zhanwei Zhang

PDF

Open Access

TL;DR

This paper explores using Large Language Models to generate test cases and mutants for JSON parsers, aiming to discover bugs and analyze behavioral differences among open-source implementations.

Contribution

It introduces a novel approach leveraging LLMs for automated JSON parser fuzzing and behavioral analysis, which has not been extensively studied before.

Findings

01

LLMs can generate effective test cases for JSON parsers

02

Behavioral diversities among parsers can be identified using LLM-generated mutants

03

Potential bugs in open-source JSON parsers can be uncovered

Abstract

Fuzzing has been incredibly successful in uncovering bugs and vulnerabilities across diverse software systems. JSON parsers play a vital role in modern software development, and ensuring their reliability is of great importance. This research project focuses on leveraging Large Language Models (LLMs) to enhance JSON parser testing. The primary objectives are to generate test cases and mutants using LLMs for the discovery of potential bugs in open-source JSON parsers and the identification of behavioral diversities among them. We aim to uncover underlying bugs, plus discovering (and overcoming) behavioral diversities.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWeb Data Mining and Analysis · Software Testing and Debugging Techniques · Advanced Malware Detection Techniques