Extremal Testing for Network Software using LLMs
Rathin Singha, Harry Qian, Srinath Saikrishnan, Tracy Zhao, Ryan Beckett, Siva Kesava Reddy Kakarla, and George Varghese

TL;DR
This paper introduces a method to automate extremal testing of network software using large language models, generating input constraints and tests to uncover bugs in protocols like HTTP, BGP, and DNS.
Contribution
It presents a novel two-step approach leveraging LLMs for extremal testing of network software, extending traditional boundary value analysis techniques.
Findings
Uncovered new bugs in HTTP, BGP, and DNS implementations.
Demonstrated extension to centralized network algorithms like shortest path.
Showed how LLMs can generate filtering code for extremal inputs.
Abstract
Physicists often manually consider extreme cases when testing a theory. In this paper, we show how to automate extremal testing of network software using LLMs in two steps: first, ask the LLM to generate input constraints (e.g., DNS name length limits); then ask the LLM to generate tests that violate the constraints. We demonstrate how easy this process is by generating extremal tests for HTTP, BGP and DNS implementations, each of which uncovered new bugs. We show how this methodology extends to centralized network software such as shortest path algorithms, and how LLMs can generate filtering code to reject extremal input. We propose using agentic AI to further automate extremal testing. LLM-generated extremal testing goes beyond an old technique in software testing called Boundary Value Analysis.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVLSI and Analog Circuit Testing · Software Testing and Debugging Techniques · Real-time simulation and control systems
