Learning From Developers: Towards Reliable Patch Validation at Scale for Linux
Chih-En Lin, Attreyee Mukherjee, Ajay Rawat, Ruqi Zhang, Pedro Fonseca

TL;DR
This paper introduces FLINT, a system that leverages developer discussions and large language models to automate and improve patch validation for Linux, addressing scalability issues in patch review processes.
Contribution
The paper presents FLINT, a novel patch validation framework that combines rule-based analysis and LLMs to enhance scalability and accuracy in Linux patch reviews.
Findings
FLINT detected 2 new issues in Linux v6.18 cycle.
FLINT achieved 21% higher coverage on concurrency bugs.
FLINT reduced false positives to 35%.
Abstract
Patch reviewing is critical for software development, especially in distributed open-source development, which highly depends on voluntary work, such as Linux. This paper studies the past 10 years of patch reviews of the Linux memory management subsystem to characterize the challenges involved in patch reviewing at scale. Our study reveals that the review process is still primarily reliant on human effort despite a wide-range of automatic checking tools. Although kernel developers strive to review all patch proposals, they struggle to keep up with the increasing volume of submissions and depend significantly on a few developers for these reviews. To help scale the patch review process, we introduce FLINT, a patch validation system framework that synthesizes insights from past discussions among developers and automatically analyzes patch proposals for compliance. FLINT employs a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Software System Performance and Reliability
