Characterizing and Mitigating False-Positive Bug Reports in the Linux Kernel
Jiashuo Tian, Dong Wang, Chen Yang, Haichi Wang, Zan Wang, Junjie Chen

TL;DR
This paper presents an empirical study of false-positive bug reports in the Linux kernel, highlighting their impact and evaluating large language models for automated mitigation.
Contribution
It constructs a dataset of false positives, analyzes their characteristics, and demonstrates the effectiveness of retrieval-augmented LLMs in mitigating them.
Findings
False positives demand effort comparable to genuine bugs.
They mainly occur in File Systems and Drivers due to external dependencies.
RAG-based LLMs achieve 91% recall and 88% F1 in mitigation.
Abstract
False-positive bug reports represent a significant yet underexplored challenge in the development and maintenance of the Linux kernel. They occur when correct system behavior is mistakenly flagged as a defect, consuming developer effort without leading to actual code improvements. Such reports can mislead developers, waste debugging resources, and delay the resolution of real bugs. In this paper, we present the first comprehensive empirical study of false-positive bug reports in the Linux kernel. We manually construct a dataset of 2,006 bug reports comprising 1,509 genuine bugs and 497 false positives collected from Bugzilla and Syzkaller. Our analysis indicates that false positives demand effort comparable to real bugs, often requiring extended discussions and non-trivial closure time. They occur in several components, especially File Systems and Drivers, mainly due to external…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
