LLM-Guided Issue Generation from Uncovered Code Segments
Diany Pressato, Honghao Tan, Mariam Elmoazen, Shin Hwei Tan

TL;DR
IssueSpecter is an automated tool that leverages coverage analysis and large language models to identify uncovered bugs in code, generate actionable reports, and prioritize issues to assist developers in bug fixing.
Contribution
The paper introduces IssueSpecter, a novel system combining coverage analysis with LLMs to automatically find, report, and rank bugs in code, improving over existing methods.
Findings
84.6% of top-ranked issues are valid or worth investigation
LLM-based ranking outperforms rule-based ranking by 50% at P@3
IssueSpecter achieves higher bug validity rate than CoverUp (81.0% vs. 76.2%)
Abstract
Developers are increasingly overwhelmed by AI-generated issue reports that lack actionability and reproducibility, eroding trust in automated bug detection tools. In this paper, we present IssueSpecter, an automated tool that finds bugs in uncovered code segments and automatically generates prioritized, actionable issue reports. IssueSpecter combines coverage analysis with LLM-based defect identification, producing structured reports complete with severity ratings, reproduction steps, and suggested fixes. We evaluate IssueSpecter on 13 actively maintained Python projects, generating 10,467 issue reports. Manual annotation of the top-130 ranked issues by IssueSpecter confirms that 84.6% of the LLM-generated issues are valid or warrant further investigation, with only 15.4% false positives. LLM-based ranking outperforms rule-based ranking by 50% at P@3 and 41% in MRR. The identified bugs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
