LineBreaker: Finding Token-Inconsistency Bugs with Large Language Models
Hongbo Chen, Yifan Zhang, Xing Han, Tianhao Mao, Huanyao Rong, Yuheng Zhang, XiaoFeng Wang, Luyi Xing, Xun Chen, Hang Zhang

TL;DR
This paper introduces LineBreaker, a cascaded detection system that leverages large language models and smaller, efficient models to identify token-inconsistency bugs in code, improving detection precision, recall, and scalability.
Contribution
The paper presents a novel cascaded approach combining LLMs and smaller models for effective token-inconsistency bug detection, addressing limitations of existing methods.
Findings
GPT-4 shows promise but has limitations in precision and scalability.
LineBreaker uncovers 123 new bugs in real-world repositories.
41 out of 69 proposed fixes have been confirmed or merged.
Abstract
Token-inconsistency bugs (TIBs) involve the misuse of syntactically valid yet incorrect code tokens, such as misused variables and erroneous function invocations, which can often lead to software bugs. Unlike simple syntactic bugs, TIBs occur at the semantic level and are subtle - sometimes they remain undetected for years. Traditional detection methods, such as static analysis and dynamic testing, often struggle with TIBs due to their versatile and context-dependent nature. However, advancements in large language models (LLMs) like GPT-4 present new opportunities for automating TIB detection by leveraging these models' semantic understanding capabilities. This paper reports the first systematic measurement of LLMs' capabilities in detecting TIBs, revealing that while GPT-4 shows promise, it exhibits limitations in precision and scalability. Specifically, its detection capability is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Topic Modeling
