TL;DR
This paper introduces TDCleaner, a neural network-based model that automatically detects obsolete TODO comments in software code to improve code quality and maintainability.
Contribution
The paper presents a novel neural model that leverages code changes, comments, and commit messages to identify obsolete TODO comments, with extensive evaluation on real-world datasets.
Findings
TDCleaner achieves promising performance on benchmark datasets.
18 obsolete TODO comments identified in real-world projects, with 9 confirmed and removed by developers.
The approach effectively assists developers in maintaining cleaner code comments.
Abstract
TODO comments are very widely used by software developers to describe their pending tasks during software development. However, after performing the task developers sometimes neglect or simply forget to remove the TODO comment, resulting in obsolete TODO comments. These obsolete TODO comments can confuse development teams and may cause the introduction of bugs in the future, decreasing the software's quality and maintainability. In this work, we propose a novel model, named TDCleaner (TODO comment Cleaner), to identify obsolete TODO comments in software projects. TDCleaner can assist developers in just-in-time checking of TODO comments status and avoid leaving obsolete TODO comments. Our approach has two main stages: offline learning and online prediction. During offline learning, we first automatically establish <code_change, todo_comment, commit_msg> training samples and leverage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
