Text Tells the Cost: Predicting and Analyzing Repayment Effort of Self-Admitted Technical Debt
Yikun Li, Mohamed Soliman, Paris Avgeriou, Jie Tan, Jiakun Liu

TL;DR
This paper introduces PRESTI, a novel method using textual analysis to predict the repayment effort of Self-Admitted Technical Debt, supported by a large dataset and advanced models like BERT.
Contribution
It provides the first comprehensive dataset and a new approach for automatically estimating SATD repayment effort from textual descriptions.
Findings
Different SATD types demand varying repayment efforts.
BERT- and TextCNN-based models outperform traditional methods.
Keywords can indicate the level of repayment effort.
Abstract
Technical debt refers to the consequences of sub-optimal decisions made during software development that prioritize short-term benefits over long-term maintainability. Self-Admitted Technical Debt (SATD) is a specific form of technical debt, explicitly documented by developers within software artifacts such as source code comments and commit messages. As SATD can hinder software development and maintenance, it is crucial to estimate the effort required to repay it so that we can effectively prioritize it. However, we currently lack an understanding of SATD repayment, and more importantly, we lack approaches that can automatically estimate the repayment effort of SATD based on its textual description. To bridge this gap, we have curated a comprehensive dataset of 341,740 SATD items from 2,568,728 commits across 1,060 Apache repositories and analyzed the repayment effort comparing SATD…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
