Understanding the Effectiveness of LLMs in Automated Self-Admitted Technical Debt Repayment
Mohammad Sadegh Sheikhaei, Yuan Tian, Shaowei Wang, Bowen Xu

TL;DR
This paper develops large-scale datasets and new evaluation metrics to assess the effectiveness of LLMs in automatically repaying Self-Admitted Technical Debt in code, providing a comprehensive benchmark and analysis.
Contribution
It introduces large-scale, language-independent datasets for SATD repayment, new diff-based and interpretable evaluation metrics, and a thorough evaluation of various LLM approaches.
Findings
Fine-tuned small models achieve high Exact Match scores.
Large models like Llama-3.1-70B-Instruct excel in BLEU and LEMOD metrics.
Gemma-2-9B addresses over 10% of Python SATDs.
Abstract
Self-Admitted Technical Debt (SATD), cases where developers intentionally acknowledge suboptimal solutions in code through comments, poses a significant challenge to software maintainability. Left unresolved, SATD can degrade code quality and increase maintenance costs. While Large Language Models (LLMs) have shown promise in tasks like code generation and program repair, their potential in automated SATD repayment remains underexplored. In this paper, we identify three key challenges in training and evaluating LLMs for SATD repayment: (1) dataset representativeness and scalability, (2) removal of irrelevant SATD repayments, and (3) limitations of existing evaluation metrics. To address the first two dataset-related challenges, we adopt a language-independent SATD tracing tool and design a 10-step filtering pipeline to extract SATD repayments from repositories, resulting two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinTech, Crowdfunding, Digital Finance · Private Equity and Venture Capital
