Towards Automatically Addressing Self-Admitted Technical Debt: How Far Are We?
Antonio Mastropaolo, Massimiliano Di Penta, Gabriele Bavota

TL;DR
This study evaluates the potential of neural-based generative models to automatically address Self-Admitted Technical Debt in software, revealing limited success and highlighting the importance of pre-training and input context.
Contribution
It provides an empirical assessment of various transformer-based models and LLMs for automating SATD removal, demonstrating the challenges and key factors influencing performance.
Findings
Best models fix 2-8% of SATD instances
Pre-training significantly boosts model performance
Removing SATD comments reduces effectiveness
Abstract
Upon evolving their software, organizations and individual developers have to spend a substantial effort to pay back technical debt, i.e., the fact that software is released in a shape not as good as it should be, e.g., in terms of functionality, reliability, or maintainability. This paper empirically investigates the extent to which technical debt can be automatically paid back by neural-based generative models, and in particular models exploiting different strategies for pre-training and fine-tuning. We start by extracting a dateset of 5,039 Self-Admitted Technical Debt (SATD) removals from 595 open-source projects. SATD refers to technical debt instances documented (e.g., via code comments) by developers. We use this dataset to experiment with seven different generative deep learning (DL) model configurations. Specifically, we compare transformers pre-trained and fine-tuned with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Open Source Software Innovations · Software Reliability and Analysis Research
