Automated Detection of Algorithm Debt in Deep Learning Frameworks: An Empirical Study
Emmanuel Iko-Ojo Simon, Chirath Hettiarachchi, Alex Potanin, Hanna, Suominen, Fatemeh Fard

TL;DR
This paper empirically investigates automated detection of Algorithm Debt in deep learning frameworks using various ML/DL models and feature extraction techniques to improve early identification of technical debt.
Contribution
It introduces an empirical study exploring multiple feature extraction methods and advanced language models for detecting Algorithm Debt in deep learning source code comments.
Findings
Feature enrichment improves detection accuracy
Language models like ROBERTA and ALBERTv2 enhance detection performance
Enriched datasets with AD-related terms aid in early debt identification
Abstract
Context: Previous studies demonstrate that Machine or Deep Learning (ML/DL) models can detect Technical Debt from source code comments called Self-Admitted Technical Debt (SATD). Despite the importance of ML/DL in software development, limited studies focus on automated detection for new SATD types: Algorithm Debt (AD). AD detection is important because it helps to identify TD early, facilitating research, learning, and preventing the accumulation of issues related to model degradation and lack of scalability. Aim: Our goal is to improve AD detection performance of various ML/DL models. Method: We will perform empirical studies using approaches: TF-IDF, Count Vectorizer, Hash Vectorizer, and TD-indicative words to identify features that improve AD detection, using ML/DL classifiers with different data featurisations. We will use an existing dataset curated from seven DL frameworks where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
