Automated Detection of Algorithm Debt in Deep Learning Frameworks: An   Empirical Study

Emmanuel Iko-Ojo Simon; Chirath Hettiarachchi; Alex Potanin; Hanna; Suominen; Fatemeh Fard

arXiv:2408.10529·cs.SE·August 23, 2024·2 cites

Automated Detection of Algorithm Debt in Deep Learning Frameworks: An Empirical Study

Emmanuel Iko-Ojo Simon, Chirath Hettiarachchi, Alex Potanin, Hanna, Suominen, Fatemeh Fard

PDF

Open Access

TL;DR

This paper empirically investigates automated detection of Algorithm Debt in deep learning frameworks using various ML/DL models and feature extraction techniques to improve early identification of technical debt.

Contribution

It introduces an empirical study exploring multiple feature extraction methods and advanced language models for detecting Algorithm Debt in deep learning source code comments.

Findings

01

Feature enrichment improves detection accuracy

02

Language models like ROBERTA and ALBERTv2 enhance detection performance

03

Enriched datasets with AD-related terms aid in early debt identification

Abstract

Context: Previous studies demonstrate that Machine or Deep Learning (ML/DL) models can detect Technical Debt from source code comments called Self-Admitted Technical Debt (SATD). Despite the importance of ML/DL in software development, limited studies focus on automated detection for new SATD types: Algorithm Debt (AD). AD detection is important because it helps to identify TD early, facilitating research, learning, and preventing the accumulation of issues related to model degradation and lack of scalability. Aim: Our goal is to improve AD detection performance of various ML/DL models. Method: We will perform empirical studies using approaches: TF-IDF, Count Vectorizer, Hash Vectorizer, and TD-indicative words to identify features that improve AD detection, using ML/DL classifiers with different data featurisations. We will use an existing dataset curated from seven DL frameworks where…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)