FedCSD: A Federated Learning Based Approach for Code-Smell Detection
Sadi Alawadi, Khalid Alkharabsheh, Fahed Alkhabbas, Victor Kebande,, Feras M. Awaysheh, Fabio Palomba, Mohammed Awad

TL;DR
FedCSD introduces a federated learning approach for code smell detection, enabling collaborative model training across organizations while maintaining data privacy, with experiments showing high accuracy and robustness across different scenarios.
Contribution
This paper presents the first federated learning framework for code smell detection, addressing data privacy and technical debt issues in collaborative software quality assessment.
Findings
High accuracy achieved in centralized training (up to 99.5%)
Significant accuracy drop in cross-evaluation with different datasets
Global federated model maintains high accuracy (98.34%) across multiple organizations
Abstract
This paper proposes a Federated Learning Code Smell Detection (FedCSD) approach that allows organizations to collaboratively train federated ML models while preserving their data privacy. These assertions have been supported by three experiments that have significantly leveraged three manually validated datasets aimed at detecting and examining different code smell scenarios. In experiment 1, which was concerned with a centralized training experiment, dataset two achieved the lowest accuracy (92.30%) with fewer smells, while datasets one and three achieved the highest accuracy with a slight difference (98.90% and 99.5%, respectively). This was followed by experiment 2, which was concerned with cross-evaluation, where each ML model was trained using one dataset, which was then evaluated over the other two datasets. Results from this experiment show a significant drop in the model's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research
