A Comprehensive Empirical Study of Bugs in Open-Source Federated Learning Frameworks
Weijie Shao, Yuyang Gao, Fu Song, Sen Chen, Lingling Fan, and JingZhu He

TL;DR
This paper provides a comprehensive empirical analysis of 1,119 bugs across 17 open-source federated learning frameworks, classifying their symptoms, causes, and fixes to improve framework reliability and guide future research.
Contribution
It is the first detailed empirical study of bugs in federated learning frameworks, offering a bug taxonomy and insights for better development and bug management.
Findings
Identified 12 bug symptoms, 12 root causes, and 18 fix patterns.
Analyzed bug distribution across 23 functionalities.
Provided insights for improving FL framework robustness.
Abstract
Federated learning (FL) is a distributed machine learning (ML) paradigm, allowing multiple clients to collaboratively train shared machine learning (ML) models without exposing clients' data privacy. It has gained substantial popularity in recent years, especially since the enforcement of data protection laws and regulations in many countries. To foster the application of FL, a variety of FL frameworks have been proposed, allowing non-experts to easily train ML models. As a result, understanding bugs in FL frameworks is critical for facilitating the development of better FL frameworks and potentially encouraging the development of bug detection, localization and repair tools. Thus, we conduct the first empirical study to comprehensively collect, taxonomize, and characterize bugs in FL frameworks. Specifically, we manually collect and classify 1,119 bugs from all the 676 closed issues…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Access Control and Trust · Internet Traffic Analysis and Secure E-voting
