FedEval: A Holistic Evaluation Framework for Federated Learning
Di Chai, Leye Wang, Liu Yang, Junxue Zhang, Kai Chen, and Qiang Yang

TL;DR
This paper introduces FedEval, a comprehensive evaluation framework for federated learning that standardizes metrics and settings, enabling fair comparison of seven state-of-the-art algorithms across privacy, robustness, effectiveness, and efficiency.
Contribution
The paper proposes a holistic evaluation framework and platform for federated learning, addressing inconsistencies in prior evaluations and providing a detailed benchmarking study.
Findings
Identified strengths and weaknesses of seven FL algorithms.
Provided practical recommendations for algorithm selection.
Suggested future research directions in FL evaluation.
Abstract
Federated Learning (FL) has been widely accepted as the solution for privacy-preserving machine learning without collecting raw data. While new technologies proposed in the past few years do evolve the FL area, unfortunately, the evaluation results presented in these works fall short in integrity and are hardly comparable because of the inconsistent evaluation metrics and experimental settings. In this paper, we propose a holistic evaluation framework for FL called FedEval, and present a benchmarking study on seven state-of-the-art FL algorithms. Specifically, we first introduce the core evaluation taxonomy model, called FedEval-Core, which covers four essential evaluation aspects for FL: Privacy, Robustness, Effectiveness, and Efficiency, with various well-defined metrics and experimental settings. Based on the FedEval-Core, we further develop an FL evaluation platform with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Internet Traffic Analysis and Secure E-voting
