Benchmarking Data Heterogeneity Evaluation Approaches for Personalized Federated Learning
Zhilong Li, Xiaohu Wu, Xiaoli Tang, Tiantian He, Yew-Soon Ong,, Mengmeng Chen, Qiqi Liu, Qicheng Lao, Han Yu

TL;DR
This paper introduces a comprehensive benchmarking framework for evaluating data heterogeneity measures in personalized federated learning, enabling fair comparison and guiding the selection of appropriate approaches across different non-IID data scenarios.
Contribution
It provides the first unified benchmark with six approaches and extensive experiments, offering insights into their effectiveness in various federated learning settings.
Findings
Certain approaches perform better in specific non-IID scenarios
The framework guides the choice of heterogeneity measures for FL applications
Insights help improve fairness and personalization in federated learning
Abstract
There is growing research interest in measuring the statistical heterogeneity of clients' local datasets. Such measurements are used to estimate the suitability for collaborative training of personalized federated learning (PFL) models. Currently, these research endeavors are taking place in silos and there is a lack of a unified benchmark to provide a fair and convenient comparison among various approaches in common settings. We aim to bridge this important gap in this paper. The proposed benchmarking framework currently includes six representative approaches. Extensive experiments have been conducted to compare these approaches under five standard non-IID FL settings, providing much needed insights into which approaches are advantageous under which settings. The proposed framework offers useful guidance on the suitability of various data divergence measures in FL systems. It is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Data Quality and Management
