The Impact of Data Distribution on Fairness and Robustness in Federated Learning
Mustafa Safa Ozdayi, Murat Kantarcioglu

TL;DR
This paper investigates how differences in local data distributions among agents in federated learning impact the fairness and robustness of the trained models, revealing that small distribution variations can significantly affect security and fairness.
Contribution
It provides empirical evidence that data distribution heterogeneity affects fairness and robustness in federated learning beyond just accuracy, highlighting critical considerations for deployment.
Findings
Models become more biased with diverse data distributions.
Robustness to attacks decreases as data heterogeneity increases.
Fairness degradation can be more severe than accuracy loss.
Abstract
Federated Learning (FL) is a distributed machine learning protocol that allows a set of agents to collaboratively train a model without sharing their datasets. This makes FL particularly suitable for settings where data privacy is desired. However, it has been observed that the performance of FL is closely related to the similarity of the local data distributions of agents. Particularly, as the data distributions of agents differ, the accuracy of the trained models drop. In this work, we look at how variations in local data distributions affect the fairness and the robustness properties of the trained models in addition to the accuracy. Our experimental results indicate that, the trained models exhibit higher bias, and become more susceptible to attacks as local data distributions differ. Importantly, the degradation in the fairness, and robustness can be much more severe than the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Ethics and Social Impacts of AI
