The Impact of Data Distribution on Fairness and Robustness in Federated   Learning

Mustafa Safa Ozdayi; Murat Kantarcioglu

arXiv:2112.01274·cs.LG·December 3, 2021

The Impact of Data Distribution on Fairness and Robustness in Federated Learning

Mustafa Safa Ozdayi, Murat Kantarcioglu

PDF

Open Access 1 Repo

TL;DR

This paper investigates how differences in local data distributions among agents in federated learning impact the fairness and robustness of the trained models, revealing that small distribution variations can significantly affect security and fairness.

Contribution

It provides empirical evidence that data distribution heterogeneity affects fairness and robustness in federated learning beyond just accuracy, highlighting critical considerations for deployment.

Findings

01

Models become more biased with diverse data distributions.

02

Robustness to attacks decreases as data heterogeneity increases.

03

Fairness degradation can be more severe than accuracy loss.

Abstract

Federated Learning (FL) is a distributed machine learning protocol that allows a set of agents to collaboratively train a model without sharing their datasets. This makes FL particularly suitable for settings where data privacy is desired. However, it has been observed that the performance of FL is closely related to the similarity of the local data distributions of agents. Particularly, as the data distributions of agents differ, the accuracy of the trained models drop. In this work, we look at how variations in local data distributions affect the fairness and the robustness properties of the trained models in addition to the accuracy. Our experimental results indicate that, the trained models exhibit higher bias, and become more susceptible to attacks as local data distributions differ. Importantly, the degradation in the fairness, and robustness can be much more severe than the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tinfoilhat0/fairness-robustness-in-fedlearning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Ethics and Social Impacts of AI