FeDa4Fair: Client-Level Federated Datasets for Fairness Evaluation
Xenia Heilmann, Luca Corbucci, Mattia Cerrato, Anna Monreale

TL;DR
FeDa4Fair is a benchmarking framework that creates datasets to evaluate fairness in federated learning under heterogeneous client biases, addressing limitations of existing fairness solutions.
Contribution
It introduces a library for generating tailored datasets, a benchmark suite for evaluation, and functions for assessing fairness outcomes in federated learning.
Findings
First benchmark for fairness in heterogeneous FL scenarios.
Supports evaluation of fairness methods under conflicting client biases.
Provides standardized datasets and evaluation tools.
Abstract
Federated Learning (FL) enables collaborative training while preserving privacy, yet it introduces a critical challenge: the "illusion of fairness''. A global model, usually evaluated on the server, appears fair on average while keeping persistent discrimination at the client level. Current fairness-enhancing FL solutions often fall short, as they typically mitigate biases for a single, usually binary, sensitive attribute, while ignoring two realistic and conflicting scenarios: attribute-bias (where clients are unfair toward different sensitive attributes) and value-bias (where clients exhibit conflicting biases toward different values of the same attribute). To support more robust and reproducible fairness research in FL, we introduce FeDa4Fair, the first benchmarking framework designed to stress-test fairness methods under these heterogeneous conditions. Our contributions are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
