Is Shapley Value fair? Improving Client Selection for Mavericks in Federated Learning
Jiyue Huang, Chi Hong, Lydia Y. Chen, Stefanie Roos

TL;DR
This paper identifies limitations of Shapley Value in federated learning, especially for Mavericks with unique data, and proposes FedEMD, a Wasserstein distance-based client selection method that enhances convergence speed and accuracy.
Contribution
The paper introduces FedEMD, an adaptive client selection strategy that improves federated learning convergence by effectively handling Mavericks, outperforming Shapley Value-based methods.
Findings
FedEMD improves neural network classifier convergence by at least 26.9%.
Shapley Value underestimates Mavericks' contributions.
FedEMD adapts selection to benefit rare class improvements.
Abstract
Shapley Value is commonly adopted to measure and incentivize client participation in federated learning. In this paper, we show -- theoretically and through simulations -- that Shapley Value underestimates the contribution of a common type of client: the Maverick. Mavericks are clients that differ both in data distribution and data quantity and can be the sole owners of certain types of data. Selecting the right clients at the right moment is important for federated learning to reduce convergence times and improve accuracy. We propose FedEMD, an adaptive client selection strategy based on the Wasserstein distance between the local and global data distributions. As FedEMD adapts the selection probability such that Mavericks are preferably selected when the model benefits from improvement on rare classes, it consistently ensures the fast convergence in the presence of different types of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Cryptography and Data Security
