Advocating for the Silent: Enhancing Federated Generalization for Non-Participating Clients
Zheshun Wu, Zenglin Xu, Dun Zeng, Qifan Wang, Jie Liu

TL;DR
This paper introduces an information-theoretic framework for federated learning that improves model generalization to non-participating clients by quantifying distribution discrepancies and proposing new aggregation and client selection strategies.
Contribution
It presents a novel generalization framework based on information entropy and introduces weighted aggregation and client selection methods to enhance performance on non-participating clients.
Findings
The proposed methods improve generalization to non-participating clients.
Empirical results validate the effectiveness of the theoretical bounds.
Strategies outperform baseline approaches in diverse data scenarios.
Abstract
Federated Learning (FL) has surged in prominence due to its capability of collaborative model training without direct data sharing. However, the vast disparity in local data distributions among clients, often termed the Non-Independent Identically Distributed (Non-IID) challenge, poses a significant hurdle to FL's generalization efficacy. The scenario becomes even more complex when not all clients participate in the training process, a common occurrence due to unstable network connections or limited computational capacities. This can greatly complicate the assessment of the trained models' generalization abilities. While a plethora of recent studies has centered on the generalization gap pertaining to unseen data from participating clients with diverse distributions, the distinction between the training distributions of participating clients and the testing distributions of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques
