Advocating for the Silent: Enhancing Federated Generalization for   Non-Participating Clients

Zheshun Wu; Zenglin Xu; Dun Zeng; Qifan Wang; Jie Liu

arXiv:2310.07171·cs.LG·December 12, 2024·1 cites

Advocating for the Silent: Enhancing Federated Generalization for Non-Participating Clients

Zheshun Wu, Zenglin Xu, Dun Zeng, Qifan Wang, Jie Liu

PDF

Open Access

TL;DR

This paper introduces an information-theoretic framework for federated learning that improves model generalization to non-participating clients by quantifying distribution discrepancies and proposing new aggregation and client selection strategies.

Contribution

It presents a novel generalization framework based on information entropy and introduces weighted aggregation and client selection methods to enhance performance on non-participating clients.

Findings

01

The proposed methods improve generalization to non-participating clients.

02

Empirical results validate the effectiveness of the theoretical bounds.

03

Strategies outperform baseline approaches in diverse data scenarios.

Abstract

Federated Learning (FL) has surged in prominence due to its capability of collaborative model training without direct data sharing. However, the vast disparity in local data distributions among clients, often termed the Non-Independent Identically Distributed (Non-IID) challenge, poses a significant hurdle to FL's generalization efficacy. The scenario becomes even more complex when not all clients participate in the training process, a common occurrence due to unstable network connections or limited computational capacities. This can greatly complicate the assessment of the trained models' generalization abilities. While a plethora of recent studies has centered on the generalization gap pertaining to unseen data from participating clients with diverse distributions, the distinction between the training distributions of participating clients and the testing distributions of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques