How to Collaborate: Towards Maximizing the Generalization Performance in Cross-Silo Federated Learning
Yuchang Sun, Marios Kountouris, Jun Zhang

TL;DR
This paper investigates how collaboration among clients in cross-silo federated learning can enhance individual model generalization, proposing a hierarchical clustering scheme that adapts to data heterogeneity and improves performance.
Contribution
It derives a generalization bound for clients, formulates a utility maximization problem, and introduces HCCT, a hierarchical clustering-based training scheme that adapts to data similarity without predefining group numbers.
Findings
HCCT improves generalization over baseline schemes.
Collaboration benefits depend on data similarity and size.
HCCT degenerates to independent training in certain scenarios.
Abstract
Federated learning (FL) has attracted vivid attention as a privacy-preserving distributed learning framework. In this work, we focus on cross-silo FL, where clients become the model owners after training and are only concerned about the model's generalization performance on their local data. Due to the data heterogeneity issue, asking all the clients to join a single FL training process may result in model performance degradation. To investigate the effectiveness of collaboration, we first derive a generalization bound for each client when collaborating with others or when training independently. We show that the generalization performance of a client can be improved only by collaborating with other clients that have more training data and similar data distribution. Our analysis allows us to formulate a client utility maximization problem by partitioning clients into multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Statistical Methods and Inference
MethodsFocus
