FedProto: Federated Prototype Learning across Heterogeneous Clients
Yue Tan, Guodong Long, Lu Liu, Tianyi Zhou, Qinghua Lu, Jing Jiang,, and Chengqi Zhang

TL;DR
FedProto introduces a federated learning framework that uses class prototypes instead of gradients to better handle client heterogeneity, improving convergence and accuracy across diverse data distributions.
Contribution
The paper proposes FedProto, a novel prototype-based federated learning method that enhances robustness to heterogeneity and provides theoretical convergence guarantees.
Findings
FedProto outperforms recent FL methods on multiple datasets.
It effectively handles data distribution and model heterogeneity.
Theoretical analysis confirms convergence under non-convex objectives.
Abstract
Heterogeneity across clients in federated learning (FL) usually hinders the optimization convergence and generalization performance when the aggregation of clients' knowledge occurs in the gradient space. For example, clients may differ in terms of data distribution, network latency, input/output space, and/or model architecture, which can easily lead to the misalignment of their local gradients. To improve the tolerance to heterogeneity, we propose a novel federated prototype learning (FedProto) framework in which the clients and server communicate the abstract class prototypes instead of the gradients. FedProto aggregates the local prototypes collected from different clients, and then sends the global prototypes back to all clients to regularize the training of local models. The training on each client aims to minimize the classification error on the local data while keeping the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
