TL;DR
FedHPro introduces hyper-prototypes optimized via gradient matching to improve semantic consistency and performance in federated learning, addressing issues of semantic drift and misalignment across clients.
Contribution
The paper proposes hyper-prototypes and a federated learning framework, FedHPro, which enhance semantic alignment and achieve state-of-the-art results in heterogeneous scenarios.
Findings
Hyper-prototypes provide a more semantically consistent global signal.
FedHPro outperforms existing methods on benchmark datasets.
Gradient matching effectively aligns hyper-prototypes with client data.
Abstract
Federated Learning (FL) enables collaborative training of distributed clients while protecting privacy. To enhance generalization capability in FL, prototype-based FL is in the spotlight, since shared global prototypes offer semantic anchors for aligning client-specific local prototypes. However, existing methods update global prototypes at the prototype-level via averaging local prototypes or refining global anchors, which often leads to semantic drift across clients and subsequently yields a misaligned global signal. To alleviate this issue, we introduce hyper-prototypes, defined by a set of learnable global class-wise prototypes to preserve underlying semantic knowledge across clients. The hyper-prototypes are optimized via gradient matching to align with class-relevant characteristics distilled directly from clients' real samples, rather than prototype-level descriptors. We further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
