FedHPro: Federated Hyper-Prototype Learning via Gradient Matching

Huan Wang; Jun Shen; Haoran Li; Zhenyu Yang; Jun Yan; Ousman Manjang; Yanlong Zhai; Di Wu; Guansong Pang

arXiv:2605.13475·cs.CV·May 21, 2026

FedHPro: Federated Hyper-Prototype Learning via Gradient Matching

Huan Wang, Jun Shen, Haoran Li, Zhenyu Yang, Jun Yan, Ousman Manjang, Yanlong Zhai, Di Wu, Guansong Pang

PDF

1 Repo

TL;DR

FedHPro introduces hyper-prototypes optimized via gradient matching to improve semantic consistency and performance in federated learning, addressing issues of semantic drift and misalignment across clients.

Contribution

The paper proposes hyper-prototypes and a federated learning framework, FedHPro, which enhance semantic alignment and achieve state-of-the-art results in heterogeneous scenarios.

Findings

01

Hyper-prototypes provide a more semantically consistent global signal.

02

FedHPro outperforms existing methods on benchmark datasets.

03

Gradient matching effectively aligns hyper-prototypes with client data.

Abstract

Federated Learning (FL) enables collaborative training of distributed clients while protecting privacy. To enhance generalization capability in FL, prototype-based FL is in the spotlight, since shared global prototypes offer semantic anchors for aligning client-specific local prototypes. However, existing methods update global prototypes at the prototype-level via averaging local prototypes or refining global anchors, which often leads to semantic drift across clients and subsequently yields a misaligned global signal. To alleviate this issue, we introduce hyper-prototypes, defined by a set of learnable global class-wise prototypes to preserve underlying semantic knowledge across clients. The hyper-prototypes are optimized via gradient matching to align with class-relevant characteristics distilled directly from clients' real samples, rather than prototype-level descriptors. We further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mala-lab/FedHPro
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.