FedSM: Robust Semantics-Guided Feature Mixup for Bias Reduction in Federated Learning with Long-Tail Data

Jingrui Zhang; Yimeng Xu; Shujie Li; Feng Liang; Haihan Duan; Yanjie Dong; Victor C. M. Leung; Xiping Hu

arXiv:2510.27240·cs.LG·January 8, 2026

FedSM: Robust Semantics-Guided Feature Mixup for Bias Reduction in Federated Learning with Long-Tail Data

Jingrui Zhang, Yimeng Xu, Shujie Li, Feng Liang, Haihan Duan, Yanjie Dong, Victor C. M. Leung, Xiping Hu

PDF

TL;DR

FedSM is a client-centric federated learning framework that uses semantics-guided feature mixup and classifier retraining to reduce bias caused by long-tail data distributions, improving accuracy and robustness.

Contribution

Introduces FedSM, a novel semantics-guided feature mixup approach with probabilistic category selection to mitigate bias in federated learning with long-tail data.

Findings

01

Outperforms state-of-the-art methods in accuracy on long-tail datasets.

02

Demonstrates high robustness to domain shift.

03

Requires minimal server overhead and computational resources.

Abstract

Federated Learning (FL) enables collaborative model training across decentralized clients without sharing private data. However, FL suffers from biased global models due to non-IID and long-tail data distributions. We propose \textbf{FedSM}, a novel client-centric framework that mitigates this bias through semantics-guided feature mixup and lightweight classifier retraining. FedSM uses a pretrained image-text-aligned model to compute category-level semantic relevance, guiding the category selection of local features to mix-up with global prototypes to generate class-consistent pseudo-features. These features correct classifier bias, especially when data are heavily skewed. To address the concern of potential domain shift between the pretrained model and the data, we propose probabilistic category selection, enhancing feature diversity to effectively mitigate biases. All computations are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.