Stabilizing Federated Learning under Extreme Heterogeneity with HeteRo-Select
Md. Akmol Masud, Md Abrar Jahin, Mahmud Hasan

TL;DR
This paper introduces HeteRo-Select, a client selection framework for federated learning that enhances stability and performance under extreme data heterogeneity by leveraging a theoretical scoring system and convergence guarantees.
Contribution
HeteRo-Select is a novel client selection method that improves long-term stability and accuracy in federated learning with highly heterogeneous data, supported by theoretical analysis and empirical validation.
Findings
HeteRo-Select outperforms existing methods in peak and final accuracy.
HeteRo-Select maintains higher training stability with minimal accuracy drops.
Theoretical analysis confirms reduced communication and convergence guarantees.
Abstract
Federated Learning (FL) is a machine learning technique that often suffers from training instability due to the diverse nature of client data. Although utility-based client selection methods like Oort are used to converge by prioritizing high-loss clients, they frequently experience significant drops in accuracy during later stages of training. We propose a theoretical HeteRo-Select framework designed to maintain high performance and ensure long-term training stability. We provide a theoretical analysis showing that when client data is very different (high heterogeneity), choosing a smart subset of client participation can reduce communication more effectively compared to full participation. Our HeteRo-Select method uses a clear, step-by-step scoring system that considers client usefulness, fairness, update speed, and data variety. It also shows convergence guarantees under strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Big Data and Digital Economy · Data Quality and Management
