Efficient Federated Learning against Byzantine Attacks and Data Heterogeneity via Aggregating Normalized Gradients
Shiyuan Zuo, Xingrun Yan, Rongfei Fan, Li Shen, Puning Zhao, Jie Xu, Han Hu

TL;DR
This paper introduces Fed-NGA, a simple and efficient federated learning algorithm that is robust against Byzantine attacks and data heterogeneity, with proven convergence and superior experimental performance.
Contribution
Fed-NGA performs gradient aggregation via normalized weighted means, reducing computational overhead and ensuring robustness and convergence in challenging federated learning scenarios.
Findings
Fed-NGA has a time complexity of O(pM), improving efficiency.
Fed-NGA is robust to Byzantine faults and data heterogeneity.
Experimental results show Fed-NGA outperforms existing methods in convergence and speed.
Abstract
Federated Learning (FL) enables multiple clients to collaboratively train models without sharing raw data, but is vulnerable to Byzantine attacks and data heterogeneity, which can severely degrade performance. Existing Byzantine-robust approaches tackle data heterogeneity, but incur high computational overhead during gradient aggregation, thereby slowing down the training process. To address this issue, we propose a simple yet effective Federated Normalized Gradients Algorithm (Fed-NGA), which performs aggregation by merely computing the weighted mean of the normalized gradients from each client. This approach yields a favorable time complexity of , where is the model dimension and is the number of clients. We rigorously prove that Fed-NGA is robust to both Byzantine faults and data heterogeneity. For non-convex loss functions, Fed-NGA achieves convergence to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Cryptography and Data Security
