Resilient Two-Time-Scale Local Stochastic Gradient Descent for Byzantine Federated Learning
Amit Dutta, Thinh T. Doan

TL;DR
This paper introduces a new two-time-scale local stochastic gradient descent method for Byzantine federated learning, achieving exact convergence to the optimal solution at an optimal rate despite malicious agents.
Contribution
It proposes a novel variant of local SGD that guarantees exact convergence in Byzantine settings, addressing a key open problem in federated optimization.
Findings
Converges exactly to the optimal solution under Byzantine attacks.
Achieves an optimal convergence rate of O(1/k) in both convex and non-convex cases.
Validated through simulations confirming theoretical results.
Abstract
We study local stochastic gradient descent methods for solving federated optimization over a network of agents communicating indirectly through a centralized coordinator. We are interested in the Byzantine setting where there is a subset of malicious agents that could observe the entire network and send arbitrary values to the coordinator to disrupt the performance of other non-faulty agents. The objective of the non-faulty agents is to collaboratively compute the optimizer of their respective local functions under the presence of Byzantine agents. In this setting, prior works show that the local stochastic gradient descent method can only return an approximate of the desired solutions due to the impacts of Byzantine agents. Whether this method can find an exact solution remains an open question. In this paper, we will address this open question by proposing a new variant of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Machine Learning and ELM
