Resilient Two-Time-Scale Local Stochastic Gradient Descent for Byzantine   Federated Learning

Amit Dutta; Thinh T. Doan

arXiv:2409.03092·math.OC·September 6, 2024

Resilient Two-Time-Scale Local Stochastic Gradient Descent for Byzantine Federated Learning

Amit Dutta, Thinh T. Doan

PDF

Open Access

TL;DR

This paper introduces a new two-time-scale local stochastic gradient descent method for Byzantine federated learning, achieving exact convergence to the optimal solution at an optimal rate despite malicious agents.

Contribution

It proposes a novel variant of local SGD that guarantees exact convergence in Byzantine settings, addressing a key open problem in federated optimization.

Findings

01

Converges exactly to the optimal solution under Byzantine attacks.

02

Achieves an optimal convergence rate of O(1/k) in both convex and non-convex cases.

03

Validated through simulations confirming theoretical results.

Abstract

We study local stochastic gradient descent methods for solving federated optimization over a network of agents communicating indirectly through a centralized coordinator. We are interested in the Byzantine setting where there is a subset of $f$ malicious agents that could observe the entire network and send arbitrary values to the coordinator to disrupt the performance of other non-faulty agents. The objective of the non-faulty agents is to collaboratively compute the optimizer of their respective local functions under the presence of Byzantine agents. In this setting, prior works show that the local stochastic gradient descent method can only return an approximate of the desired solutions due to the impacts of Byzantine agents. Whether this method can find an exact solution remains an open question. In this paper, we will address this open question by proposing a new variant of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Machine Learning and ELM