Resilient Federated Learning under Byzantine Attack in Distributed Nonconvex Optimization with 2-f Redundancy
Amit Dutta, Thinh T. Doan, Jeffrey H. Reed

TL;DR
This paper investigates the robustness of federated learning against Byzantine attacks in a nonconvex optimization context, proposing a filter-based method to ensure convergence despite malicious agents.
Contribution
It extends the theoretical analysis of the CE filter to nonconvex settings, demonstrating convergence of federated learning under Byzantine faults.
Findings
The CE filter effectively mitigates Byzantine attacks in nonconvex federated learning.
The proposed method guarantees convergence to a stationary point.
Numerical simulations support the theoretical convergence results.
Abstract
We study the problem of Byzantine fault tolerance in a distributed optimization setting, where there is a group of agents communicating with a trusted centralized coordinator. Among these agents, there is a subset of agents that may not follow a prescribed algorithm and may share arbitrarily incorrect information with the coordinator. The goal is to find the optimizer of the aggregate cost functions of the honest agents. We will be interested in studying the local gradient descent method, also known as federated learning, to solve this problem. However, this method often returns an approximate value of the underlying optimal solution in the Byzantine setting. Recent work showed that by incorporating the so-called comparative elimination (CE) filter at the coordinator, one can provably mitigate the detrimental impact of Byzantine agents and precisely compute the true optimizer in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Nanocluster Synthesis and Applications
