A-FedPD: Aligning Dual-Drift is All Federated Primal-Dual Learning Needs
Yan Sun, Li Shen, and Dacheng Tao

TL;DR
This paper introduces A-FedPD, a federated learning method that addresses dual drift caused by client inactivity in non-convex scenarios, improving alignment and efficiency in collaborative training.
Contribution
The paper proposes a novel A-FedPD method that constructs virtual dual updates to mitigate dual drift caused by inactive clients in federated primal-dual learning.
Findings
A-FedPD effectively aligns global and local dual variables.
The method demonstrates high optimization and generalization efficiency.
Experimental results validate its practicality and superiority.
Abstract
As a popular paradigm for juggling data privacy and collaborative training, federated learning (FL) is flourishing to distributively process the large scale of heterogeneous datasets on edged clients. Due to bandwidth limitations and security considerations, it ingeniously splits the original problem into multiple subproblems to be solved in parallel, which empowers primal dual solutions to great application values in FL. In this paper, we review the recent development of classical federated primal dual methods and point out a serious common defect of such methods in non-convex scenarios, which we say is a "dual drift" caused by dual hysteresis of those longstanding inactive clients under partial participation training. To further address this problem, we propose a novel Aligned Federated Primal Dual (A-FedPD) method, which constructs virtual dual updates to align global consensus and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · DNA and Biological Computing · Algorithms and Data Compression
MethodsALIGN
