DFedADMM: Dual Constraints Controlled Model Inconsistency for Decentralized Federated Learning
Qinglun Li, Li Shen, Guanghao Li, Quanjun Yin, Dacheng Tao

TL;DR
This paper introduces DFedADMM and DFedADMM-SAM algorithms for decentralized federated learning, addressing local inconsistency and overfitting issues, with theoretical convergence guarantees and superior empirical performance on standard datasets.
Contribution
The paper proposes novel ADMM-based algorithms with gradient perturbation for DFL, providing convergence analysis and demonstrating improved results over state-of-the-art methods.
Findings
Superior generalization performance on MNIST, CIFAR10, CIFAR100.
Faster convergence compared to existing DFL optimizers.
Theoretical convergence rates established for non-convex settings.
Abstract
To address the communication burden issues associated with federated learning (FL), decentralized federated learning (DFL) discards the central server and establishes a decentralized communication network, where each client communicates only with neighboring clients. However, existing DFL methods still suffer from two major challenges: local inconsistency and local heterogeneous overfitting, which have not been fundamentally addressed by existing DFL methods. To tackle these issues, we propose novel DFL algorithms, DFedADMM and its enhanced version DFedADMM-SAM, to enhance the performance of DFL. The DFedADMM algorithm employs primal-dual optimization (ADMM) by utilizing dual variables to control the model inconsistency raised from the decentralized heterogeneous data distributions. The DFedADMM-SAM algorithm further improves on DFedADMM by employing a Sharpness-Aware Minimization (SAM)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Internet Traffic Analysis and Secure E-voting
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Sharpness-Aware Minimization
