DRAG: Divergence-based Adaptive Aggregation in Federated learning on Non-IID Data
Feng Zhu, Jingjing Zhang, Shengyun Liu, Xin Wang

TL;DR
This paper introduces DRAG, a divergence-based adaptive aggregation method in federated learning that mitigates client-drift on non-IID data, achieves sublinear convergence, and enhances robustness against Byzantine attacks without extra communication overhead.
Contribution
The paper proposes a novel divergence metric and an adaptive aggregation algorithm that improves convergence and robustness in federated learning on heterogeneous data.
Findings
DRAG outperforms state-of-the-art algorithms in managing client-drift.
DRAG achieves a proven sublinear convergence rate.
DRAG enhances resilience against Byzantine attacks.
Abstract
Local stochastic gradient descent (SGD) is a fundamental approach in achieving communication efficiency in Federated Learning (FL) by allowing individual workers to perform local updates. However, the presence of heterogeneous data distributions across working nodes causes each worker to update its local model towards a local optimum, leading to the phenomenon known as ``client-drift" and resulting in slowed convergence. To address this issue, previous works have explored methods that either introduce communication overhead or suffer from unsteady performance. In this work, we introduce a novel metric called ``degree of divergence," quantifying the angle between the local gradient and the global reference direction. Leveraging this metric, we propose the divergence-based adaptive aggregation (DRAG) algorithm, which dynamically ``drags" the received local updates toward the reference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Traffic Prediction and Management Techniques
