Aiding Global Convergence in Federated Learning via Local Perturbation and Mutual Similarity Information
Emanuel Buttaci, Giuseppe Carlo Calafiore

TL;DR
This paper introduces a federated learning method that uses local perturbations and client similarity information to accelerate convergence and improve generalization, especially in heterogeneous environments.
Contribution
It proposes a novel federated optimization framework leveraging client similarity graphs and local perturbations, with theoretical convergence guarantees and empirical validation.
Findings
Achieves up to 30 fewer global rounds for convergence compared to FedAvg.
Provides theoretical proof of faster exponential contraction in strongly convex cases.
Shows modest generalization improvements on CIFAR10 and FEMNIST datasets.
Abstract
Federated learning has emerged in the last decade as a distributed optimization paradigm due to the rapidly increasing number of portable devices able to support the heavy computational needs related to the training of machine learning models. Federated learning utilizes gradient-based optimization to minimize a loss objective shared across participating agents. To the best of our knowledge, the literature mostly lacks elegant solutions that naturally harness the reciprocal statistical similarity between clients to redesign the optimization procedure. To address this gap, by conceiving the federated network as a similarity graph, we propose a novel modified framework wherein each client locally performs a perturbed gradient step leveraging prior information about other statistically affine clients. We theoretically prove that our procedure, due to a suitably introduced adaptation in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Face and Expression Recognition
