FedRepOpt: Gradient Re-parametrized Optimizers in Federated Learning

Kin Wai Lau; Yasar Abbas Ur Rehman; Pedro Porto Buarque de Gusm\~ao,; Lai-Man Po; Lan Ma; Yuyang Xie

arXiv:2409.15898·cs.LG·October 11, 2024

FedRepOpt: Gradient Re-parametrized Optimizers in Federated Learning

Kin Wai Lau, Yasar Abbas Ur Rehman, Pedro Porto Buarque de Gusm\~ao,, Lai-Man Po, Lan Ma, Yuyang Xie

PDF

Open Access

TL;DR

FedRepOpt introduces a gradient re-parameterized optimizer for federated learning, enabling simple models on edge devices to achieve performance comparable to complex models with faster convergence.

Contribution

The paper proposes FedRepOpt, a novel optimizer that re-parameters gradients to allow simple models to mimic complex models in federated learning, improving performance and convergence.

Findings

01

Models with FedRepOpt outperform RepGhost and RepVGG by 11-17% in accuracy.

02

FedRepOpt achieves 11.7% and 57.4% faster convergence times.

03

Significant performance boosts demonstrate effectiveness across VGG-style and Ghost-style models.

Abstract

Federated Learning (FL) has emerged as a privacy-preserving method for training machine learning models in a distributed manner on edge devices. However, on-device models face inherent computational power and memory limitations, potentially resulting in constrained gradient updates. As the model's size increases, the frequency of gradient updates on edge devices decreases, ultimately leading to suboptimal training outcomes during any particular FL round. This limits the feasibility of deploying advanced and large-scale models on edge devices, hindering the potential for performance enhancements. To address this issue, we propose FedRepOpt, a gradient re-parameterized optimizer for FL. The gradient re-parameterized method allows training a simple local model with a similar performance as a complex model by modifying the optimizer's gradients according to a set of model-specific…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Machine Learning and ELM

MethodsSparse Evolutionary Training · Focus