The Power of Extrapolation in Federated Learning

Hanmin Li; Kirill Acharya; Peter Richt\'arik

arXiv:2405.13766·math.OC·October 2, 2024·NeurIPS

The Power of Extrapolation in Federated Learning

Hanmin Li, Kirill Acharya, Peter Richt\'arik

PDF

Open Access 1 Video

TL;DR

This paper introduces server-extrapolation strategies for federated learning, specifically enhancing FedProx's convergence through theoretical analysis and novel extrapolation methods in convex settings.

Contribution

It proposes FedExProx with three extrapolation strategies and provides the first theoretical guarantees for such methods in federated learning.

Findings

01

Extrapolation improves convergence in federated learning.

02

Theoretical guarantees are established for the proposed methods.

03

Numerical experiments confirm the effectiveness of extrapolation strategies.

Abstract

We propose and study several server-extrapolation strategies for enhancing the theoretical and empirical convergence properties of the popular federated learning optimizer FedProx [Li et al., 2020]. While it has long been known that some form of extrapolation can help in the practice of FL, only a handful of works provide any theoretical guarantees. The phenomenon seems elusive, and our current theoretical understanding remains severely incomplete. In our work, we focus on smooth convex or strongly convex problems in the interpolation regime. In particular, we propose Extrapolated FedProx (FedExProx), and study three extrapolation strategies: a constant strategy (depending on various smoothness parameters and the number of participating devices), and two smoothness-adaptive strategies; one based on the notion of gradient diversity (FedExProx-GraDS), and the other one based on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

The Power of Extrapolation in Federated Learning· slideslive

Taxonomy

TopicsPrivacy-Preserving Technologies in Data