Methods with Local Steps and Random Reshuffling for Generally Smooth   Non-Convex Federated Optimization

Yury Demidovich; Petr Ostroukhov; Grigory Malinovsky; Samuel; Horv\'ath; Martin Tak\'a\v{c}; Peter Richt\'arik; Eduard Gorbunov

arXiv:2412.02781·math.OC·April 14, 2025

Methods with Local Steps and Random Reshuffling for Generally Smooth Non-Convex Federated Optimization

Yury Demidovich, Petr Ostroukhov, Grigory Malinovsky, Samuel, Horv\'ath, Martin Tak\'a\v{c}, Peter Richt\'arik, Eduard Gorbunov

PDF

Open Access

TL;DR

This paper introduces new federated optimization methods that work under generalized smoothness assumptions, incorporating local steps, partial participation, and random reshuffling, with theoretical analysis and experimental validation.

Contribution

It proposes and analyzes federated algorithms with local steps and random reshuffling under generalized smoothness, extending existing methods without restrictive assumptions.

Findings

01

Methods perform well under generalized smoothness.

02

Theoretical analysis under Polyak-Łojasiewicz condition.

03

Experimental results support theoretical claims.

Abstract

Non-convex Machine Learning problems typically do not adhere to the standard smoothness assumption. Based on empirical findings, Zhang et al. (2020b) proposed a more realistic generalized $(L_{0}, L_{1})$ -smoothness assumption, though it remains largely unexplored. Many existing algorithms designed for standard smooth problems need to be revised. However, in the context of Federated Learning, only a few works address this problem but rely on additional limiting assumptions. In this paper, we address this gap in the literature: we propose and analyze new methods with local steps, partial participation of clients, and Random Reshuffling without extra restrictive assumptions beyond generalized smoothness. The proposed methods are based on the proper interplay between clients' and server's stepsizes and gradient clipping. Furthermore, we perform the first analysis of these methods under the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Distributed Control Multi-Agent Systems · Sparse and Compressive Sensing Techniques