On the Convergence of FedProx with Extrapolation and Inexact Prox
Hanmin Li, Peter Richt\'arik

TL;DR
This paper analyzes the convergence of the FedExProx federated learning algorithm with server-side extrapolation when clients compute proximal operators approximately, showing it converges to a neighborhood of the solution and can be made robust to inexactness.
Contribution
It extends the theoretical analysis of FedExProx by removing the exactness assumption, demonstrating convergence behavior with inexact proximal computations in convex settings.
Findings
Inexactness causes convergence to a neighborhood of the solution.
Careful control of inexactness mitigates adverse effects.
Robustness of extrapolation to inexact proximal updates is validated.
Abstract
Enhancing the FedProx federated learning algorithm (Li et al., 2020) with server-side extrapolation, Li et al. (2024a) recently introduced the FedExProx method. Their theoretical analysis, however, relies on the assumption that each client computes a certain proximal operator exactly, which is impractical since this is virtually never possible to do in real settings. In this paper, we investigate the behavior of FedExProx without this exactness assumption in the smooth and globally strongly convex setting. We establish a general convergence result, showing that inexactness leads to convergence to a neighborhood of the solution. Additionally, we demonstrate that, with careful control, the adverse effects of this inexactness can be mitigated. By linking inexactness to biased compression (Beznosikov et al., 2023), we refine our analysis, highlighting robustness of extrapolation to inexact…
Peer Reviews
Decision·Submitted to ICLR 2026
- The paper fills a gap in the analysis of the previous work that introduced FedExProx by removing the assumption of exactness of the local proximal operator.
- While the paper's analysis is sound and well-presented, its contribution is narrowly focused on a single variant of FedAvg—the FedExProx algorithm. It is unclear how the insights from the analysis apply more broadly and how they guide the design of federated optimization algorithms in general. - In the literature review, please cite previous analyses of FedProx, such as FedNova (https://arxiv.org/abs/2007.07481), which includes as a special case the convergence analysis of FedProx (with $\tau$
1. The motivation of the paper is clear. The first paper that analyzes FedExProx assumes exact subproblem solvers, which cannot be implemented. 2. The theory is backed by reasonable proofs.
The contribution is overall a bit limited. > Scope The main contribution of the paper is to extend the analysis of FedExProx to allow inexact local solutions in the proximal steps. This has already been partially done in [1] (Appendix E) using the definition of Absolute approximation, under PL conditions. FedExProx was originally proposed and analyzed in [1,2]. These papers also study client sampling, adaptive stepsize, and demonstrate the benefits of extrapolation in terms of communication
1. The paper addresses a practical gap of FedExProx by relaxing the unrealistic exact proximal operator assumption, establishing convergence under inexact updates, which bridges theory and practical FL applications. 2. The paper is generally well organized and clearly presented, with sufficient technical details and proactive supplementary analyses offered to enhance readability and applicability. 3. The idea of linking proximal inexactness to biased compression is somewhat interesting, and it
The major concern goes to the novely of analysis and significance of results, given that this work essentially represents a theoretical contribution to FL. 1. The convergence analysis is incrementally novel, mostly extending existing proof techniques without any particularly new ideas/tools developed. While the core results are intuitive and interesting, they are not expected to generate significant impact on FL research, both in thoery and practice. 2. The analysis relies on overly strong a
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical Methods and Algorithms · Digital Filter Design and Implementation
