Optimal Time Complexities of Parallel Stochastic Optimization Methods Under a Fixed Computation Model

Alexander Tyurin; Peter Richtárik

arXiv:2305.12387 · math.OC · November 28, 2023 · NeurIPS · 2 cites

TL;DR

This paper establishes the fundamental limits and optimal algorithms for parallel stochastic optimization under a fixed computation time model, extending classical sequential theory to parallel settings.

Contribution

It introduces a new protocol generalizing the oracle framework and derives minimax complexities for parallel methods with stochastic gradients, providing optimal algorithms.

Findings

01. Derived minimax complexity bounds for parallel stochastic optimization.

02. Developed algorithms that attain these bounds, proving their optimality.

03. Revealed implications for asynchronous optimization methods.

Abstract

Parallelization is a popular strategy for improving the performance of iterative algorithms. Optimization methods are no exception: the design of efficient parallel optimization methods and the tight analysis of their theoretical properties are important research endeavors. While the minimax complexities are well known for sequential optimization methods, the theory of parallel optimization methods is less explored. In this paper, we propose a new protocol that generalizes the classical oracle framework approach. Using this protocol, we establish minimax complexities for parallel optimization methods that have access to an unbiased stochastic gradient oracle with bounded variance. We consider a fixed computation model in which each worker requires a fixed but worker-dependent time to calculate a stochastic gradient. We prove lower bounds and develop optimal algorithms that attain them.…
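The fixed computation model in the abstract can be illustrated with a small simulation. The sketch below is not taken from the paper: the function names `gradients_by_time` and `time_to_collect`, the per-worker times in `taus`, and the batch size are all hypothetical, chosen only to show how worker-dependent computation times determine how many stochastic gradients a group of parallel workers can finish by a given time. Integer times are used so that the arithmetic is exact.

```python
def gradients_by_time(taus, t):
    """Count the stochastic gradients finished by time t.

    Assumption: worker i needs exactly taus[i] time units per gradient,
    starts at time 0, and immediately begins the next gradient after
    finishing one. All workers run in parallel.
    """
    return sum(int(t // tau) for tau in taus)


def time_to_collect(taus, batch_size):
    """Earliest time at which the workers have jointly finished
    batch_size gradients under the assumptions above."""
    # The count only changes at completion times k * tau_i, so it is
    # enough to scan those candidates in increasing order.
    candidates = sorted({k * tau for tau in taus
                         for k in range(1, batch_size + 1)})
    for t in candidates:
        if gradients_by_time(taus, t) >= batch_size:
            return t


# Hypothetical example: two fast workers and one slow straggler.
taus = [1, 2, 10]
asynchronous_time = time_to_collect(taus, 6)  # do not wait for the straggler
synchronous_time = max(taus)                  # wait for one gradient per worker
print(asynchronous_time, synchronous_time)
```

In this toy setting, waiting for every worker to return one gradient takes as long as the slowest worker, while collecting gradients from whichever workers finish first reaches a larger batch sooner. This is only meant to make the time model concrete; the paper's actual protocol, lower bounds, and algorithms are not reproduced here.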

Videos

Optimal Time Complexities of Parallel Stochastic Optimization Methods Under a Fixed Computation Model · slideslive

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs · Error Correcting Code Techniques