Parallel-in-iteration optimization using multigrid reduction-in-time
G. H. M. Ara\'ujo, O. A. Krzysik, H. De Sterck

TL;DR
This paper introduces a parallel-in-iteration optimization framework using multigrid reduction-in-time (MGRIT) to accelerate convergence of gradient-based methods, demonstrated on convex and nonsmooth problems with promising parallel speedups.
Contribution
The work adapts parallel-in-time algorithms, specifically MGRIT, to optimize iterative algorithms, enabling parallelization across iterations for faster convergence in large-scale problems.
Findings
MGRIT achieves fast convergence similar to PDE diffusion problems.
Parallel speedup predictions align with theoretical models.
Applicable to both smooth and nonsmooth optimization problems.
Abstract
Standard gradient-based iteration algorithms for optimization, such as gradient descent and its various proximal-based extensions to nonsmooth problems, are known to converge slowly for ill-conditioned problems, sometimes requiring many tens of thousands of iterations in practice. Since these iterations are computed sequentially, they may present a computational bottleneck in large-scale parallel simulations. In this work, we present a "parallel-in-iteration" framework that allows one to parallelize across these iterations using multiple processors with the objective of reducing the wall-clock time needed to solve the underlying optimization problem. Our methodology is based on re-purposing parallel time integration algorithms for time-dependent differential equations, motivated by the fact that optimization algorithms often have interpretations as discretizations of time-dependent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Numerical methods for differential equations · Advanced Optimization Algorithms Research
