Preconditioned accelerated gradient descent methods for locally Lipschitz smooth objectives with applications to the solution of nonlinear PDEs
Jea-Hyun Park, Abner J. Salgado, Steven M. Wise

TL;DR
This paper establishes a theoretical foundation for preconditioned accelerated gradient descent (PAGD) in solving nonlinear PDEs, demonstrating exponential convergence and improved efficiency over traditional methods through rigorous analysis and numerical experiments.
Contribution
It introduces a new theoretical framework for PAGD applied to PDEs, including invariant set existence, convergence rates, and a connection to a second-order ODE, with practical numerical validation.
Findings
PAGD achieves exponential convergence for locally Lipschitz smooth objectives.
The method's convergence rate is mesh size-independent and faster than PGD.
Numerical experiments confirm the theoretical acceleration and stability.
Abstract
We develop a theoretical foundation for the application of Nesterov's accelerated gradient descent method (AGD) to the approximation of solutions of a wide class of partial differential equations (PDEs). This is achieved by proving the existence of an invariant set and exponential convergence rates when its preconditioned version (PAGD) is applied to minimize locally Lipschitz smooth, strongly convex objective functionals. We introduce a second-order ordinary differential equation (ODE) with a preconditioner built-in and show that PAGD is an explicit time-discretization of this ODE, which requires a natural time step restriction for energy stability. At the continuous time level, we show an exponential convergence of the ODE solution to its steady state using a simple energy argument. At the discrete level, assuming the aforementioned step size restriction, the existence of an invariant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
