Proximal Implicit ODE Solvers for Accelerating Learning Neural ODEs
Justin Baker, Hedi Xia, Yiwei Wang, Elena Cherkaev, Akil, Narayan, Long Chen, Jack Xin, Andrea L. Bertozzi, Stanley J., Osher, Bao Wang

TL;DR
This paper introduces proximal implicit ODE solvers for neural ODEs, which improve stability and efficiency over explicit solvers, enabling faster training on complex tasks like graph neural networks and normalizing flows.
Contribution
It proposes a novel proximal implicit ODE solver framework that leverages inner-outer iterations, offering better stability and efficiency for learning neural ODEs.
Findings
Proximal implicit solvers outperform explicit solvers in stability.
They require fewer computational resources for the same accuracy.
Validated on graph neural networks and normalizing flows.
Abstract
Learning neural ODEs often requires solving very stiff ODE systems, primarily using explicit adaptive step size ODE solvers. These solvers are computationally expensive, requiring the use of tiny step sizes for numerical stability and accuracy guarantees. This paper considers learning neural ODEs using implicit ODE solvers of different orders leveraging proximal operators. The proximal implicit solver consists of inner-outer iterations: the inner iterations approximate each implicit update step using a fast optimization algorithm, and the outer iterations solve the ODE system over time. The proximal implicit ODE solver guarantees superiority over explicit solvers in numerical stability and computational efficiency. We validate the advantages of proximal implicit solvers over existing popular neural ODE solvers on various challenging benchmark tasks, including learning continuous-depth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks
