Faster Training of Neural ODEs Using Gau{\ss}-Legendre Quadrature

Alexander Norcliffe; Marc Peter Deisenroth

arXiv:2308.10644·cs.LG·August 22, 2023·1 cites

Faster Training of Neural ODEs Using Gau{\ss}-Legendre Quadrature

Alexander Norcliffe, Marc Peter Deisenroth

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel approach to accelerate neural ODE training by employing Gau{e2}ss-Legendre quadrature for faster integral computation, improving efficiency especially for large models, and extends this method to SDE training via the Wong-Zakai theorem.

Contribution

The paper proposes using Gau{e2}ss-Legendre quadrature to speed up neural ODE training and extends this technique to SDEs through the Wong-Zakai theorem, offering a more efficient training method.

Findings

01

Faster training of neural ODEs demonstrated with large models.

02

Memory-efficient integral computation using Gau{e2}ss-Legendre quadrature.

03

Extension of the method to SDE training via the Wong-Zakai theorem.

Abstract

Neural ODEs demonstrate strong performance in generative and time-series modelling. However, training them via the adjoint method is slow compared to discrete models due to the requirement of numerically solving ODEs. To speed neural ODEs up, a common approach is to regularise the solutions. However, this approach may affect the expressivity of the model; when the trajectory itself matters, this is particularly important. In this paper, we propose an alternative way to speed up the training of neural ODEs. The key idea is to speed up the adjoint method by using Gau{\ss}-Legendre quadrature to solve integrals faster than ODE-based methods while remaining memory efficient. We also extend the idea to training SDEs using the Wong-Zakai theorem, by training a corresponding ODE and transferring the parameters. Our approach leads to faster training of neural ODEs, especially for large models.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

a-norcliffe/torch_gq_adjoint
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Neural Networks and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings