Symplectic Adjoint Method for Exact Gradient of Neural ODE with Minimal   Memory

Takashi Matsubara; Yuto Miyatake; Takaharu Yaguchi

arXiv:2102.09750·cs.LG·October 20, 2021·5 cites

Symplectic Adjoint Method for Exact Gradient of Neural ODE with Minimal Memory

Takashi Matsubara, Yuto Miyatake, Takaharu Yaguchi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces the symplectic adjoint method for neural ODEs, achieving exact gradients with minimal memory and improved computational efficiency compared to traditional methods.

Contribution

It proposes a novel symplectic adjoint method that provides exact gradients with low memory usage and better robustness, advancing neural ODE training techniques.

Findings

01

Consumes less memory than backpropagation and checkpointing

02

Faster than traditional adjoint method

03

More robust to rounding errors

Abstract

A neural network model of a differential equation, namely neural ODE, has enabled the learning of continuous-time dynamical systems and probabilistic distributions with high accuracy. The neural ODE uses the same network repeatedly during a numerical integration. The memory consumption of the backpropagation algorithm is proportional to the number of uses times the network size. This is true even if a checkpointing scheme divides the computation graph into sub-graphs. Otherwise, the adjoint method obtains a gradient by a numerical integration backward in time. Although this method consumes memory only for a single network use, it requires high computational cost to suppress numerical errors. This study proposes the symplectic adjoint method, which is an adjoint method solved by a symplectic integrator. The symplectic adjoint method obtains the exact gradient (up to rounding error) with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tksmatsubara/symplectic-adjoint-method
pytorch

Videos

Symplectic Adjoint Method for Exact Gradient of Neural ODE with Minimal Memory· slideslive

Taxonomy

TopicsModel Reduction and Neural Networks · Fluid Dynamics and Turbulent Flows · Neural Networks and Applications