An Ode to an ODE

Krzysztof Choromanski; Jared Quincy Davis; Valerii Likhosherstov,; Xingyou Song; Jean-Jacques Slotine; Jacob Varley; Honglak Lee; Adrian Weller,; Vikas Sindhwani

arXiv:2006.11421·cs.LG·June 24, 2020

An Ode to an ODE

Krzysztof Choromanski, Jared Quincy Davis, Valerii Likhosherstov,, Xingyou Song, Jean-Jacques Slotine, Jacob Varley, Honglak Lee, Adrian Weller,, Vikas Sindhwani

PDF

Open Access

TL;DR

This paper introduces ODEtoODE, a novel Neural ODE paradigm with time-dependent parameters evolving on a matrix manifold, improving training stability and solving gradient issues in deep networks.

Contribution

The paper proposes a new Neural ODE framework with parameters constrained on a manifold, providing stability, convergence guarantees, and improved performance over existing methods.

Findings

01

Enhanced training stability and convergence in deep neural networks.

02

Superior performance in reinforcement learning and supervised tasks.

03

Provable solutions to gradient vanishing and explosion problems.

Abstract

We present a new paradigm for Neural ODE algorithms, called ODEtoODE, where time-dependent parameters of the main flow evolve according to a matrix flow on the orthogonal group O(d). This nested system of two flows, where the parameter-flow is constrained to lie on the compact manifold, provides stability and effectiveness of training and provably solves the gradient vanishing-explosion problem which is intrinsically related to training deep neural network architectures such as Neural ODEs. Consequently, it leads to better downstream models, as we show on the example of training reinforcement learning policies with evolution strategies, and in the supervised learning setting, by comparing with previous SOTA baselines. We provide strong convergence results for our proposed mechanism that are independent of the depth of the network, supporting our empirical studies. Our results show an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Model Reduction and Neural Networks · Stochastic Gradient Optimization Techniques