Sparsity in long-time control of neural ODEs

Carlos Esteve-Yag\"ue; Borjan Geshkovski

arXiv:2102.13566·cs.LG·September 9, 2022

Sparsity in long-time control of neural ODEs

Carlos Esteve-Yag\"ue, Borjan Geshkovski

PDF

Open Access 1 Repo

TL;DR

This paper studies neural ODEs with penalties, proving optimal controls vanish after a certain time, leading to ordered sparsity in neural network parameters and establishing a turnpike property for long-time control.

Contribution

It introduces a novel analysis of -penalized neural ODE control, showing sparsity patterns and stability estimates without small data assumptions.

Findings

01

Optimal control vanishes beyond a positive stopping time.

02

Parameters exhibit ordered sparsity beyond a certain layer.

03

Empirical risk stability follows a polynomial estimate over time.

Abstract

We consider the neural ODE and optimal control perspective of supervised learning, with $ℓ^{1}$ -control penalties, where rather than only minimizing a final cost (the \emph{empirical risk}) for the state, we integrate this cost over the entire time horizon. We prove that any optimal control (for this cost) vanishes beyond some positive stopping time. When seen in the discrete-time context, this result entails an \emph{ordered} sparsity pattern for the parameters of the associated residual neural network: ordered in the sense that these parameters are all $0$ beyond a certain layer. Furthermore, we provide a polynomial stability estimate for the empirical risk with respect to the time horizon. This can be seen as a \emph{turnpike property}, for nonsmooth dynamics and functionals with $ℓ^{1}$ -penalties, and without any smallness assumptions on the data, both of which are new in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

borjanG/dynamical.systems
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Advanced Numerical Methods in Computational Mathematics

Methods*Communicated@Fast*How Do I Communicate to Expedia?