Discretize-Optimize vs. Optimize-Discretize for Time-Series Regression and Continuous Normalizing Flows
Derek Onken, Lars Ruthotto

TL;DR
This paper compares discretize-optimize and optimize-discretize methods for neural ODEs in time-series regression and continuous normalizing flows, showing that discretize-optimize can significantly reduce training costs while maintaining performance.
Contribution
It extends the comparison of Disc-Opt and Opt-Disc methods to neural ODEs for time-series tasks, demonstrating the efficiency of Disc-Opt with proper numerical treatment.
Findings
Disc-Opt achieves similar accuracy as Opt-Disc with lower training costs.
Training time reduced by 39% to 97% in most cases.
In one case, Disc-Opt reduced training from nine days to less than one day.
Abstract
We compare the discretize-optimize (Disc-Opt) and optimize-discretize (Opt-Disc) approaches for time-series regression and continuous normalizing flows (CNFs) using neural ODEs. Neural ODEs are ordinary differential equations (ODEs) with neural network components. Training a neural ODE is an optimal control problem where the weights are the controls and the hidden features are the states. Every training iteration involves solving an ODE forward and another backward in time, which can require large amounts of computation, time, and memory. Comparing the Opt-Disc and Disc-Opt approaches in image classification tasks, Gholami et al. (2019) suggest that Disc-Opt is preferable due to the guaranteed accuracy of gradients. In this paper, we extend the comparison to neural ODEs for time-series regression and CNFs. Unlike in classification, meaningful models in these tasks must also satisfy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Gaussian Processes and Bayesian Inference · Anomaly Detection Techniques and Applications
MethodsNormalizing Flows
