ODEFormer: Symbolic Regression of Dynamical Systems with Transformers
St\'ephane d'Ascoli, S\"oren Becker, Alexander Mathis, Philippe, Schwaller, Niki Kilbertus

TL;DR
ODEFormer is a transformer-based model capable of inferring symbolic multidimensional ODE systems from single solution trajectories, outperforming existing methods in accuracy, robustness, and speed.
Contribution
This work introduces ODEFormer, the first transformer model for symbolic regression of multidimensional ODEs from limited observational data.
Findings
Outperforms existing methods in accuracy and robustness
Handles noisy and irregularly sampled data effectively
Provides faster inference than previous approaches
Abstract
We introduce ODEFormer, the first transformer able to infer multidimensional ordinary differential equation (ODE) systems in symbolic form from the observation of a single solution trajectory. We perform extensive evaluations on two datasets: (i) the existing "Strogatz" dataset featuring two-dimensional systems; (ii) ODEBench, a collection of one- to four-dimensional systems that we carefully curated from the literature to provide a more holistic benchmark. ODEFormer consistently outperforms existing methods while displaying substantially improved robustness to noisy and irregularly sampled observations, as well as faster inference. We release our code, model and benchmark dataset publicly.
Peer Reviews
Decision·ICLR 2024 spotlight
The paper is extremely well-written (including notations, clearly stating contributions, etc.), very well motivated and the proposed method is shown to achieve state of the art performance. The placement within existing literature is very well articulated. Since, authors propose a benchmark, it is appreciated that the data generation procedure is outlined precisely. The section on filtering of the data to avoid rapidly converging and divergent systems is worth mentioning, details like these m
The authors have mentioned a few of their limitations, which is great. While authors mention the presence of a very related work "Becker, Sören, et al. "Predicting Ordinary Differential Equations with Transformers." (2023).", I am not sure why they do not elucidate the difference between the paper mentioned and their proposed method, why is this method not used as a baseline? There is no attempt to illustrate the difference at all. I have another point which I would like to bring up with the
The introduction of ODEFormer represents a novel framework that serves the purpose of generating ordinary differential equations (ODEs) specifically designed for testing dynamical systems. This innovative approach allows researchers and practitioners to create ODE models that can accurately capture and represent the dynamics of real-world systems, providing a valuable tool for testing and understanding the behavior of complex dynamical systems. ODEFormer introduces a pioneering use case for tra
One notable limitation in the presented work is the absence of comparisons with benchmarks employed in previous studies, such as the widely recognized benchmark datasets used in Neural ODE (Chen et al). This absence makes it challenging to gauge how the proposed ODEFormer framework performs in comparison to existing approaches on well-established and widely accepted testing scenarios. The demonstrated applicability of ODEFormer on toy datasets represents another potential limitation. Toy datase
(1) The paper is well motivated, as discovering the symbolic form of governing laws from observed data has always been the focus of scientific research. (2) The paper contributes to neural network models for symbolic regression of ordinary differential equations.
(1) The idea in this paper is not novel enough. The idea of transformer-based symbolic regression on tokenized sequences was previously reported[1]. The proposed model shares the same network structure but is applied to ordinary differential equation data. However, no analysis could be found in the paper on how the network is adapted to this new type of data. (2) Figures and explanations of the proposed model are way too rough and lack the necessary details. (3) This paper lacks crucial detail
Code & Models
Videos
Taxonomy
TopicsModel Reduction and Neural Networks · Evolutionary Algorithms and Applications · Gaussian Processes and Bayesian Inference
