Neural Context Flows for Meta-Learning of Dynamical Systems
Roussel Desmond Nzoyem, David A.W. Barton, Tom Deakin

TL;DR
Neural Context Flow (NCF) is a meta-learning framework that enhances Neural ODEs' ability to adapt to unseen dynamic behaviors, especially with unobserved parameters, by using Taylor expansion-based contextual self-modulation and uncertainty estimation.
Contribution
This paper introduces NCF, a novel meta-learning approach with theoretical guarantees that improves NODEs' out-of-distribution performance and interpretability in dynamical systems.
Findings
NCF achieves state-of-the-art OOD performance on 5 out of 6 benchmarks.
The framework effectively models unobserved parameter variations.
Extensive experiments reveal the interpretability and flexibility of NCF.
Abstract
Neural Ordinary Differential Equations (NODEs) often struggle to adapt to new dynamic behaviors caused by parameter changes in the underlying physical system, even when these dynamics are similar to previously observed behaviors. This problem becomes more challenging when the changing parameters are unobserved, meaning their value or influence cannot be directly measured when collecting data. To address this issue, we introduce Neural Context Flow (NCF), a robust and interpretable Meta-Learning framework that includes uncertainty estimation. NCF uses Taylor expansion to enable contextual self-modulation, allowing context vectors to influence dynamics from other domains while also modulating themselves. After establishing theoretical guarantees, we empirically test NCF and compare it to related adaptation methods. Our results show that NCF achieves state-of-the-art Out-of-Distribution…
Peer Reviews
Decision·ICLR 2025 Poster
- Writing is very clear and the methodology is well explained. This allows readers to understand the differences between this method and previous ones. - Interesting use of context vectors through the 3-network model. Ablation studies in supplementary material show the need for such an architecture. - The combination via context through the Taylor expansion seems to be an interesting and novel application, which I can see being used in other fields outside of ODE and PDE simulations. - The esti
Manuscript makes reference to sample efficiency of using such adaptive models for new context. However the manuscript does not include any experiments to support such a statement.
The targeted problem is important. Building neural-ode like solvers able to generalize to changes in the PDE coefficients is important, often referred to solving parametric PDEs. The method seems novel for learning dynamical systems with changes in pde coefficients. The use of taylor expansion is intuitive and natural. I particularly liked the intuition given line 202-205. Existing context-based methods do not try to leverage information from each context vectors, each describing the environmen
Regarding the writing style of the paper: - I think there is room for improvement. In the introduction, I think the problem of solving parametric PDEs / learning dynamical systems with varying PDE coefficients should be stated more clearly and explains what is an environment in your specific setting. The introduction should 1) clearly defines the problem of building generalizable neural PDE solvers, 2) what are the different directions taken to do so [1, 2, 3] and 3) explain how your work fits i
The paper introduces a method that seems reasonable, with several interesting analysis of the behavior of the learned vector field. The related works section does present the most popular baselines and methods for this problem. The datasets used to evaluate the method are in par with what is currently used in the litterature.
I have several major concerns on this paper that I will try to rank from higher to lower importance. 1) **Overfit**: I am very confused by the OoD adaption protocol used in the paper. From what I understood, Algorithm 2 is used on the validation set to tune the value of $\xi$ using gradient descent. Then, the prediction error is computed **on the same trajectories as the one used to pick $\xi$**. Given the size of $\xi$, it is most certainly overfitting to the (small) set of trajectories used to
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks
