DoFlow: Flow-based Generative Models for Interventional and Counterfactual Forecasting on Time Series
Dongze Wu, Feng Qiu, Yao Xie

TL;DR
DoFlow is a flow-based generative model that provides accurate observational, interventional, and counterfactual predictions for multivariate time series, while also enabling anomaly detection and causal reasoning.
Contribution
It introduces DoFlow, a novel flow-based model over causal DAGs that unifies causal inference with generative time series forecasting, including counterfactuals.
Findings
Achieves accurate observational forecasting on synthetic and real data.
Enables causal forecasting for interventional and counterfactual queries.
Effectively detects anomalies using explicit likelihoods.
Abstract
Time-series forecasting increasingly demands not only accurate observational predictions but also causal forecasting under interventional and counterfactual queries in multivariate systems. We present DoFlow, a flow-based generative model defined over a causal Directed Acyclic Graph (DAG) that delivers coherent observational and interventional predictions, as well as counterfactuals through the natural encoding-decoding mechanism of continuous normalizing flows (CNFs). We also provide a supporting counterfactual recovery theory under certain assumptions. Beyond forecasting, DoFlow provides explicit likelihoods of future trajectories, enabling principled anomaly detection. Experiments on synthetic datasets with various causal DAG structures and real-world hydropower and cancer-treatment time series show that DoFlow achieves accurate system-wide observational forecasting, enables causal…
Peer Reviews
Decision·ICLR 2026 Poster
* Encoder–decoder CNF over a causal DAG neatly instantiates **abduction–action–prediction**, so observational, interventional, and counterfactual forecasts all come from one coherent mechanism. * **Impressive empirical results**: strong observational/interventional accuracy and compelling real-world demos (incl. early anomaly/outage detection), plus robustness across diverse synthetic DAGs. * **Theory that matches the use case**: clear assumptions lead to bijectivity-in-noise and a **counterfact
* **Positioning vs prior causal–flow work could be clearer.** The paper does a good job situating the CNF side (Neural ODEs, flow matching, encode–decode) but is lighter on contrasting with prior *causal* modeling using flows (e.g., works like *Causal Normalizing Flows: From Theory to Practice*, Javaloy et al.). A short subsection disentangling what comes from causal literature, what from flows, and what’s **novel here** would help readers map contributions more precisely. * **Prediction inter
1. This paper introduces an unexplored research direction, as claimed by the authors, which could be of potential value. 2. The technical pipeline presented in this work appears to be complete. 3. Producing the probability density function (PDF) instead of a single counterfactual forecast is an insightful approach.
1. Several key concepts are not clearly introduced at the beginning. For example, in the Introduction section, it is unclear what is meant by “interventional question.” How are “binary or discrete, fixed-time actions, treatment focus …” defined? What does “extrapolating a counterfactual path and contrasting it” mean? Most importantly, the key concept of “counterfactual” should be clearly illustrated from the beginning. What does “these works do not encode causal DAGs or simulate system-wide traj
- Preliminaries about NeuralODE and Flow matching are clear. Additionally, the method section: 'Time conditioned flow on a causal DAG' is very well written and the method is easy to understand. The method is based on state of the art deep learning techniques, and all the proposed strategy is sound and practical. I find the model useful for future reseach. However, I would like to see a complete scheme of the implementation of the framework: how many NeuralODEs are needed, how the RNN is traine
- Neither the problem or the solution proposed are well specified in the introduction. There are many unconected paragraphs that do not provide a connected story line about the motivation, the **limitations of existing work** or the strategy that this paper proposes. For example, `line 96`says: ''motivated by these gaps...'', which gaps? Also, more information about the interventions that the model is suposed to handle has to be provided. What does '...counterfactual queries across the entire sy
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Time Series Analysis and Forecasting · Generative Adversarial Networks and Image Synthesis
