DoFlow: Flow-based Generative Models for Interventional and Counterfactual Forecasting on Time Series

Dongze Wu; Feng Qiu; Yao Xie

arXiv:2511.02137·stat.ML·March 3, 2026

DoFlow: Flow-based Generative Models for Interventional and Counterfactual Forecasting on Time Series

Dongze Wu, Feng Qiu, Yao Xie

PDF

Open Access 3 Reviews

TL;DR

DoFlow is a flow-based generative model that provides accurate observational, interventional, and counterfactual predictions for multivariate time series, while also enabling anomaly detection and causal reasoning.

Contribution

It introduces DoFlow, a novel flow-based model over causal DAGs that unifies causal inference with generative time series forecasting, including counterfactuals.

Findings

01

Achieves accurate observational forecasting on synthetic and real data.

02

Enables causal forecasting for interventional and counterfactual queries.

03

Effectively detects anomalies using explicit likelihoods.

Abstract

Time-series forecasting increasingly demands not only accurate observational predictions but also causal forecasting under interventional and counterfactual queries in multivariate systems. We present DoFlow, a flow-based generative model defined over a causal Directed Acyclic Graph (DAG) that delivers coherent observational and interventional predictions, as well as counterfactuals through the natural encoding-decoding mechanism of continuous normalizing flows (CNFs). We also provide a supporting counterfactual recovery theory under certain assumptions. Beyond forecasting, DoFlow provides explicit likelihoods of future trajectories, enabling principled anomaly detection. Experiments on synthetic datasets with various causal DAG structures and real-world hydropower and cancer-treatment time series show that DoFlow achieves accurate system-wide observational forecasting, enables causal…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 6Confidence 3

Strengths

* Encoder–decoder CNF over a causal DAG neatly instantiates **abduction–action–prediction**, so observational, interventional, and counterfactual forecasts all come from one coherent mechanism. * **Impressive empirical results**: strong observational/interventional accuracy and compelling real-world demos (incl. early anomaly/outage detection), plus robustness across diverse synthetic DAGs. * **Theory that matches the use case**: clear assumptions lead to bijectivity-in-noise and a **counterfact

Weaknesses

* **Positioning vs prior causal–flow work could be clearer.** The paper does a good job situating the CNF side (Neural ODEs, flow matching, encode–decode) but is lighter on contrasting with prior *causal* modeling using flows (e.g., works like *Causal Normalizing Flows: From Theory to Practice*, Javaloy et al.). A short subsection disentangling what comes from causal literature, what from flows, and what’s **novel here** would help readers map contributions more precisely. * **Prediction inter

Reviewer 02Rating 2Confidence 5

Strengths

1. This paper introduces an unexplored research direction, as claimed by the authors, which could be of potential value. 2. The technical pipeline presented in this work appears to be complete. 3. Producing the probability density function (PDF) instead of a single counterfactual forecast is an insightful approach.

Weaknesses

1. Several key concepts are not clearly introduced at the beginning. For example, in the Introduction section, it is unclear what is meant by “interventional question.” How are “binary or discrete, fixed-time actions, treatment focus …” defined? What does “extrapolating a counterfactual path and contrasting it” mean? Most importantly, the key concept of “counterfactual” should be clearly illustrated from the beginning. What does “these works do not encode causal DAGs or simulate system-wide traj

Reviewer 03Rating 6Confidence 2

Strengths

- Preliminaries about NeuralODE and Flow matching are clear. Additionally, the method section: 'Time conditioned flow on a causal DAG' is very well written and the method is easy to understand. The method is based on state of the art deep learning techniques, and all the proposed strategy is sound and practical. I find the model useful for future reseach. However, I would like to see a complete scheme of the implementation of the framework: how many NeuralODEs are needed, how the RNN is traine

Weaknesses

- Neither the problem or the solution proposed are well specified in the introduction. There are many unconected paragraphs that do not provide a connected story line about the motivation, the **limitations of existing work** or the strategy that this paper proposes. For example, `line 96`says: ''motivated by these gaps...'', which gaps? Also, more information about the interventions that the model is suposed to handle has to be provided. What does '...counterfactual queries across the entire sy

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Time Series Analysis and Forecasting · Generative Adversarial Networks and Image Synthesis