Joint Treatment Effect Estimation from Incomplete Healthcare Data: Temporal Causal Normalizing Flows with LLM-driven Evolutionary MNAR Imputation

Olivia Jullian Parra; Sara Zoccheddu; David Catalan Cerezo; Tom Forzy; Franziska Ulrich; William Sutcliffe; Jakob Martin Burgstaller; Oliver Senn; Patrick Owen; Nicola Serra

arXiv:2605.05125·cs.LG·May 7, 2026

Joint Treatment Effect Estimation from Incomplete Healthcare Data: Temporal Causal Normalizing Flows with LLM-driven Evolutionary MNAR Imputation

Olivia Jullian Parra, Sara Zoccheddu, David Catalan Cerezo, Tom Forzy, Franziska Ulrich, William Sutcliffe, Jakob Martin Burgstaller, Oliver Senn, Patrick Owen, Nicola Serra

PDF

TL;DR

This paper introduces a novel pipeline combining causal normalizing flows with LLM-driven imputation to accurately estimate treatment effects from incomplete, time-varying EHR data, addressing missingness and confounding.

Contribution

It proposes a two-stage approach with DAG-constrained normalizing flows for causal inference and an LLM-based imputer for handling MNAR missing data, improving robustness and accuracy.

Findings

01

CausalFlow-T achieves exact invertible counterfactual inference, reducing approximation errors.

02

The LLM-driven imputer outperforms baselines in MNAR scenarios, enhancing data completeness.

03

Applied to real-world EHRs, the pipeline's treatment effect estimates align with randomized trial results.

Abstract

Target trial emulation (TTE) enables causal questions to be studied with observational data when randomized controlled trials (RCTs) are infeasible. Yet treatment-effect methods often address causal estimation, missingness, and temporal structure separately, limiting their robustness in electronic health records (EHRs), where time-varying confounding and missing-not-at-random (MNAR) biomarkers can reach 50%--80%. We propose a two-stage pipeline for treatment effect estimation from incomplete longitudinal EHRs. First, CausalFlow-T, a directed acyclic graph (DAG)-constrained normalizing flow with long short-term memory (LSTM)-encoded patient history, performs exact invertible counterfactual inference, avoiding approximation errors from variational inference and separating confounding through explicit causal structure. Ablations on four synthetic and one semi-synthetic benchmark with known…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.