CausalWrap: Model-Agnostic Causal Constraint Wrappers for Tabular Synthetic Data
Amir Asiaee, Zhuohui J. Liang, Chao Yan

TL;DR
CausalWrap is a model-agnostic method that enhances synthetic tabular data with causal structure fidelity, improving causal inference accuracy without sacrificing overall data utility.
Contribution
It introduces a post-hoc correction wrapper that injects partial causal knowledge into any pretrained generative model, ensuring better causal fidelity in synthetic data.
Findings
Reduces average treatment effect error by up to 63% on benchmarks.
Improves ATE agreement from 0.00 to 0.38 on ICU data.
Maintains high conventional utility of synthetic data.
Abstract
Tabular synthetic data generators are typically trained to match observational distributions, which can yield high conventional utility (e.g., column correlations, predictive accuracy) yet poor preservation of structural relations relevant to causal analysis and out-of-distribution (OOD) reasoning. When the downstream use of synthetic data involves causal reasoning -- estimating treatment effects, evaluating policies, or testing mediation pathways -- merely matching the observational distribution is insufficient: structural fidelity and treatment-mechanism preservation become essential. We propose CausalWrap (CW), a model-agnostic wrapper that injects partial causal knowledge (PCK) -- trusted edges, forbidden edges, and qualitative/monotonic constraints -- into any pretrained base generator (GAN, VAE, or diffusion model), without requiring access to its internals. CW learns a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Advanced Causal Inference Techniques · Machine Learning in Healthcare
