CausalWrap: Model-Agnostic Causal Constraint Wrappers for Tabular Synthetic Data

Amir Asiaee; Zhuohui J. Liang; Chao Yan

arXiv:2603.02015·cs.LG·March 3, 2026

CausalWrap: Model-Agnostic Causal Constraint Wrappers for Tabular Synthetic Data

Amir Asiaee, Zhuohui J. Liang, Chao Yan

PDF

Open Access

TL;DR

CausalWrap is a model-agnostic method that enhances synthetic tabular data with causal structure fidelity, improving causal inference accuracy without sacrificing overall data utility.

Contribution

It introduces a post-hoc correction wrapper that injects partial causal knowledge into any pretrained generative model, ensuring better causal fidelity in synthetic data.

Findings

01

Reduces average treatment effect error by up to 63% on benchmarks.

02

Improves ATE agreement from 0.00 to 0.38 on ICU data.

03

Maintains high conventional utility of synthetic data.

Abstract

Tabular synthetic data generators are typically trained to match observational distributions, which can yield high conventional utility (e.g., column correlations, predictive accuracy) yet poor preservation of structural relations relevant to causal analysis and out-of-distribution (OOD) reasoning. When the downstream use of synthetic data involves causal reasoning -- estimating treatment effects, evaluating policies, or testing mediation pathways -- merely matching the observational distribution is insufficient: structural fidelity and treatment-mechanism preservation become essential. We propose CausalWrap (CW), a model-agnostic wrapper that injects partial causal knowledge (PCK) -- trusted edges, forbidden edges, and qualitative/monotonic constraints -- into any pretrained base generator (GAN, VAE, or diffusion model), without requiring access to its internals. CW learns a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Advanced Causal Inference Techniques · Machine Learning in Healthcare