VIGOR+: Iterative Confounder Generation and Validation via LLM-CEVAE Feedback Loop
JiaWei Zhu, ZiHeng Liu

TL;DR
VIGOR+ introduces an iterative framework combining LLM-generated confounders with CEVAE validation, improving the plausibility and statistical utility of hidden confounders in causal inference from observational data.
Contribution
The paper presents VIGOR+, a novel iterative feedback loop that refines confounder generation using LLMs guided by CEVAE validation signals, formalizing and proving convergence.
Findings
Effective iterative refinement of confounders demonstrated.
Convergence properties established under mild assumptions.
Framework improves confounder plausibility and utility.
Abstract
Hidden confounding remains a fundamental challenge in causal inference from observational data. Recent advances leverage Large Language Models (LLMs) to generate plausible hidden confounders based on domain knowledge, yet a critical gap exists: LLM-generated confounders often exhibit semantic plausibility without statistical utility. We propose VIGOR+ (Variational Information Gain for iterative cOnfounder Refinement), a novel framework that closes the loop between LLM-based confounder generation and CEVAE-based statistical validation. Unlike prior approaches that treat generation and validation as separate stages, VIGOR+ establishes an iterative feedback mechanism: validation signals from CEVAE (including information gain, latent consistency metrics, and diagnostic messages) are transformed into natural language feedback that guides subsequent LLM generation rounds. This iterative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Advanced Causal Inference Techniques
