Posterior-Calibrated Causal Circuits in Variational Autoencoders: Why Image-Domain Interpretability Fails on Tabular Data
Dip Roy, Rajiv Misra, Sanjay Kumar Singh, Anisha Roy

TL;DR
This paper investigates the generalization of causal circuitry in variational autoencoders from image to tabular data, introduces new techniques, and evaluates their effectiveness across multiple architectures and datasets.
Contribution
It extends a causal intervention framework to tabular data, introduces three novel techniques, and provides empirical insights into the differences between image and tabular VAEs.
Findings
Tabular VAEs have about 50% lower circuit modularity than image VAEs.
β-VAE's causal effect strength collapses on tabular data due to reconstruction issues.
CES captures most architecture differences and high-specificity interventions predict better downstream performance.
Abstract
Although mechanism-based interpretability has generated an abundance of insight for discriminative network analysis, generative models are less understood -- particularly outside of image-related applications. We investigate how much of the causal circuitry found within image-related variational autoencoders (VAEs) will generalize to tabular data, as VAEs are increasingly used for imputation, anomaly detection, and synthetic data generation. In addition to extending a four-level causal intervention framework to four tabular and one image benchmark across five different VAE architectures (with 75 individual training runs per architecture and three random seed values for each run), this paper introduces three new techniques: posterior-calibration of Causal Effect Strength (CES), path-specific activation patching, and Feature-Group Disentanglement (FGD). The results from our experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
