TL;DR
This paper introduces TABCF, a transformer-based VAE approach for generating unbiased counterfactual explanations in tabular data, improving interpretability of black-box models in real-world applications.
Contribution
We propose a novel transformer-based VAE with a Gumbel-Softmax detokenizer for unbiased counterfactual generation in tabular data, addressing data complexity and feature interdependencies.
Findings
TABCF outperforms existing methods in generating effective counterfactuals.
It does not exhibit bias toward specific feature types.
Extensive evaluation on financial datasets confirms its effectiveness.
Abstract
In the field of Explainable AI (XAI), counterfactual (CF) explanations are one prominent method to interpret a black-box model by suggesting changes to the input that would alter a prediction. In real-world applications, the input is predominantly in tabular form and comprised of mixed data types and complex feature interdependencies. These unique data characteristics are difficult to model, and we empirically show that they lead to bias towards specific feature types when generating CFs. To overcome this issue, we introduce TABCF, a CF explanation method that leverages a transformer-based Variational Autoencoder (VAE) tailored for modeling tabular data. Our approach uses transformers to learn a continuous latent space and a novel Gumbel-Softmax detokenizer that enables precise categorical reconstruction while preserving end-to-end differentiability. Extensive quantitative evaluation on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsALIGN
