Generalizability vs. Counterfactual Explainability Trade-Off
Fabiano Veglianti, Flavio Giorgi, Fabrizio Silvestri, Gabriele Tolomei

TL;DR
This paper explores the inherent trade-off between a model's ability to generalize and its counterfactual explainability, introducing a new measure called $ ext{ extepsilon}$-VCP to quantify this relationship.
Contribution
It introduces $ ext{ extepsilon}$-VCP, a theoretical framework linking model overfitting to increased counterfactual explainability, supported by empirical validation.
Findings
$ ext{ extepsilon}$-VCP increases with overfitting
Poor generalization correlates with easier counterfactual generation
$ ext{ extepsilon}$-VCP can serve as a proxy for overfitting
Abstract
In this work, we investigate the relationship between model generalization and counterfactual explainability in supervised learning. We introduce the notion of -valid counterfactual probability (-VCP) -- the probability of finding perturbations of a data point within its -neighborhood that result in a label change. We provide a theoretical analysis of -VCP in relation to the geometry of the model's decision boundary, showing that -VCP tends to increase with model overfitting. Our findings establish a rigorous connection between poor generalization and the ease of counterfactual generation, revealing an inherent trade-off between generalization and counterfactual explainability. Empirical results validate our theory, suggesting -VCP as a practical proxy for quantitatively characterizing overfitting.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Generative Adversarial Networks and Image Synthesis
