CounterNet: End-to-End Training of Prediction Aware Counterfactual Explanations
Hangzhi Guo, Thanh Hong Nguyen, Amulya Yadav

TL;DR
CounterNet is an innovative end-to-end framework that simultaneously trains predictive models and generates counterfactual explanations, improving explanation quality and runtime efficiency compared to traditional post-hoc methods.
Contribution
This paper introduces CounterNet, the first end-to-end learning framework that integrates model training with counterfactual explanation generation, addressing limitations of post-hoc approaches.
Findings
Achieves 100% counterfactual validity across datasets.
Runs 3 times faster than state-of-the-art baselines.
Maintains high prediction accuracy and low explanation proximity.
Abstract
This work presents CounterNet, a novel end-to-end learning framework which integrates Machine Learning (ML) model training and the generation of corresponding counterfactual (CF) explanations into a single end-to-end pipeline. Counterfactual explanations offer a contrastive case, i.e., they attempt to find the smallest modification to the feature values of an instance that changes the prediction of the ML model on that instance to a predefined output. Prior techniques for generating CF explanations suffer from two major limitations: (i) all of them are post-hoc methods designed for use with proprietary ML models -- as a result, their procedure for generating CF explanations is uninformed by the training of the ML model, which leads to misalignment between model predictions and explanations; and (ii) most of them rely on solving separate time-intensive optimization problems to find CF…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
