Explanation-based Counterfactual Retraining(XCR): A Calibration Method for Black-box Models
Liu Zhendong, Wenyu Jiang, Yi Zhang, Chongjun Wang

TL;DR
This paper introduces XCR, a novel calibration method that uses explanations as counterfactual inputs to retrain black-box models, improving robustness and addressing out-of-distribution issues without altering model structure.
Contribution
XCR is a new explanation-based retraining approach that enhances black-box model calibration and robustness by leveraging explanation-generated counterfactuals.
Findings
XCR improves model performance using only 12.5% of crucial features.
XCR enhances robustness and calibration on corruption datasets.
XCR outperforms existing OOD calibration methods.
Abstract
With the rapid development of eXplainable Artificial Intelligence (XAI), a long line of past work has shown concerns about the Out-of-Distribution (OOD) problem in perturbation-based post-hoc XAI models and explanations are socially misaligned. We explore the limitations of post-hoc explanation methods that use approximators to mimic the behavior of black-box models. Then we propose eXplanation-based Counterfactual Retraining (XCR), which extracts feature importance fastly. XCR applies the explanations generated by the XAI model as counterfactual input to retrain the black-box model to address OOD and social misalignment problems. Evaluation of popular image datasets shows that XCR can improve model performance when only retaining 12.5% of the most crucial features without changing the black-box model structure. Furthermore, the evaluation of the benchmark of corruption datasets shows…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Credit Risk and Financial Regulations
