Explanation-based Counterfactual Retraining(XCR): A Calibration Method   for Black-box Models

Liu Zhendong; Wenyu Jiang; Yi Zhang; Chongjun Wang

arXiv:2206.11126·cs.LG·June 23, 2022

Explanation-based Counterfactual Retraining(XCR): A Calibration Method for Black-box Models

Liu Zhendong, Wenyu Jiang, Yi Zhang, Chongjun Wang

PDF

Open Access

TL;DR

This paper introduces XCR, a novel calibration method that uses explanations as counterfactual inputs to retrain black-box models, improving robustness and addressing out-of-distribution issues without altering model structure.

Contribution

XCR is a new explanation-based retraining approach that enhances black-box model calibration and robustness by leveraging explanation-generated counterfactuals.

Findings

01

XCR improves model performance using only 12.5% of crucial features.

02

XCR enhances robustness and calibration on corruption datasets.

03

XCR outperforms existing OOD calibration methods.

Abstract

With the rapid development of eXplainable Artificial Intelligence (XAI), a long line of past work has shown concerns about the Out-of-Distribution (OOD) problem in perturbation-based post-hoc XAI models and explanations are socially misaligned. We explore the limitations of post-hoc explanation methods that use approximators to mimic the behavior of black-box models. Then we propose eXplanation-based Counterfactual Retraining (XCR), which extracts feature importance fastly. XCR applies the explanations generated by the XAI model as counterfactual input to retrain the black-box model to address OOD and social misalignment problems. Evaluation of popular image datasets shows that XCR can improve model performance when only retaining 12.5% of the most crucial features without changing the black-box model structure. Furthermore, the evaluation of the benchmark of corruption datasets shows…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Credit Risk and Financial Regulations