Interpretable Model-Aware Counterfactual Explanations for Random Forest
Joshua S. Harvey, Guanchao Feng, Sai Anusha Meesala, Tina Zhao, Dhagash Mehta

TL;DR
This paper introduces a novel method for generating interpretable counterfactual explanations for random forest models by leveraging the model's learned representations, resulting in more actionable and sparse explanations than traditional approaches.
Contribution
The paper proposes a new approach to counterfactual explanation generation that uses similarity learning within the random forest's own structure, improving interpretability and usefulness.
Findings
Produces sparser explanations than Shapley values.
Generates more actionable counterfactuals.
Effective on datasets like MNIST and German credit.
Abstract
Despite their enormous predictive power, machine learning models are often unsuitable for applications in regulated industries such as finance, due to their limited capacity to provide explanations. While model-agnostic frameworks such as Shapley values have proved to be convenient and popular, they rarely align with the kinds of causal explanations that are typically sought after. Counterfactual case-based explanations, where an individual is informed of which circumstances would need to be different to cause a change in outcome, may be more intuitive and actionable. However, finding appropriate counterfactual cases is an open challenge, as is interpreting which features are most critical for the change in outcome. Here, we pose the question of counterfactual search and interpretation in terms of similarity learning, exploiting the representation learned by the random forest predictive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Financial Distress and Bankruptcy Prediction
