TL;DR
This paper compares different algorithms for generating counterfactual explanations in high-dimensional behavioral and textual data, highlighting LIME-C as a promising, efficient alternative to existing methods like SEDC.
Contribution
It empirically benchmarks LIME-C and SHAP-C against SEDC, demonstrating their efficiency and effectiveness in generating counterfactual explanations for complex data.
Findings
LIME-C and SHAP-C are faster but less efficient than SEDC overall.
SHAP-C struggles with highly unbalanced data.
LIME-C is a promising alternative with good overall performance.
Abstract
We study the interpretability of predictive systems that use high-dimensonal behavioral and textual data. Examples include predicting product interest based on online browsing data and detecting spam emails or objectionable web content. Recently, counterfactual explanations have been proposed for generating insight into model predictions, which focus on what is relevant to a particular instance. Conducting a complete search to compute counterfactuals is very time-consuming because of the huge dimensionality. To our knowledge, for behavioral and text data, only one model-agnostic heuristic algorithm (SEDC) for finding counterfactual explanations has been proposed in the literature. However, there may be better algorithms for finding counterfactuals quickly. This study aligns the recently proposed Linear Interpretable Model-agnostic Explainer (LIME) and Shapley Additive Explanations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsCounterfactuals Explanations · Interpretability
