TL;DR
This paper introduces T-COL, a novel method for generating counterfactual explanations in machine learning that adapts to general user preferences and variable models, improving robustness and personalization.
Contribution
The paper proposes T-COL, a tree-based approach that generates adaptable counterfactual explanations aligned with user preferences and robust to model changes.
Findings
T-COL outperforms baselines in adapting to user preferences.
T-COL maintains robustness even when ML models are replaced.
Experiments validate the effectiveness of T-COL in real-world scenarios.
Abstract
To address the interpretability challenge in machine learning (ML) systems, counterfactual explanations (CEs) have emerged as a promising solution. CEs are unique as they provide workable suggestions to users, instead of explaining why a certain outcome was predicted. The application of CEs encounters two main challenges: general user preferences and variable ML systems. On one hand, user preferences for specific values can vary depending on the task and scenario. On the other hand, the ML systems for verification may change while the CEs are performed. Thus, user preferences tend to be general rather than specific, and CEs need to be adaptable to variable ML models while maintaining robustness even as these models change. Facing these challenges, we propose general user preferences based on insights from psychology and behavioral science, and add the challenge of non-static ML systems…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
