Designing User-Centric Metrics for Evaluation of Counterfactual Explanations
Firdaus Ahmed Choudhury, Ethan Leicht, Jude Ethan Bislig, Hangzhi Guo, Amulya Yadav

TL;DR
This paper investigates the gap between artificial evaluation metrics and actual user preferences for counterfactual explanations, proposing a new user-centric model that better predicts user choices.
Contribution
It introduces the AWP model, a novel two-stage, user-centric evaluation framework for counterfactual explanations, validated through extensive user studies.
Findings
User-preferred CFEs matched proximity-based metrics only 63.81% of the time.
The AWP model predicts user preferences with 84.37% accuracy.
Existing metrics often do not align with real-world user evaluations.
Abstract
Counterfactual Explanations (CFEs) have grown in popularity as a means of offering actionable guidance by identifying the minimum changes in feature values required to flip an ML model's prediction to something more desirable. Unfortunately, most prior research on CFEs relies on artificial evaluation metrics, such as proximity, which may overlook end-user preferences and constraints, e.g., the user's perception of effort needed to make certain feature changes may differ from that of the model designer. To address this research gap, this paper makes three novel contributions. First, we conduct a pilot study with 20 crowd-workers on Amazon MTurk to experimentally validate the alignment of existing CF evaluation metrics with real-world user preferences. Results show that user-preferred CFEs matched those based on proximity in only 63.81% of cases, highlighting the limited applicability of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
