Feature Responsiveness Scores: Model-Agnostic Explanations for Recourse
Harry Cheon, Anneke Wernerfelt, Sorelle A. Friedler, Berk Ustun

TL;DR
This paper introduces feature responsiveness scores as a model-agnostic way to improve explanations for high-stakes decisions, ensuring they genuinely support recourse by focusing on actionable features.
Contribution
It proposes a novel responsiveness metric and efficient computation methods, addressing limitations of existing feature importance explanations like SHAP and LIME.
Findings
Standard explanations can highlight unresponsive features, undermining recourse.
Responsiveness scores identify features that decision subjects can realistically change.
Applying these scores improves the effectiveness of explanations in lending scenarios.
Abstract
Consumer protection rules require companies that deploy models to automate decisions in high-stakes settings to explain predictions to decision subjects. These rules are motivated, in part, by the belief that explanations can promote recourse by revealing information that decision subjects can use to contest or overturn their predictions. In practice, companies provide individuals with a list of principal reasons based on feature importance derived from methods like SHAP and LIME. In this work, we show how common practices can fail to provide recourse and propose to highlight features based on their responsiveness -- the probability that a decision subject can attain a target prediction through an arbitrary intervention on the feature. We develop efficient methods to compute responsiveness scores for any model and actionability constraints. We show that standard practices in lending can…
Peer Reviews
Decision·ICLR 2025 Poster
1. The paper effectively highlights a critical shortcoming in the use of feature attribution methods for generating explanations. By demonstrating that standard methods often provide non-actionable reasons, the authors reveal a key limitation that could undermine the intended goals of explainability and consumer protection in high-stakes applications. 2. The introduction of responsiveness scores offers a novel/nuanced approach to measuring "actionability." These scores help flag instances where
(1) While I find the concept of feature responsiveness quite interesting, the contributions of this paper appear marginal, particularly in light of the work [1]. This paper draws upon existing ideas introduced in [1], such as the notions of reachable sets and action sets, which are used for recourse verification there (i.e., determining if an individual can achieve recourse through actions in the feature space). In [1], the approach returns a binary output: 1 if there exists an action that achie
I clearly agree with the starting point of the paper, aiming to focus on another view of interpretability, where for a number of applications, only features that are actionable are relevant, while one would be interested in seeing how different outcomes may be if values for these features would change (even slightly). It is crucial to go away from thinking that Shapley values (and other similar approaches) are the go-to approach to bring interpretability to ML. Here, the approach is described in
In my view the only weakness of the paper is that it only concentrates on a given application area, while it could have been interesting to consider the ideas and concepts in the paper in a more general framework.
1) I believe this paper aims to address a very important problem with current feature attribution methods, that is, the features these methods identify as important are rarely those that can be modified/changed so that a different prediction may occur. 2) I believe the idea of action and reachable sets is an interesting one and I appreciate the authors trying to make these notions precise via theory. 3. The paper performs numerous experiments to demonstrate their result
1) I don't particularly think this work is very novel. There has been numerous works that propose counterfactual feature attribution methods, i.e. those that identify important features as the ones that, when changed, lead to a different prediction. Are these not pretty much responsiveness scores in the language of this paper. I advise the authors to take a look at these papers and address how their proposed notion of responsiveness is different than the notions proposes in the following works
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
MethodsShapley Additive Explanations · Local Interpretable Model-Agnostic Explanations · ALIGN
