TL;DR
RoCourseNet is a novel training framework that jointly optimizes machine learning predictions and counterfactual recourses to be robust against future data distribution shifts, enhancing trustworthiness and validity of explanations.
Contribution
It introduces a tri-level optimization formulation and a virtual data shift algorithm to generate predictions and recourses resilient to distributional changes.
Findings
Achieves over 96% robust validity in experiments.
Outperforms state-of-the-art baselines by at least 10%.
Effectively generalizes to various post-hoc methods.
Abstract
Counterfactual (CF) explanations for machine learning (ML) models are preferred by end-users, as they explain the predictions of ML models by providing a recourse (or contrastive) case to individuals who are adversely impacted by predicted outcomes. Existing CF explanation methods generate recourses under the assumption that the underlying target ML model remains stationary over time. However, due to commonly occurring distributional shifts in training data, ML models constantly get updated in practice, which might render previously generated recourses invalid and diminish end-users trust in our algorithmic framework. To address this problem, we propose RoCourseNet, a training framework that jointly optimizes predictions and recourses that are robust to future data shifts. This work contains four key contributions: (1) We formulate the robust recourse generation problem as a tri-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
