TL;DR
This paper introduces ACCENT, a novel framework that generates tangible counterfactual explanations for neural recommender systems by identifying influential training points and iteratively deducing counterfactuals, enhancing interpretability.
Contribution
The paper presents the first general method for producing counterfactual explanations for neural recommenders, extending influence functions to pairs of items and applying it to popular models.
Findings
Successfully applied to NCF and RCF models
Demonstrated on MovieLens 100K dataset
Improves interpretability of neural recommenders
Abstract
Understanding why specific items are recommended to users can significantly increase their trust and satisfaction in the system. While neural recommenders have become the state-of-the-art in recent years, the complexity of deep models still makes the generation of tangible explanations for end users a challenging problem. Existing methods are usually based on attention distributions over a variety of features, which are still questionable regarding their suitability as explanations, and rather unwieldy to grasp for an end user. Counterfactual explanations based on a small set of the user's own actions have been shown to be an acceptable solution to the tangibility problem. However, current work on such counterfactuals cannot be readily applied to neural models. In this work, we propose ACCENT, the first general framework for finding counterfactual explanations for neural recommenders.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsCounterfactuals Explanations
