TL;DR
TimeSHAP is a novel, model-agnostic explanation method for recurrent neural networks that provides detailed attributions at multiple levels and includes a pruning technique to improve efficiency and interpretability.
Contribution
The paper introduces TimeSHAP, extending KernelSHAP for sequential data, with a pruning method to reduce computational cost and attribution variance.
Findings
Sequences can be pruned to 10% of original length without losing key attribution information.
Most recent events contribute significantly to model predictions, averaging 41%.
High attribution to client age revealed potential bias in fraud detection model.
Abstract
Although recurrent neural networks (RNNs) are state-of-the-art in numerous sequential decision-making tasks, there has been little research on explaining their predictions. In this work, we present TimeSHAP, a model-agnostic recurrent explainer that builds upon KernelSHAP and extends it to the sequential domain. TimeSHAP computes feature-, timestep-, and cell-level attributions. As sequences may be arbitrarily long, we further propose a pruning method that is shown to dramatically decrease both its computational cost and the variance of its attributions. We use TimeSHAP to explain the predictions of a real-world bank account takeover fraud detection RNN model, and draw key insights from its explanations: i) the model identifies important features and events aligned with what fraud analysts consider cues for account takeover; ii) positive predicted sequences can be pruned to only 10% of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning
