TimeSHAP: Explaining Recurrent Models through Sequence Perturbations

Jo\~ao Bento; Pedro Saleiro; Andr\'e F. Cruz; M\'ario A.T. Figueiredo,; Pedro Bizarro

arXiv:2012.00073·cs.LG·June 29, 2021

TimeSHAP: Explaining Recurrent Models through Sequence Perturbations

Jo\~ao Bento, Pedro Saleiro, Andr\'e F. Cruz, M\'ario A.T. Figueiredo,, Pedro Bizarro

PDF

1 Repo

TL;DR

TimeSHAP is a novel, model-agnostic explanation method for recurrent neural networks that provides detailed attributions at multiple levels and includes a pruning technique to improve efficiency and interpretability.

Contribution

The paper introduces TimeSHAP, extending KernelSHAP for sequential data, with a pruning method to reduce computational cost and attribution variance.

Findings

01

Sequences can be pruned to 10% of original length without losing key attribution information.

02

Most recent events contribute significantly to model predictions, averaging 41%.

03

High attribution to client age revealed potential bias in fraud detection model.

Abstract

Although recurrent neural networks (RNNs) are state-of-the-art in numerous sequential decision-making tasks, there has been little research on explaining their predictions. In this work, we present TimeSHAP, a model-agnostic recurrent explainer that builds upon KernelSHAP and extends it to the sequential domain. TimeSHAP computes feature-, timestep-, and cell-level attributions. As sequences may be arbitrarily long, we further propose a pruning method that is shown to dramatically decrease both its computational cost and the variance of its attributions. We use TimeSHAP to explain the predictions of a real-world bank account takeover fraud detection RNN model, and draw key insights from its explanations: i) the model identifies important features and events aligned with what fraud analysts consider cues for account takeover; ii) positive predicted sequences can be pruned to only 10% of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

feedzai/timeshap
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPruning