Benchmarking Deep Learning Interpretability in Time Series Predictions

Aya Abdelsalam Ismail; Mohamed Gunady; Héctor Corrada Bravo; Soheil Feizi

arXiv:2010.13924 · cs.LG · October 28, 2020 · 87 cites

TL;DR

This paper benchmarks various saliency-based interpretability methods across different neural architectures for time series prediction, revealing their limitations and proposing a two-step rescaling approach to improve feature importance detection.

Contribution

It provides a comprehensive comparison of interpretability methods for time series and introduces a novel two-step temporal saliency rescaling technique.

Findings

01. Saliency methods often fail to reliably identify feature importance in time series.

02. Failures are mainly due to conflation of time and feature domains.

03. Two-step temporal saliency rescaling significantly improves interpretability.
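
This page names the two-step temporal saliency rescaling technique but does not describe its procedure. The following is a minimal sketch of one plausible reading, not the authors' implementation. It assumes that step one scores each time step by how much a saliency map changes when that step is zeroed out, that step two scores each feature the same way, and that the two scores are multiplied for time steps above a threshold. The names `two_step_rescale`, `map_change`, `toy_saliency` and the threshold `alpha` are hypothetical.

```python
from typing import Callable, List

Grid = List[List[float]]  # indexed as grid[time][feature]


def map_change(a: Grid, b: Grid) -> float:
    """Sum of absolute differences between two saliency maps."""
    return sum(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def two_step_rescale(x: Grid,
                     saliency_fn: Callable[[Grid], Grid],
                     alpha: float = 0.0) -> Grid:
    """Hypothetical two-step rescaling of an arbitrary saliency function."""
    base = saliency_fn(x)
    n_time, n_feat = len(x), len(x[0])

    # Step 1 (assumed): time relevance = change in the map when one
    # time step is zeroed across all features.
    time_score = []
    for t in range(n_time):
        masked = [row[:] for row in x]
        masked[t] = [0.0] * n_feat
        time_score.append(map_change(base, saliency_fn(masked)))

    # Step 2 (assumed): feature relevance = change in the map when one
    # feature is zeroed across all time steps.
    feat_score = []
    for f in range(n_feat):
        masked = [row[:] for row in x]
        for row in masked:
            row[f] = 0.0
        feat_score.append(map_change(base, saliency_fn(masked)))

    # Combine: time steps at or below the threshold get zero importance.
    return [[time_score[t] * feat_score[f] if time_score[t] > alpha else 0.0
             for f in range(n_feat)]
            for t in range(n_time)]


def toy_saliency(x: Grid) -> Grid:
    """Stand-in saliency for illustration only: absolute input value."""
    return [[abs(v) for v in row] for row in x]


x = [[1.0, 0.0], [0.0, 0.0], [2.0, 0.0]]
scores = two_step_rescale(x, toy_saliency, alpha=0.5)
print(scores)
```

Separating the time score from the feature score is the point of the sketch: a cell is only ranked highly when both its time step and its feature matter, which is one way to avoid conflating the two domains.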

Abstract

Saliency methods are used extensively to highlight the importance of input features in model predictions. These methods are mostly used in vision and language tasks, and their application to time series data is relatively unexplored. In this paper, we set out to extensively compare the performance of various saliency-based interpretability methods across diverse neural architectures, including Recurrent Neural Networks, Temporal Convolutional Networks, and Transformers, on a new benchmark of synthetic time series data. We propose and report multiple metrics to empirically evaluate the performance of saliency methods for detecting feature importance over time using both precision (i.e., whether identified features contain meaningful signals) and recall (i.e., the number of features with signal identified as important). Through several experiments, we show that (i) in general, network…
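
The precision and recall described in the abstract can be illustrated with a small sketch. The abstract is truncated here and does not give the exact metric definitions, so this assumes a plain set-based version: the top-k cells of a saliency map are compared against a known mask of informative cells, as would be available on synthetic data. The function names and the toy values are hypothetical.

```python
from typing import List, Set, Tuple

Cell = Tuple[int, int]  # (time index, feature index)


def top_k_cells(saliency: List[List[float]], k: int) -> Set[Cell]:
    """Return the k (time, feature) cells with the largest saliency."""
    cells = [(value, t, f)
             for t, row in enumerate(saliency)
             for f, value in enumerate(row)]
    cells.sort(key=lambda c: c[0], reverse=True)
    return {(t, f) for _, t, f in cells[:k]}


def precision_recall(saliency: List[List[float]],
                     informative: List[List[bool]],
                     k: int) -> Tuple[float, float]:
    """Set-based precision and recall of the top-k salient cells."""
    chosen = top_k_cells(saliency, k)
    truth = {(t, f)
             for t, row in enumerate(informative)
             for f, flag in enumerate(row) if flag}
    hits = len(chosen & truth)
    # Precision: share of highlighted cells that truly carry signal.
    precision = hits / len(chosen) if chosen else 0.0
    # Recall: share of signal-carrying cells that were highlighted.
    recall = hits / len(truth) if truth else 0.0
    return precision, recall


# Toy map with 3 time steps and 2 features; only feature 0 carries signal.
saliency = [[0.9, 0.1], [0.8, 0.2], [0.05, 0.7]]
informative = [[True, False], [True, False], [True, False]]
p, r = precision_recall(saliency, informative, k=3)
print(p, r)
```

In this toy case the map highlights two of the three informative cells and one uninformative cell, so both values come out at two thirds.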

Code & Models

Repositories

ayaabdelsalam91/TS-Interpretability-Benchmark
PyTorch · Official

Videos

Benchmarking Deep Learning Interpretability in Time Series Predictions · SlidesLive

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning

Methods: Interpretability