Unifying Prediction and Explanation in Time-Series Transformers via Shapley-based Pretraining
Qisen Cheng, Jinming Xing, Chang Xue, Xiaoran Yang

TL;DR
ShapTST introduces a pre-training framework for time-series transformers that unifies prediction and explanation, significantly reducing explanation computation costs while improving robustness and maintaining competitive accuracy.
Contribution
The paper presents a novel Shapley-based pre-training method that integrates explanation generation into training, eliminating expensive inference-time computations for explanations.
Findings
Achieved competitive classification and regression results on eight datasets.
Provided Shapley-value explanations comparable to post-hoc methods.
Enhanced model robustness against data noise.
Abstract
In this paper, we propose ShapTST, a framework that enables time-series transformers to efficiently generate Shapley-value-based explanations alongside predictions in a single forward pass. Shapley values are widely used to evaluate the contribution of different time-steps and features in a test sample, and are commonly generated through repeatedly inferring on each sample with different parts of information removed. Therefore, it requires expensive inference-time computations that occur at every request for model explanations. In contrast, our framework unifies the explanation and prediction in training through a novel Shapley-based pre-training design, which eliminates the undesirable test-time computation and replaces it with a single-time pre-training. Moreover, this specialized pre-training benefits the prediction performance by making the transformer model more effectively weigh…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Neural Networks and Applications · Anomaly Detection Techniques and Applications
