PAX-TS: Model-agnostic multi-granular explanations for time series forecasting via localized perturbations
Tim Kreuzer, Jelena Zdravkovic, Panagiotis Papapetrou

TL;DR
PAX-TS is a model-agnostic explanation method for time series forecasting that uses localized perturbations to generate multi-granular explanations and characterize cross-channel correlations, applicable across diverse datasets and models.
Contribution
It introduces PAX-TS, a novel post-hoc explanation algorithm tailored for time series forecasting, capable of multi-granular and multivariate explanations, outperforming existing methods.
Findings
Explanations differ between high- and low-performing models.
Identified 6 recurring pattern classes linked to performance.
Demonstrated effectiveness on 7 algorithms and 10 datasets.
Abstract
Time series forecasting has seen considerable improvement during the last years, with transformer models and large language models driving advancements of the state of the art. Modern forecasting models are generally opaque and do not provide explanations for their forecasts, while well-known post-hoc explainability methods like LIME are not suitable for the forecasting context. We propose PAX-TS, a model-agnostic post-hoc algorithm to explain time series forecasting models and their forecasts. Our method is based on localized input perturbations and results in multi-granular explanations. Further, it is able to characterize cross-channel correlations for multivariate time series forecasts. We clearly outline the algorithmic procedure behind PAX-TS, demonstrate it on a benchmark with 7 algorithms and 10 diverse datasets, compare it with two other state-of-the-art explanation algorithms,…
Peer Reviews
Decision·ICLR 2026 Conference Withdrawn Submission
- The paper addresses a practically important problem in time series forecasting. - PAX-TS is model-agnostic. The authors tested PAX-TS on various time series forecasting backbones. - Experiments are extensively conducted using multiple datasets.
- Some existing approaches for time series interpretation are not discussed or compared: - Prototype-based time series (or sequence) interpretability methods - Interpretable and steerable sequence learning via prototypes (KDD 2019) - Interpreting Convolutional Sequence Model by Learning Local Prototypes with Adaptation Regularization (CIKM 2021) - Protgnn: Towards self-explaining graph neural networks (AAAI 2022) - LLM-based interpretable methods - Explainable Multi-modal Tim
The patterns discovered in the experiments are intriguing and offer valuable insights. It would be interesting to explore how generalizable these patterns are and whether they could inform the design of inherently interpretable time-series forecasting models.
The provided code link is inaccessible, preventing reproducibility. The formalization section is difficult to follow—Section 3.1 describes localized perturbations without pseudocode or formulas, leaving room for ambiguity. It is also unclear whether Sections 3.2 and 3.3 present original contributions or restate existing concepts, and the inclusion of a running example would greatly improve clarity. Moreover, the experimental objectives are not clearly defined, and the presentation of results is
- The explanation methods seem to be simple enough and explainable itself. The methods are more based on traditional statistical properties of time series.
- Notations and methods are not thoroughly included. Methods are not carefully described. - As I am not an expert of time series forecasting, I am not confident about the following statement - I think the methods included for comparison are not SOTA. Comparing with other time series explainability papers (which are not limiting to forecasting), for example FIT, Dynamask, WinIT, TimeX, etc., the other methods chosen are ShapTime and TS-MULE. TS-MULE was from 4 years ago and ShapTime is not availa
1. The idea of using localized Gaussian perturbations to derive fine-grained importance maps is interesting. 2. The paper is clear-written.
1. The paper states that PAX-TS provides multi-granular explanations, but this concept is not clearly defined. It remains unclear whether the granularity refers to temporal resolution, statistical properties, or interpretive abstraction levels. 2. In experiment part, while the paper positions PAX-TS as superior to existing explainers, the experimental evidence is mostly qualitative. Providing quantitative evaluation would strengthen model credibility. 3. The introduction of width 𝑤, softness 𝑠
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Forecasting Techniques and Applications · Stock Market Forecasting Methods
