Exposing Vulnerabilities in Explanation for Time Series Classifiers via Dual-Target Attacks

Bohan Wang; Zewen Liu; Lu Lin; Hui Liu; Li Xiong; Ming Jin; Wei Jin

arXiv:2602.02763·cs.LG·February 10, 2026

Open Access

TL;DR

This paper shows that explanations for time series classifiers can be manipulated independently of their predictions, so explanation stability is not a reliable indicator of model robustness. It introduces TSEF, a dual-target attack that exploits this decoupling.

Contribution

The authors propose TSEF, a novel dual-target attack that jointly manipulates classifier predictions and explanations, exposing vulnerabilities in explanation-based robustness assessments.

Findings

01 TSEF can force a targeted change in the prediction while keeping the explanation consistent with a chosen reference.

02 Explanation stability does not reliably indicate model robustness.

03 The attack is effective across multiple datasets and explanation methods.

Abstract

Interpretable time series deep learning systems are often assessed by checking the temporal consistency of explanations, implicitly treating this as evidence of robustness. We show that this assumption can fail: predictions and explanations can be adversarially decoupled, enabling targeted misclassification while the explanation remains plausible and consistent with a chosen reference rationale. We propose TSEF (Time Series Explanation Fooler), a dual-target attack that jointly manipulates the classifier and explainer outputs. In contrast to single-objective misclassification attacks that disrupt explanations and spread attribution mass broadly, TSEF achieves targeted prediction changes while keeping explanations consistent with the reference. Across multiple datasets and explainer backbones, our results consistently reveal that explanation stability is a misleading proxy for decision…
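
The dual-target idea in the abstract can be illustrated with a toy sketch. This is not the authors' TSEF implementation: the linear classifier, the normalized gradient-times-input explainer, the squared-error explanation term, the weight `lam`, the finite-difference gradients, and the sign-step update under an L-infinity budget `eps` are all assumptions chosen to keep the example small and self-contained. It only shows the shape of an objective that pushes the prediction toward a target class while pulling the explanation toward a reference.

```python
import numpy as np

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def predict_proba(x, W, b):
    # Toy linear classifier over a univariate series (assumption, not the paper's model).
    return softmax(W @ x + b)

def explain(x, W, cls):
    # Toy explainer (assumption): normalized |gradient x input| for class `cls`.
    attr = np.abs(W[cls] * x)
    return attr / (attr.sum() + 1e-12)

def dual_target_loss(x_adv, W, b, target_cls, ref_expl, lam=1.0):
    # Term 1: cross-entropy toward the attacker's target class.
    ce = -np.log(predict_proba(x_adv, W, b)[target_cls] + 1e-12)
    # Term 2: squared distance between the current and the reference explanation.
    expl_gap = np.sum((explain(x_adv, W, target_cls) - ref_expl) ** 2)
    return ce + lam * expl_gap

def numerical_grad(f, x, h=1e-5):
    # Central finite differences; adequate for short toy series.
    g = np.zeros_like(x)
    for i in range(x.size):
        step = np.zeros_like(x)
        step[i] = h
        g[i] = (f(x + step) - f(x - step)) / (2 * h)
    return g

def dual_target_attack(x, W, b, target_cls, ref_expl,
                       eps=0.5, alpha=0.05, steps=50, lam=1.0):
    f = lambda z: dual_target_loss(z, W, b, target_cls, ref_expl, lam)
    x_adv, best, best_loss = x.copy(), x.copy(), f(x)
    for _ in range(steps):
        # Signed descent step, then projection back into the eps-ball around x.
        x_adv = x_adv - alpha * np.sign(numerical_grad(f, x_adv))
        x_adv = np.clip(x_adv, x - eps, x + eps)
        loss = f(x_adv)
        if loss < best_loss:
            best, best_loss = x_adv.copy(), loss
    return best

# Hypothetical setup: random weights and a random series of length 16, 3 classes.
rng = np.random.default_rng(0)
T, C = 16, 3
W, b = rng.normal(size=(C, T)), np.zeros(C)
x = rng.normal(size=T)
orig_cls = int(np.argmax(predict_proba(x, W, b)))
target_cls = (orig_cls + 1) % C
ref_expl = explain(x, W, orig_cls)  # reference rationale: the clean explanation
x_adv = dual_target_attack(x, W, b, target_cls, ref_expl)
```

Setting `lam=0` reduces the objective to a single-target misclassification attack, which is the baseline the abstract contrasts against. Whether the prediction actually flips in this toy setup depends on the random weights and the budget `eps`.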

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Graph Neural Networks