Learning to Reason: Temporal Saliency Distillation for Interpretable Knowledge Transfer

Nilushika Udayangani Hewa Dehigahawattage; Kishor Nandakishor; Marimuthu Palaniswami

arXiv:2601.04263·cs.LG·January 12, 2026

Learning to Reason: Temporal Saliency Distillation for Interpretable Knowledge Transfer

Nilushika Udayangani Hewa Dehigahawattage, Kishor Nandakishor, Marimuthu Palaniswami

PDF

Open Access

TL;DR

This paper introduces Temporal Saliency Distillation, a method for transferring interpretable, time-aware knowledge from teacher to student models in time series analysis, enhancing interpretability and performance.

Contribution

It extends traditional knowledge distillation by incorporating temporal saliency to improve interpretability and effectiveness in time series models.

Findings

01

Improves student model performance over baseline methods.

02

Enhances interpretability by aligning input feature importance.

03

Requires no additional parameters or architecture changes.

Abstract

Knowledge distillation has proven effective for model compression by transferring knowledge from a larger network called the teacher to a smaller network called the student. Current knowledge distillation in time series is predominantly based on logit and feature aligning techniques originally developed for computer vision tasks. These methods do not explicitly account for temporal data and fall short in two key aspects. First, the mechanisms by which the transferred knowledge helps the student model learning process remain unclear due to uninterpretability of logits and features. Second, these methods transfer only limited knowledge, primarily replicating the teacher predictive accuracy. As a result, student models often produce predictive distributions that differ significantly from those of their teachers, hindering their safe substitution for teacher models. In this work, we propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning