MemKD: Memory-Discrepancy Knowledge Distillation for Efficient Time Series Classification

Nilushika Udayangani; Kishor Nandakishor; and Marimuthu Palaniswami

arXiv:2601.04264·cs.LG·January 12, 2026

MemKD: Memory-Discrepancy Knowledge Distillation for Efficient Time Series Classification

Nilushika Udayangani, Kishor Nandakishor, and Marimuthu Palaniswami

PDF

Open Access

TL;DR

MemKD introduces a novel knowledge distillation method that captures memory discrepancies in time series models, enabling the creation of compact yet high-performing models suitable for resource-constrained environments.

Contribution

We propose MemKD, a knowledge distillation framework specifically designed for time series models that accounts for memory retention discrepancies, improving efficiency and performance.

Findings

01

Reduces model size and memory usage by ~500 times.

02

Maintains comparable performance to large teacher models.

03

Outperforms existing KD methods on time series tasks.

Abstract

Deep learning models, particularly recurrent neural networks and their variants, such as long short-term memory, have significantly advanced time series data analysis. These models capture complex, sequential patterns in time series, enabling real-time assessments. However, their high computational complexity and large model sizes pose challenges for deployment in resource-constrained environments, such as wearable devices and edge computing platforms. Knowledge Distillation (KD) offers a solution by transferring knowledge from a large, complex model (teacher) to a smaller, more efficient model (student), thereby retaining high performance while reducing computational demands. Current KD methods, originally designed for computer vision tasks, neglect the unique temporal dependencies and memory retention characteristics of time series models. To this end, we propose a novel KD framework…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTime Series Analysis and Forecasting · Machine Learning in Healthcare · EEG and Brain-Computer Interfaces