DIET: Learning to Distill Dataset Continually for Recommender Systems

Jiaqing Zhang; Hao Wang; Mingjia Yin; Bo Chen; Qinglin Jia; Rui Zhou; Ruiming Tang; ChaoYi Ma; Enhong Chen

arXiv:2603.24958 · cs.IR · March 27, 2026

TL;DR

DIET is an evolving dataset distillation framework for recommender systems. It reduces training data to 1-2% of its original size and cuts computational cost while maintaining performance, which makes continual learning more scalable.

Contribution

This paper proposes DIET, a novel method for streaming dataset distillation that maintains a dynamic, compact dataset aligned with long-term training dynamics in recommender systems.

Findings

01. Reduces training data to 1-2% of the original size

02. Speeds up model iteration by up to 60 times

03. Distilled datasets generalize across different models

Abstract

Modern deep recommender models are trained under a continual learning paradigm, relying on massive and continuously growing streaming behavioral logs. On large-scale platforms, retraining models on full historical data for architecture comparison or iteration is prohibitively expensive, severely slowing down model development. This challenge calls for data-efficient approaches that can faithfully approximate full-data training behavior without repeatedly processing the entire evolving data stream. We formulate this problem as streaming dataset distillation for recommender systems and propose DIET, a unified framework that maintains a compact distilled dataset which evolves alongside streaming data while preserving training-critical signals. Unlike existing dataset distillation methods that construct a static distilled set, DIET models distilled data as an evolving…
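To make the problem setting concrete, the sketch below keeps a fixed-budget summary of an interaction stream that is updated as each new batch arrives. It is not DIET: the abstract does not describe the distillation procedure, so this uses plain reservoir sampling (Algorithm R) as a stand-in baseline that only illustrates the "compact set that evolves with the stream" idea. The class name `CompactStreamBuffer`, the `(user_id, item_id, label)` record layout, and the 1.5% budget are all assumptions made for illustration.

```python
import random
from typing import Iterable, List, Tuple

# Hypothetical record layout: (user_id, item_id, click_label).
Interaction = Tuple[int, int, int]


class CompactStreamBuffer:
    """Fixed-budget uniform sample of a stream (reservoir sampling).

    A stand-in baseline for illustration only; it is NOT the DIET method.
    """

    def __init__(self, budget: int, seed: int = 0) -> None:
        if budget <= 0:
            raise ValueError("budget must be positive")
        self.budget = budget
        self.seen = 0  # total records observed so far
        self.items: List[Interaction] = []
        self._rng = random.Random(seed)

    def update(self, batch: Iterable[Interaction]) -> None:
        """Fold a new batch into the buffer without revisiting old data."""
        for record in batch:
            self.seen += 1
            if len(self.items) < self.budget:
                # Buffer not yet full: keep every record.
                self.items.append(record)
            else:
                # Keep the new record with probability budget / seen,
                # overwriting a uniformly chosen slot.
                j = self._rng.randrange(self.seen)
                if j < self.budget:
                    self.items[j] = record


# Usage: ten synthetic batches of 1,000 interactions with a 1.5% budget.
data_rng = random.Random(42)
buffer = CompactStreamBuffer(budget=150, seed=0)
for _ in range(10):
    batch = [
        (data_rng.randrange(500), data_rng.randrange(2000), data_rng.randrange(2))
        for _ in range(1000)
    ]
    buffer.update(batch)

print(len(buffer.items), buffer.seen)  # prints: 150 10000
```

A uniform sample like this keeps memory and per-iteration training cost bounded, but it treats every interaction as equally important. The abstract's stated goal is stronger: the retained data should preserve training-critical signals so that training on it approximates full-data training behavior.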

Taxonomy

Topics: Recommender Systems and Techniques · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks