Continual Collaborative Distillation for Recommender System
Gyuseok Lee, SeongKu Kang, Wonbin Kweon, Hwanjo Yu

TL;DR
This paper introduces a continual collaborative distillation framework for recommender systems that enables both teacher and student models to adapt over data streams, improving efficiency and accuracy in dynamic environments.
Contribution
It presents a novel continual collaborative distillation method allowing dynamic adaptation of teacher and student models in non-stationary data streams for recommender systems.
Findings
CCD effectively adapts to new data in real-world datasets.
The framework maintains high recommendation accuracy.
Experimental results show improved efficiency and adaptability.
Abstract
Knowledge distillation (KD) has emerged as a promising technique for addressing the computational challenges associated with deploying large-scale recommender systems. KD transfers the knowledge of a massive teacher system to a compact student model, to reduce the huge computational burdens for inference while retaining high accuracy. The existing KD studies primarily focus on one-time distillation in static environments, leaving a substantial gap in their applicability to real-world scenarios dealing with continuously incoming users, items, and their interactions. In this work, we delve into a systematic approach to operating the teacher-student KD in a non-stationary data stream. Our goal is to enable efficient deployment through a compact student, which preserves the high performance of the massive teacher, while effectively adapting to continuously incoming data. We propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProcess Optimization and Integration
MethodsFocus · Collaborative Distillation
