Active Large Language Model-based Knowledge Distillation for Session-based Recommendation
Yingpeng Du, Zhu Sun, Ziyan Wang, Haoyan Chua, Jie Zhang, Yew-Soon Ong

TL;DR
This paper introduces an active knowledge distillation approach for session-based recommendation using large language models, focusing on selecting the most informative instances to improve efficiency and effectiveness.
Contribution
It proposes a novel active learning strategy that optimally selects instances for knowledge distillation from LLMs, reducing cost and enhancing recommendation accuracy.
Findings
Significantly outperforms state-of-the-art methods on real-world datasets.
Efficiently distills knowledge from LLMs with limited computational resources.
Effectively avoids extracting ineffective instances during distillation.
Abstract
Large language models (LLMs) provide a promising way for accurate session-based recommendation (SBR), but they demand substantial computational time and memory. Knowledge distillation (KD)-based methods can alleviate these issues by transferring the knowledge to a small student, which trains a student based on the predictions of a cumbersome teacher. However, these methods encounter difficulties for \textit{LLM-based KD in SBR}. 1) It is expensive to make LLMs predict for all instances in KD. 2) LLMs may make ineffective predictions for some instances in KD, e.g., incorrect predictions for hard instances or similar predictions as existing recommenders for easy instances. In this paper, we propose an active LLM-based KD method in SBR, contributing to sustainable AI. To efficiently distill knowledge from LLMs with limited cost, we propose to extract a small proportion of instances…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling
MethodsKnowledge Distillation
