TL;DR
This paper introduces DLLM2Rec, a novel knowledge distillation method that effectively transfers knowledge from large language models to lightweight sequential recommenders, significantly improving their performance and reducing inference latency.
Contribution
The paper proposes DLLM2Rec, a new distillation strategy addressing reliability, capacity gap, and semantic divergence challenges in transferring knowledge from LLMs to sequential recommenders.
Findings
Boosts sequential models by an average of 47.97% in performance.
Enables lightweight models to surpass some LLM-based recommenders.
Demonstrates effectiveness through extensive experiments.
Abstract
Owing to their powerful semantic reasoning capabilities, Large Language Models (LLMs) have been effectively utilized as recommenders, achieving impressive performance. However, the high inference latency of LLMs significantly restricts their practical deployment. To address this issue, this work investigates knowledge distillation from cumbersome LLM-based recommendation models to lightweight conventional sequential models. It encounters three challenges: 1) the teacher's knowledge may not always be reliable; 2) the capacity gap between the teacher and student makes it difficult for the student to assimilate the teacher's knowledge; 3) divergence in semantic space poses a challenge to distill the knowledge from embeddings. To tackle these challenges, this work proposes a novel distillation strategy, DLLM2Rec, specifically tailored for knowledge distillation from LLM-based recommendation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation
