Distillation Matters: Empowering Sequential Recommenders to Match the   Performance of Large Language Model

Yu Cui; Feng Liu; Pengbo Wang; Bohao Wang; Heng Tang; Yi Wan; Jun; Wang; and Jiawei Chen

arXiv:2405.00338·cs.IR·August 21, 2024

Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model

Yu Cui, Feng Liu, Pengbo Wang, Bohao Wang, Heng Tang, Yi Wan, Jun, Wang, and Jiawei Chen

PDF

1 Repo

TL;DR

This paper introduces DLLM2Rec, a novel knowledge distillation method that effectively transfers knowledge from large language models to lightweight sequential recommenders, significantly improving their performance and reducing inference latency.

Contribution

The paper proposes DLLM2Rec, a new distillation strategy addressing reliability, capacity gap, and semantic divergence challenges in transferring knowledge from LLMs to sequential recommenders.

Findings

01

Boosts sequential models by an average of 47.97% in performance.

02

Enables lightweight models to surpass some LLM-based recommenders.

03

Demonstrates effectiveness through extensive experiments.

Abstract

Owing to their powerful semantic reasoning capabilities, Large Language Models (LLMs) have been effectively utilized as recommenders, achieving impressive performance. However, the high inference latency of LLMs significantly restricts their practical deployment. To address this issue, this work investigates knowledge distillation from cumbersome LLM-based recommendation models to lightweight conventional sequential models. It encounters three challenges: 1) the teacher's knowledge may not always be reliable; 2) the capacity gap between the teacher and student makes it difficult for the student to assimilate the teacher's knowledge; 3) divergence in semantic space poses a challenge to distill the knowledge from embeddings. To tackle these challenges, this work proposes a novel distillation strategy, DLLM2Rec, specifically tailored for knowledge distillation from LLM-based recommendation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

istarryn/dllm2rec
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsKnowledge Distillation