Collaboration of Large Language Models and Small Recommendation Models   for Device-Cloud Recommendation

Zheqi Lv; Tianyu Zhan; Wenjie Wang; Xinyu Lin; Shengyu Zhang; Wenqiao; Zhang; Jiwei Li; Kun Kuang; Fei Wu

arXiv:2501.05647·cs.IR·February 26, 2025

Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation

Zheqi Lv, Tianyu Zhan, Wenjie Wang, Xinyu Lin, Shengyu Zhang, Wenqiao, Zhang, Jiwei Li, Kun Kuang, Fei Wu

PDF

TL;DR

This paper proposes a collaborative device-cloud framework combining large language models and small recommendation models to improve real-time recommendation accuracy while reducing costs and resource usage.

Contribution

It introduces the LSC4Rec framework that synergistically integrates LLMs and SRMs with collaborative training and inference strategies for device-cloud recommendation systems.

Findings

01

Enhanced recommendation accuracy through collaborative strategies

02

Effective real-time user preference capture on devices

03

Validated improvements via extensive experiments

Abstract

Large Language Models (LLMs) for Recommendation (LLM4Rec) is a promising research direction that has demonstrated exceptional performance in this field. However, its inability to capture real-time user preferences greatly limits the practical application of LLM4Rec because (i) LLMs are costly to train and infer frequently, and (ii) LLMs struggle to access real-time data (its large number of parameters poses an obstacle to deployment on devices). Fortunately, small recommendation models (SRMs) can effectively supplement these shortcomings of LLM4Rec diagrams by consuming minimal resources for frequent training and inference, and by conveniently accessing real-time data on devices. In light of this, we designed the Device-Cloud LLM-SRM Collaborative Recommendation Framework (LSC4Rec) under a device-cloud collaboration setting. LSC4Rec aims to integrate the advantages of both LLMs and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methodsstyle-based recalibration module