SCaLRec: Semantic Calibration for LLM-enabled Cloud-Device Sequential Recommendation

Ruiqi Zheng; Jinli Cao; Jiao Yin; Hongzhi Yin

arXiv:2601.22543·cs.IR·February 2, 2026

SCaLRec: Semantic Calibration for LLM-enabled Cloud-Device Sequential Recommendation

Ruiqi Zheng, Jinli Cao, Jiao Yin, Hongzhi Yin

PDF

Open Access

TL;DR

This paper introduces SCaLRec, a method that calibrates cached cloud semantic embeddings on-device to address staleness and improve sequential recommendation quality without frequent cloud LLM inference.

Contribution

SCaLRec proposes a novel on-device semantic calibration technique to maintain high-quality recommendations despite cloud semantic staleness, reducing reliance on cloud LLMs.

Findings

01

SCaLRec improves recommendation accuracy over baselines.

02

It effectively detects when cached semantics are still reliable.

03

The method reduces the need for frequent cloud LLM calls.

Abstract

Cloud-device collaborative recommendation partitions computation across the cloud and user devices: the cloud provides semantic user modeling, while the device leverages recent interactions and cloud semantic signals for privacy-preserving, responsive reranking. With large language models (LLMs) on the cloud, semantic user representations can improve sequential recommendation by capturing high-level intent. However, regenerating such representations via cloud LLM inference for every request is often infeasible at real-world scale. As a result, on-device reranking commonly reuses a cached cloud semantic user embedding across requests. We empirically identify a cloud semantic staleness effect: reused embeddings become less aligned with the user's latest interactions, leading to measurable ranking degradation. Most existing LLM-enabled cloud-device recommenders are typically designed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · IoT and Edge/Fog Computing · Big Data and Digital Economy