Multi-Grained Patch Training for Efficient LLM-based Recommendation
Jiayi Liao, Ruobing Xie, Sihang Li, Xiang Wang, Xingwu Sun, Zhanhui Kang, Xiangnan He

TL;DR
This paper introduces PatchRec, a two-stage training method that enhances LLMs to model long-term user interaction histories efficiently for recommendation tasks by using aggregated embeddings and time-aware fine-tuning.
Contribution
It proposes a novel multi-grained patch training approach that enables LLMs to better understand long-term user behavior in recommendation systems.
Findings
Improved ability to model longer user histories.
Enhanced efficiency in training and inference.
Better recommendation performance on benchmark datasets.
Abstract
Large Language Models (LLMs) have emerged as a new paradigm for recommendation by converting interacted item history into language modeling. However, constrained by the limited context length of LLMs, existing approaches have to truncate item history in the prompt, focusing only on recent interactions and sacrificing the ability to model long-term history. To enable LLMs to model long histories, we pursue a concise embedding representation for items and sessions. In the LLM embedding space, we construct an item's embedding by aggregating its textual token embeddings; similarly, we construct a session's embedding by aggregating its item embeddings. While efficient, this way poses two challenges since it ignores the temporal significance of user interactions and LLMs do not natively interpret our custom embeddings. To overcome these, we propose PatchRec, a multi-grained patch training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Image Retrieval and Classification Techniques · Data Management and Algorithms
MethodsActivation Patching
