Rethinking Large Language Model Architectures for Sequential   Recommendations

Hanbing Wang; Xiaorui Liu; Wenqi Fan; Xiangyu Zhao; Venkataramana; Kini; Devendra Yadav; Fei Wang; Zhen Wen; Jiliang Tang; Hui Liu

arXiv:2402.09543·cs.IR·February 16, 2024·2 cites

Rethinking Large Language Model Architectures for Sequential Recommendations

Hanbing Wang, Xiaorui Liu, Wenqi Fan, Xiangyu Zhao, Venkataramana, Kini, Devendra Yadav, Fei Wang, Zhen Wen, Jiliang Tang, Hui Liu

PDF

Open Access

TL;DR

This paper introduces Lite-LLM4Rec, a streamlined large language model architecture for sequential recommendation that significantly reduces inference costs while maintaining high performance, by eliminating beam search and employing a hierarchical structure.

Contribution

The paper proposes Lite-LLM4Rec, a novel LLM-based recommendation model that improves inference efficiency and effectiveness through a simple item projection head and hierarchical design.

Findings

01

Achieves 46.8% performance improvement on ML-1m dataset.

02

Reduces inference time by 97.28% compared to existing methods.

03

Maintains high recommendation quality with simplified architecture.

Abstract

Recently, sequential recommendation has been adapted to the LLM paradigm to enjoy the power of LLMs. LLM-based methods usually formulate recommendation information into natural language and the model is trained to predict the next item in an auto-regressive manner. Despite their notable success, the substantial computational overhead of inference poses a significant obstacle to their real-world applicability. In this work, we endeavor to streamline existing LLM-based recommendation models and propose a simple yet highly effective model Lite-LLM4Rec. The primary goal of Lite-LLM4Rec is to achieve efficient inference for the sequential recommendation task. Lite-LLM4Rec circumvents the beam search decoding by using a straight item projection head for ranking scores generation. This design stems from our empirical observation that beam search decoding is ultimately unnecessary for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Recommender Systems and Techniques · Natural Language Processing Techniques