LASAR: Latent Adaptive Semantic Aligned Reasoning for Generative Recommendation
Yiwen Chen, Fuwei Zhang, Zehao Chen, Deqing Wang, Hehan Li, Peizhi Xu, Hanmeng Liu, Shuanglong Li, Xin Pei, Fuzhen Zhuang, Zhao Zhang

TL;DR
LASAR introduces an efficient latent reasoning framework for generative recommendation, significantly reducing inference latency and improving recommendation quality by adaptively controlling reasoning steps.
Contribution
It proposes a novel SFT-then-RL training paradigm with semantic alignment and adaptive reasoning depth, addressing key challenges in latent reasoning for recommendation systems.
Findings
LASAR nearly halves the average latent reasoning steps.
It is roughly 20 times faster than explicit CoT text generation.
LASAR outperforms all baseline methods on three real-world datasets.
Abstract
Large Language Models (LLMs) have demonstrated powerful reasoning capabilities through Chain-of-Thought (CoT) in various tasks, yet the inefficiency of token-by-token generation hinders real-world deployment in latency-sensitive recommender systems. Latent reasoning has emerged as an effective paradigm in LLMs, performing multi-step inference in a continuous hidden-state space to achieve stronger reasoning at lower cost. However, this paradigm remains underexplored in mainstream generative recommendation. Adapting it reveals three unique challenges: (1) the gap between prior-less Semantic ID (SID) symbols and continuous latent reasoning - SIDs lack pre-trained semantics, hindering joint optimization; (2) representation drift due to a lack of reasoning chain supervision; and (3) the suboptimality of applying a globally fixed reasoning depth. To address these, we propose LASAR (Latent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
