STARec: An Efficient Agent Framework for Recommender Systems via Autonomous Deliberate Reasoning

Chenghao Wu; Ruiyang Ren; Junjie Zhang; Ruirui Wang; Zhongrui Ma; Qi Ye; Wayne Xin Zhao

arXiv:2508.18812·cs.AI·August 27, 2025

STARec: An Efficient Agent Framework for Recommender Systems via Autonomous Deliberate Reasoning

Chenghao Wu, Ruiyang Ren, Junjie Zhang, Ruirui Wang, Zhongrui Ma, Qi Ye, Wayne Xin Zhao

PDF

TL;DR

STARec introduces a deliberative agent framework for recommender systems that enhances reasoning and decision-making, leading to improved performance with minimal training data.

Contribution

It presents a novel slow-thinking agent framework with anchored reinforcement training, enabling autonomous reasoning in recommender systems.

Findings

01

Significant performance improvements on MovieLens 1M and Amazon CDs datasets.

02

Achieves these gains using only 0.4% of the full training data.

03

Demonstrates the effectiveness of autonomous deliberative reasoning in recommendation accuracy.

Abstract

While modern recommender systems are instrumental in navigating information abundance, they remain fundamentally limited by static user modeling and reactive decision-making paradigms. Current large language model (LLM)-based agents inherit these shortcomings through their overreliance on heuristic pattern matching, yielding recommendations prone to shallow correlation bias, limited causal inference, and brittleness in sparse-data scenarios. We introduce STARec, a slow-thinking augmented agent framework that endows recommender systems with autonomous deliberative reasoning capabilities. Each user is modeled as an agent with parallel cognitions: fast response for immediate interactions and slow reasoning that performs chain-of-thought rationales. To cultivate intrinsic slow thinking, we develop anchored reinforcement training - a two-stage paradigm combining structured knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.