Think before Recommendation: Autonomous Reasoning-enhanced Recommender

Xiaoyu Kong; Junguang Jiang; Bin Liu; Ziru Xu; Han Zhu; Jian Xu; Bo Zheng; Jiancan Wu; Xiang Wang

arXiv:2510.23077·cs.IR·October 28, 2025

Think before Recommendation: Autonomous Reasoning-enhanced Recommender

Xiaoyu Kong, Junguang Jiang, Bin Liu, Ziru Xu, Han Zhu, Jian Xu, Bo Zheng, Jiancan Wu, Xiang Wang

PDF

TL;DR

This paper introduces RecZero, an RL-based approach that trains a single LLM to develop autonomous reasoning capabilities for recommendation tasks, outperforming existing methods by avoiding distillation limitations.

Contribution

Proposes RecZero, a novel RL-based framework for training LLMs to autonomously reason for recommendations, eliminating the need for multi-model distillation.

Findings

01

RecZero outperforms baseline methods on benchmark datasets.

02

RecOne, a hybrid model, further improves performance with supervised fine-tuning.

03

Autonomous reasoning enhances recommendation accuracy.

Abstract

The core task of recommender systems is to learn user preferences from historical user-item interactions. With the rapid development of large language models (LLMs), recent research has explored leveraging the reasoning capabilities of LLMs to enhance rating prediction tasks. However, existing distillation-based methods suffer from limitations such as the teacher model's insufficient recommendation capability, costly and static supervision, and superficial transfer of reasoning ability. To address these issues, this paper proposes RecZero, a reinforcement learning (RL)-based recommendation paradigm that abandons the traditional multi-model and multi-stage distillation approach. Instead, RecZero trains a single LLM through pure RL to autonomously develop reasoning capabilities for rating prediction. RecZero consists of two key components: (1) "Think-before-Recommendation" prompt…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.