Reasoning Meets Personalization: Unleashing the Potential of Large Reasoning Model for Personalized Generation

Sichun Luo; Guanzhi Deng; Jian Xu; Xiaojie Zhang; Hanxu Hou; Linqi Song

arXiv:2505.17571·cs.CL·May 26, 2025

Reasoning Meets Personalization: Unleashing the Potential of Large Reasoning Model for Personalized Generation

Sichun Luo, Guanzhi Deng, Jian Xu, Xiaojie Zhang, Hanxu Hou, Linqi Song

PDF

TL;DR

This paper evaluates large reasoning models for personalization, identifies key limitations, and proposes a novel framework with hierarchical reasoning and intervention methods to improve personalized generation performance.

Contribution

It introduces Reinforced Reasoning for Personalization ( extit{ReaPer}), a framework that enhances LRM personalization through structured reasoning templates and alignment techniques.

Findings

01

LRMs do not always outperform general LLMs in retrieval scenarios

02

The proposed extit{ReaPer} framework significantly improves personalization quality

03

Hierarchical reasoning and intervention methods enhance model alignment and consistency

Abstract

Personalization is a critical task in modern intelligent systems, with applications spanning diverse domains, including interactions with large language models (LLMs). Recent advances in reasoning capabilities have significantly enhanced LLMs, enabling unprecedented performance in tasks such as mathematics and coding. However, their potential for personalization tasks remains underexplored. In this paper, we present the first systematic evaluation of large reasoning models (LRMs) for personalization tasks. Surprisingly, despite generating more tokens, LRMs do not consistently outperform general-purpose LLMs, especially in retrieval-intensive scenarios where their advantages diminish. Our analysis identifies three key limitations: divergent thinking, misalignment of response formats, and ineffective use of retrieved information. To address these challenges, we propose Reinforced…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.