Learning to Reflect and Correct: Towards Better Decoding Trajectories for Large-Scale Generative Recommendation

Haibo Xing; Hao Deng; Lingyu Mu; Jinxin Hu; Yu Zhang; Xiaoyi Zeng; Jing Zhang

arXiv:2602.23639·cs.IR·March 2, 2026

Learning to Reflect and Correct: Towards Better Decoding Trajectories for Large-Scale Generative Recommendation

Haibo Xing, Hao Deng, Lingyu Mu, Jinxin Hu, Yu Zhang, Xiaoyi Zeng, Jing Zhang

PDF

Open Access

TL;DR

This paper introduces GRC, a structured reflection-correction framework for generative recommendation that improves decoding quality through multi-stage refinement and reinforcement learning, leading to significant performance gains.

Contribution

It proposes the first structured reflection-correction framework for generative recommendation, enhancing decoding via multi-granular reflection and reinforcement learning.

Findings

01

GRC outperforms six state-of-the-art baselines by up to 15.74%.

02

Online A/B tests show a 1.79% increase in advertising revenue.

03

The EGRS strategy effectively allocates correction resources during decoding.

Abstract

Generative Recommendation (GR) has become a promising paradigm for large-scale recommendation systems. However, existing GR models typically perform single-pass decoding without explicit refinement, causing early deviations to accumulate and ultimately degrade recommendation quality. To tackle this problem, we propose GRC, which is, to our knowledge, the first structured reflection-correction framework for GR that extends standard decoding into a Generation-Reflection-Correction (GRC) process. Concretely, GRC introduces a supervised reflection-correction template that decomposes the decoding process into initial draft generation, multi-granular reflection, and reflection-guided correction, thereby enabling structured reflection and correction in the semantic token space. To further explore the enlarged refinement space introduced by the GRC process, we optimize the entire GRC trajectory…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Generative Adversarial Networks and Image Synthesis · Explainable Artificial Intelligence (XAI)