PROMISE: Process Reward Models Unlock Test-Time Scaling Laws in Generative Recommendations

Chengcheng Guo; Kuo Cai; Yu Zhou; Qiang Luo; Ruiming Tang; Han Li; Kun Gai; Guorui Zhou

arXiv:2601.04674·cs.IR·January 9, 2026

TL;DR

Promise introduces a framework that integrates step-by-step verification into generative recommendation models, reducing semantic errors and enabling smaller models to perform as well as larger ones through test-time scaling.

Contribution

The paper proposes a PRM-based approach that combines dense, step-by-step verification with PRM-guided beam search, unlocking test-time scaling laws for recommender systems.

Findings

01. Reduces Semantic Drift in generative recommendations

02. Enables smaller models to match larger models' performance with increased inference compute

03. Improves recommendation accuracy significantly in large-scale online tests

Abstract

Generative Recommendation has emerged as a promising paradigm, reformulating recommendation as a sequence-to-sequence generation task over hierarchical Semantic IDs. However, existing methods suffer from a critical issue we term Semantic Drift, where errors in early, high-level tokens irreversibly divert the generation trajectory into irrelevant semantic subspaces. Inspired by Process Reward Models (PRMs) that enhance reasoning in Large Language Models, we propose Promise, a novel framework that integrates dense, step-by-step verification into generative models. Promise features a lightweight PRM to assess the quality of intermediate inference steps, coupled with a PRM-guided Beam Search strategy that leverages dense feedback to dynamically prune erroneous branches. Crucially, our approach unlocks Test-Time Scaling Laws for recommender systems: by increasing inference compute, smaller…
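The PRM-guided beam search described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the authors' implementation: the function names (`prm_guided_beam_search`, `toy_generator`, `toy_prm`), the additive rule that ranks a prefix by generator log-probability plus a weighted PRM score, and the toy two-level Semantic ID space. The sketch shows only the general idea that a step-level verifier can prune a branch whose first token is wrong before the error propagates to deeper levels.

```python
import math
from typing import Callable, List, Sequence, Tuple

Prefix = Tuple[int, ...]  # a partial Semantic ID, one code per hierarchy level


def prm_guided_beam_search(
    next_token_logprobs: Callable[[Prefix], Sequence[float]],
    prm_score: Callable[[Prefix], float],
    depth: int,
    beam_width: int,
    expand_width: int,
    prm_weight: float = 1.0,
) -> List[Tuple[Prefix, float]]:
    """Return up to `beam_width` (sequence, generator log-prob) pairs."""
    beams: List[Tuple[Prefix, float]] = [((), 0.0)]
    for _ in range(depth):
        candidates = []
        for prefix, logp in beams:
            step = next_token_logprobs(prefix)
            # Expand only the generator's top `expand_width` codes per beam.
            top = sorted(range(len(step)), key=lambda t: step[t], reverse=True)
            for tok in top[:expand_width]:
                new_prefix = prefix + (tok,)
                new_logp = logp + step[tok]
                # Assumed ranking rule: log-prob plus weighted step-level PRM score.
                rank = new_logp + prm_weight * prm_score(new_prefix)
                candidates.append((rank, new_prefix, new_logp))
        # Prune: keep the best `beam_width` prefixes under the combined score.
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = [(p, lp) for _, p, lp in candidates[:beam_width]]
    return beams


def toy_generator(prefix: Prefix) -> List[float]:
    """Hypothetical generator: prefers code 0 at the first level."""
    probs = [0.6, 0.3, 0.1] if len(prefix) == 0 else [0.2, 0.3, 0.5]
    return [math.log(p) for p in probs]


def toy_prm(prefix: Prefix) -> float:
    """Hypothetical verifier: penalizes prefixes outside the subspace of code 1."""
    return 0.0 if prefix[0] == 1 else -5.0


# With the PRM switched off (weight 0), the search follows the generator's
# preferred first-level code; with it on, that branch is pruned.
plain = prm_guided_beam_search(toy_generator, toy_prm, depth=2,
                               beam_width=1, expand_width=3, prm_weight=0.0)
guided = prm_guided_beam_search(toy_generator, toy_prm, depth=2,
                                beam_width=1, expand_width=3, prm_weight=1.0)
print(plain[0][0], guided[0][0])
```

In this toy setup, raising `beam_width` or `expand_width` spends more inference compute on exploring and verifying prefixes, which is the knob a test-time scaling study would vary; how the paper actually combines the two scores is not stated in the text above.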

Taxonomy

Topics: Recommender Systems and Techniques · Explainable Artificial Intelligence (XAI) · Topic Modeling