SAGE: Sequence-level Adaptive Gradient Evolution for Generative Recommendation
Yu Xie, Xing Kai Ren, Ying Qi, Hu Yao

TL;DR
This paper introduces SAGE, a novel sequence-level adaptive gradient optimizer designed for generative recommendation systems, addressing limitations of existing methods and improving diversity, accuracy, and cold-start performance.
Contribution
SAGE provides a unified, sequence-aware optimization framework with adaptive bounds and multi-objective advantages, improving training stability and recommendation quality.
Findings
Improves top-K accuracy and cold-start recall.
Enhances diversity in recommendations.
Maintains training stability across datasets.
Abstract
Reinforcement learning-based preference optimization is increasingly used to align list-wise generative recommenders with complex, multi-objective user feedback, yet existing optimizers such as Gradient-Bounded Policy Optimization (GBPO) exhibit structural limitations in recommendation settings. We identify a Symmetric Conservatism failure mode in which symmetric update bounds suppress learning from rare positive signals (e.g., cold-start items), static negative-sample constraints fail to prevent diversity collapse under rejection-dominated feedback, and group-normalized multi-objective rewards lead to low-resolution training signals. To address these issues, we propose SAGE (Sequence-level Adaptive Gradient Evolution), a unified optimizer designed for list-wise generative recommendation. SAGE introduces sequence-level signal alignment via a geometric-mean importance ratio and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Generative Adversarial Networks and Image Synthesis · Explainable Artificial Intelligence (XAI)
