GRAM: A Generative Foundation Reward Model for Reward Generalization

Chenglong Wang; Yang Gan; Yifu Huo; Yongyu Mu; Qiaozhi He; Murun Yang; Bei Li; Tong Xiao; Chunliang Zhang; Tongran Liu; Jingbo Zhu

arXiv:2506.14175·cs.CL·January 27, 2026

GRAM: A Generative Foundation Reward Model for Reward Generalization

Chenglong Wang, Yang Gan, Yifu Huo, Yongyu Mu, Qiaozhi He, Murun Yang, Bei Li, Tong Xiao, Chunliang Zhang, Tongran Liu, Jingbo Zhu

PDF

Open Access 2 Repos 5 Models 2 Datasets

TL;DR

This paper introduces a generative reward model trained with both unlabeled and labeled data, improving reward generalization across multiple tasks in language model alignment.

Contribution

It develops a novel generative reward model trained via unsupervised and supervised learning, linking generative and discriminative approaches under common training objectives.

Findings

01

Model generalizes well across multiple tasks

02

Achieves significant performance improvements

03

Effective with little or no fine-tuning

Abstract

In aligning large language models (LLMs), reward models have played an important role, but are standardly trained as discriminative models and rely only on labeled human preference data. In this paper, we explore methods that train reward models using both unlabeled and labeled data. Building on the generative models in LLMs, we develop a generative reward model that is first trained via large-scale unsupervised learning and then fine-tuned via supervised learning. We also show that by using label smoothing, we are in fact optimizing a regularized pairwise ranking loss. This result, in turn, provides a new view of training reward models, which links generative models and discriminative models under the same class of training objectives. The outcome of these techniques is a foundation reward model, which can be applied to a wide range of tasks with little or no further fine-tuning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical and Computational Modeling