Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
Xueru Wen, Jie Lou, Zichao Li, Yaojie Lu, Xing Yu, Yuqiu Ji, Guohai Xu, Hongyu Lin, Ben He, Xianpei Han, Le Sun, Debing Zhang

TL;DR
This paper introduces CheemsBench and CheemsPreference to improve Chinese reward models, highlighting the importance of human supervision for better alignment with human preferences in Chinese language tasks.
Contribution
It provides the first Chinese-specific RM evaluation benchmark and a large-scale preference dataset, demonstrating the effectiveness of human supervision in training high-quality Chinese reward models.
Findings
Open-source RMs have limited ability to capture Chinese human preferences.
Human-supervised training yields state-of-the-art Chinese RM performance.
AI-generated data alone is insufficient for aligning models with human preferences.
Abstract
Reward models (RMs) are crucial for aligning large language models (LLMs) with human preferences. However, most RM research is centered on English and relies heavily on synthetic resources, which leads to limited and less reliable datasets and benchmarks for Chinese. To address this gap, we introduce CheemsBench, a fully human-annotated RM evaluation benchmark within Chinese contexts, and CheemsPreference, a large-scale and diverse preference dataset annotated through human-machine collaboration to support Chinese RM training. We systematically evaluate open-source discriminative and generative RMs on CheemsBench and observe significant limitations in their ability to capture human preferences in Chinese scenarios. Additionally, based on CheemsPreference, we construct an RM that achieves state-of-the-art performance on CheemsBench, demonstrating the necessity of human supervision in RM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Explainable Artificial Intelligence (XAI)
