WorldPM: Scaling Human Preference Modeling

Binghai Wang; Runji Lin; Keming Lu; Le Yu; Zhenru Zhang; Fei Huang; Chujie Zheng; Kai Dang; Yang Fan; Xingzhang Ren; An Yang; Binyuan Hui; Dayiheng Liu; Tao Gui; Qi Zhang; Xuanjing Huang; Yu-Gang Jiang; Bowen Yu; Jingren Zhou; Junyang Lin

arXiv:2505.10527·cs.CL·May 20, 2025

WorldPM: Scaling Human Preference Modeling

Binghai Wang, Runji Lin, Keming Lu, Le Yu, Zhenru Zhang, Fei Huang, Chujie Zheng, Kai Dang, Yang Fan, Xingzhang Ren, An Yang, Binyuan Hui, Dayiheng Liu, Tao Gui, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang, Bowen Yu, Jingren Zhou, Junyang Lin

PDF

Open Access 1 Repo 5 Models

TL;DR

This paper introduces WorldPM, a scalable human preference modeling framework that leverages large-scale data and models to improve preference alignment, showing significant gains across multiple benchmarks and evaluation metrics.

Contribution

We propose WorldPM, a unified preference modeling approach demonstrating scalable improvements in preference prediction and fine-tuning for large language models.

Findings

01

Preference metrics scale with data and model size, especially adversarial and objective metrics.

02

WorldPM improves performance on 7 benchmarks with over 5% gains.

03

Integration into RLHF pipeline yields 4-8% improvements in evaluations.

Abstract

Motivated by scaling laws in language modeling that demonstrate how test loss scales as a power law with model and dataset sizes, we find that similar laws exist in preference modeling. We propose World Preference Modeling$ (WorldPM) to emphasize this scaling potential, where World Preference embodies a unified representation of human preferences. In this paper, we collect preference data from public forums covering diverse user communities, and conduct extensive training using 15M-scale data across models ranging from 1.5B to 72B parameters. We observe distinct patterns across different evaluation metrics: (1) Adversarial metrics (ability to identify deceptive features) consistently scale up with increased training data and base model size; (2) Objective metrics (objective knowledge with well-defined answers) show emergent behavior in larger language models, highlighting WorldPM's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qwenlm/worldpm
noneOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Recommender Systems and Techniques · Topic Modeling

MethodsBalanced Selection