Aligning Large Language Models with Implicit Preferences from User-Generated Content

Zhaoxuan Tan; Zheng Li; Tianyi Liu; Haodong Wang; Hyokun Yun; Ming Zeng; Pei Chen; Zhihan Zhang; Yifan Gao; Ruijie Wang; Priyanka Nigam; Bing Yin; Meng Jiang

arXiv:2506.04463·cs.CL·June 6, 2025

Aligning Large Language Models with Implicit Preferences from User-Generated Content

Zhaoxuan Tan, Zheng Li, Tianyi Liu, Haodong Wang, Hyokun Yun, Ming Zeng, Pei Chen, Zhihan Zhang, Yifan Gao, Ruijie Wang, Priyanka Nigam, Bing Yin, Meng Jiang

PDF

Open Access 1 Models 1 Datasets 1 Video

TL;DR

This paper introduces PUGC, a scalable framework that leverages implicit preferences from unlabeled user-generated content to improve large language model alignment with human values, achieving state-of-the-art results.

Contribution

The paper presents PUGC, a novel method that uses implicit preferences from UGC for scalable LLM alignment, reducing reliance on costly curated data.

Findings

01

Models trained with PUGC outperform traditional methods by 9.37% on Alpaca Eval 2.

02

Achieves a 35.93% state-of-the-art length-controlled win rate.

03

Enhances reward quality, domain-specific alignment, and robustness.

Abstract

Learning from preference feedback is essential for aligning large language models (LLMs) with human values and improving the quality of generated responses. However, existing preference learning methods rely heavily on curated data from humans or advanced LLMs, which is costly and difficult to scale. In this work, we present PUGC, a novel framework that leverages implicit human Preferences in unlabeled User-Generated Content (UGC) to generate preference data. Although UGC is not explicitly created to guide LLMs in generating human-preferred responses, it often reflects valuable insights and implicit preferences from its creators that has the potential to address readers' questions. PUGC transforms UGC into user queries and generates responses from the policy model. The UGC is then leveraged as a reference text for response scoring, aligning the model with these implicit preferences.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
Zhaoxuan/PUGC-Mistral-DPO
model· 4 dl· ♡ 2
4 dl♡ 2

Datasets

Zhaoxuan/PUGC_data
dataset· 12 dl
12 dl

Videos

Aligning Large Language Models with Implicit Preferences from User-Generated Content· underline

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Sentiment Analysis and Opinion Mining

MethodsDirect Preference Optimization