Hummer: Towards Limited Competitive Preference Dataset

Li Jiang; Yusen Wu; Junwu Xiong; Jingqing Ruan; Yichuan Ding; Qingpei Guo; Zujie Wen; Jun Zhou; Xiaotie Deng

arXiv:2405.11647 · cs.AI · October 23, 2025

TL;DR

Hummer is a new preference dataset with fewer conflicting alignment objectives, built with AI feedback to improve alignment and robustness in language models.

Contribution

The paper presents Hummer, a novel preference dataset designed to minimize conflicts between alignment objectives, and develops reward models that effectively balance diverse alignment goals.

Findings

01 The Hummer dataset reduces conflicts in preference data.

02 HummerRM reward models balance multiple alignment objectives.

03 Enhanced robustness against jailbreak attacks.

Abstract

Preference datasets are essential for incorporating human preferences into pre-trained language models, playing a key role in the success of Reinforcement Learning from Human Feedback. However, these datasets often exhibit conflicting alignment objectives, leading to increased vulnerability to jailbreak attacks and making it difficult to adapt downstream tasks to prioritize specific alignment objectives without negatively impacting others. In this work, we introduce a novel statistical metric, Alignment Dimension Conflict, to quantify the degree of conflict within preference datasets. We then present Hummer and its fine-grained variant, Hummer-F, as innovative pairwise preference datasets with reduced-conflict alignment objectives. Hummer is built on UltraFeedback and enhanced by AI feedback from GPT-4, making it the first preference dataset aimed at…

Code & Models

Datasets

sarinw-2024/Hummer (dataset, 30 downloads)

Taxonomy

Topics: Data Management and Algorithms

Methods: Attention Is All You Need · Dense Connections · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding · Adam · Dropout