Relative Preference Optimization: Enhancing LLM Alignment through   Contrasting Responses across Identical and Diverse Prompts

Yueqin Yin; Zhendong Wang; Yi Gu; Hai Huang; Weizhu Chen; Mingyuan; Zhou

arXiv:2402.10958·cs.CL·May 29, 2024·1 cites

Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts

Yueqin Yin, Zhendong Wang, Yi Gu, Hai Huang, Weizhu Chen, Mingyuan, Zhou

PDF

Open Access 1 Repo

TL;DR

This paper introduces Relative Preference Optimization (RPO), a novel method for aligning large language models with user preferences by contrasting responses across identical and related prompts, improving adaptability and preference alignment.

Contribution

RPO extends preference optimization by incorporating contrastive weighting across diverse prompts, enhancing LLM alignment without requiring additional reward models.

Findings

01

RPO outperforms DPO in alignment tasks.

02

Improves adaptability in dialogue and summarization.

03

Demonstrates superior performance on AlpacaEval2.0.

Abstract

In the field of large language models (LLMs), aligning models with the diverse preferences of users is a critical challenge. Direct Preference Optimization (DPO) has played a key role in this area. It works by using pairs of preferences derived from the same prompts, and it functions without needing an additional reward model. However, DPO does not fully reflect the complex nature of human learning, which often involves understanding contrasting responses to not only identical but also similar questions. To overcome this shortfall, we propose Relative Preference Optimization (RPO). RPO is designed to discern between more and less preferred responses derived from both identical and related prompts. It introduces a contrastive weighting mechanism, enabling the tuning of LLMs using a broader range of preference data, including both paired and unpaired sets. This approach expands the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yinyueqin/relative-preference-optimization
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Mining Algorithms and Applications

MethodsDirect Preference Optimization · Sparse Evolutionary Training · ALIGN