Gradient Imbalance in Direct Preference Optimization

Qinwei Ma; Jingzhe Shi; Can Jin; Jenq-Neng Hwang; Serge Belongie; Lei; Li

arXiv:2502.20847·cs.LG·March 3, 2025

Gradient Imbalance in Direct Preference Optimization

Qinwei Ma, Jingzhe Shi, Can Jin, Jenq-Neng Hwang, Serge Belongie, Lei, Li

PDF

TL;DR

This paper identifies gradient imbalance as a key issue limiting Direct Preference Optimization (DPO) performance and proposes Balanced-DPO, a simple modification that improves training stability and effectiveness.

Contribution

The paper provides a systematic analysis of DPO's training dynamics, revealing gradient imbalance as a critical problem, and introduces Balanced-DPO with a gradient reweighting mechanism to enhance performance.

Findings

01

Gradient imbalance destabilizes DPO training.

02

Balanced-DPO improves convergence and stability.

03

Addressing gradient imbalance enhances DPO effectiveness.

Abstract

Direct Preference Optimization (DPO) has been proposed as a promising alternative to Proximal Policy Optimization (PPO) based Reinforcement Learning with Human Feedback (RLHF). However, empirical evaluations consistently reveal suboptimal performance in DPO compared to common RLHF pipelines. In this work, we conduct a systematic analysis of DPO's training dynamics and identify gradient imbalance as a critical limitation. We demonstrate theoretically and empirically that this imbalance perturbs optimization trajectories, destabilizes learning, and induces suboptimal convergence. To address this issue, we propose Balanced-DPO, a simple yet effective modification to the DPO objective that introduces a computationally efficient gradient reweighting mechanism. Our experiments demonstrate the effectiveness of Balanced-DPO, validating the theoretical findings and confirming that addressing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.