DynamicPO: Dynamic Preference Optimization for Recommendation

Xingyu Hu; Kai Zhang; Jiancan Wu; Shuli Wang; Chi Wang; Wenshuai Chen; Yinhua Zhu; Haitao Wang; Xingxing Wang; Xiang Wang

arXiv:2605.00327·cs.IR·May 4, 2026

TL;DR

DynamicPO introduces adaptive mechanisms to improve preference optimization in LLM-based recommendation systems, preventing collapse and enhancing accuracy.

Contribution

The paper proposes a lightweight framework that combines boundary-aware negative selection with dynamic calibration to address preference optimization collapse.

Findings

1. DynamicPO prevents preference optimization collapse.
2. It improves recommendation accuracy on multiple datasets.
3. The framework adds negligible computational overhead.

Abstract

In large language model (LLM)-based recommendation systems, direct preference optimization (DPO) effectively aligns recommendations with user preferences, requiring multi-negative objective functions to leverage abundant implicit-feedback negatives and sharpen preference boundaries. However, our empirical analyses reveal a counterintuitive phenomenon, preference optimization collapse, where increasing the number of negative samples can lead to performance degradation despite a continuously decreasing training loss. We further theoretically demonstrate that this collapse arises from gradient suppression, caused by the dominance of easily discriminable negatives over boundary-critical negatives that truly define user preference boundaries. As a result, boundary-relevant signals are under-optimized, weakening the model's decision boundary. Motivated by these observations, we propose…
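The interplay between a falling loss and a weakening boundary signal can be seen in a toy setting. The sketch below is illustrative only: it assumes an averaged-negative objective, -log sigmoid(beta * (r_pos - mean(r_neg))), which is not necessarily the objective analyzed in the paper, and the function names, reward values, and `beta` are hypothetical. Under that assumption, adding many easy negatives inflates the margin, so the loss drops while the gradient reaching the one boundary-critical negative shrinks.

```python
import math


def sigmoid(x):
    """Logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def averaged_negative_loss(pos_reward, neg_rewards, beta=1.0):
    """Assumed toy objective: -log sigmoid(beta * (r_pos - mean(r_neg)))."""
    margin = beta * (pos_reward - sum(neg_rewards) / len(neg_rewards))
    return -math.log(sigmoid(margin))


def per_negative_grad(pos_reward, neg_rewards, beta=1.0):
    """Derivative of the toy loss with respect to any single negative reward.

    Averaging gives every negative the same gradient:
    beta * sigmoid(-margin) / K, where K is the number of negatives.
    """
    k = len(neg_rewards)
    margin = beta * (pos_reward - sum(neg_rewards) / k)
    return beta * sigmoid(-margin) / k


if __name__ == "__main__":
    pos = 1.0                       # hypothetical reward of the preferred item
    hard_only = [0.9]               # one negative close to the boundary
    mixed = [0.9] + [-5.0] * 9      # same hard negative plus nine easy ones

    for name, negs in (("hard only", hard_only), ("hard + easy", mixed)):
        loss = averaged_negative_loss(pos, negs)
        grad = per_negative_grad(pos, negs)
        print(f"{name:12s} loss={loss:.5f} grad_on_hard_negative={grad:.6f}")
```

In this toy case the mixed set yields a much lower loss and a far smaller gradient on the hard negative, which is one way a decreasing training loss can coexist with an under-optimized decision boundary.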

Code & Models

Repositories

xingyuHuxingyu/DynamicPO (GitHub)
