PersoDPO: Scalable Preference Optimization for Instruction-Adherent, Persona-Grounded Dialogue via Multi-LLM Evaluation

Saleh Afzoon; MohammadHossein Ahmadi; Usman Naseem; Amin Beheshti

arXiv:2602.04493·cs.CL·February 5, 2026

PersoDPO: Scalable Preference Optimization for Instruction-Adherent, Persona-Grounded Dialogue via Multi-LLM Evaluation

Saleh Afzoon, MohammadHossein Ahmadi, Usman Naseem, Amin Beheshti

PDF

Open Access

TL;DR

PersoDPO is a scalable framework that improves persona-grounded dialogue models by automatically optimizing preferences based on automatic evaluation signals, enhancing contextual coherence and personalization.

Contribution

The paper introduces PersoDPO, a novel automatic preference optimization method that leverages automatic evaluation signals to fine-tune dialogue models without manual annotation.

Findings

01

Outperforms baseline models on the FoCus dataset

02

Enhances coherence and personalization in dialogue responses

03

Automates preference pair construction for scalable training

Abstract

Personalization and contextual coherence are two essential components in building effective persona-grounded dialogue systems. These aspects play a crucial role in enhancing user engagement and ensuring responses are more relevant and consistent with user identity. However, recent studies indicate that open-source large language models (LLMs) continue to struggle to generate responses that are both contextually grounded and aligned with persona cues, despite exhibiting strong general conversational abilities like fluency and naturalness. We present PersoDPO, a scalable preference optimisation framework that uses supervision signals from automatic evaluations of responses generated by both closed-source and open-source LLMs to fine-tune dialogue models. The framework integrates evaluation metrics targeting coherence and personalization, along with a length-format compliance feature to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPersona Design and Applications · Topic Modeling · AI in Service Interactions