PersoDPO: Scalable Preference Optimization for Instruction-Adherent, Persona-Grounded Dialogue via Multi-LLM Evaluation
Saleh Afzoon, MohammadHossein Ahmadi, Usman Naseem, Amin Beheshti

TL;DR
PersoDPO is a scalable framework that improves persona-grounded dialogue models by automatically optimizing preferences based on automatic evaluation signals, enhancing contextual coherence and personalization.
Contribution
The paper introduces PersoDPO, a novel automatic preference optimization method that leverages automatic evaluation signals to fine-tune dialogue models without manual annotation.
Findings
Outperforms baseline models on the FoCus dataset
Enhances coherence and personalization in dialogue responses
Automates preference pair construction for scalable training
Abstract
Personalization and contextual coherence are two essential components in building effective persona-grounded dialogue systems. These aspects play a crucial role in enhancing user engagement and ensuring responses are more relevant and consistent with user identity. However, recent studies indicate that open-source large language models (LLMs) continue to struggle to generate responses that are both contextually grounded and aligned with persona cues, despite exhibiting strong general conversational abilities like fluency and naturalness. We present PersoDPO, a scalable preference optimisation framework that uses supervision signals from automatic evaluations of responses generated by both closed-source and open-source LLMs to fine-tune dialogue models. The framework integrates evaluation metrics targeting coherence and personalization, along with a length-format compliance feature to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPersona Design and Applications · Topic Modeling · AI in Service Interactions
