Dialogue Language Model with Large-Scale Persona Data Engineering

Mengze Hong; Chen Jason Zhang; Chaotao Chen; Rongzhong Lian; Di Jiang

arXiv:2412.09034·cs.CL·February 20, 2025

Dialogue Language Model with Large-Scale Persona Data Engineering

Mengze Hong, Chen Jason Zhang, Chaotao Chen, Rongzhong Lian, Di Jiang

PDF

Open Access 1 Video

TL;DR

This paper introduces PPDS, a large-scale persona dialogue system that leverages extensive pre-training, a novel persona extraction model, and augmentation techniques to improve persona consistency and response quality in open-domain dialogue models.

Contribution

The paper presents a new large-scale persona dialogue dataset, a persona extraction model, and augmentation methods to enhance persona consistency in dialogue systems.

Findings

01

PPDS outperforms baseline models in response quality.

02

Enhanced persona consistency demonstrated through evaluations.

03

Effective dataset augmentation reduces invalid persona bias.

Abstract

Maintaining persona consistency is paramount in the application of open-domain dialogue systems, as exemplified by models like ChatGPT. Despite significant advancements, the limited scale and diversity of current persona dialogue datasets remain challenges to achieving robust persona-consistent dialogue models. In this study, drawing inspiration from the success of large-scale pre-training, we introduce PPDS, an open-domain persona dialogue system that employs extensive generative pre-training on a persona dialogue dataset to enhance persona consistency. Specifically, we present a persona extraction model designed to autonomously and precisely generate vast persona dialogue datasets. Additionally, we unveil a pioneering persona augmentation technique to address the invalid persona bias inherent in the constructed dataset. Both quantitative and human evaluations consistently highlight…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Dialogue Language Model with Large-Scale Persona Data Engineering· underline

Taxonomy

TopicsPersona Design and Applications