Personalized Dialogue Generation with Diversified Traits
Yinhe Zheng, Guanyi Chen, Minlie Huang, Song Liu, Xuan Zhu

TL;DR
This paper introduces a large-scale dataset and models for personalized dialogue generation that incorporate explicit personality traits, aiming to produce more human-like conversations.
Contribution
It constructs the PersonalDialog dataset with 20.83M sessions and 56.25M utterances, and proposes persona-aware models using trait fusion, attention, and bias mechanisms.
Findings
Models effectively incorporate personality traits into dialogues.
The dataset enables research on personalized and sociolinguistic dialogue generation.
Case studies show improved trait consistency in generated responses.
Abstract
Endowing a dialogue system with particular personality traits is essential to deliver more human-like conversations. However, due to the challenge of embodying personality via language expression and the lack of large-scale persona-labeled dialogue data, this research problem is still far from well-studied. In this paper, we investigate the problem of incorporating explicit personality traits in dialogue generation to deliver personalized dialogues. To this end, firstly, we construct PersonalDialog, a large-scale multi-turn dialogue dataset containing various traits from a large number of speakers. The dataset consists of 20.83M sessions and 56.25M utterances from 8.47M speakers. Each utterance is associated with a speaker who is marked with traits like Age, Gender, Location, Interest Tags, etc. Several anonymization schemes are designed to protect the privacy of each speaker. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
