Personalized Dialogue Generation with Diversified Traits

Yinhe Zheng; Guanyi Chen; Minlie Huang; Song Liu; Xuan Zhu

arXiv:1901.09672·cs.CL·January 3, 2020·89 cites

Personalized Dialogue Generation with Diversified Traits

Yinhe Zheng, Guanyi Chen, Minlie Huang, Song Liu, Xuan Zhu

PDF

Open Access 3 Repos 2 Datasets

TL;DR

This paper introduces a large-scale dataset and models for personalized dialogue generation that incorporate explicit personality traits, aiming to produce more human-like conversations.

Contribution

It constructs the PersonalDialog dataset with 20.83M sessions and 56.25M utterances, and proposes persona-aware models using trait fusion, attention, and bias mechanisms.

Findings

01

Models effectively incorporate personality traits into dialogues.

02

The dataset enables research on personalized and sociolinguistic dialogue generation.

03

Case studies show improved trait consistency in generated responses.

Abstract

Endowing a dialogue system with particular personality traits is essential to deliver more human-like conversations. However, due to the challenge of embodying personality via language expression and the lack of large-scale persona-labeled dialogue data, this research problem is still far from well-studied. In this paper, we investigate the problem of incorporating explicit personality traits in dialogue generation to deliver personalized dialogues. To this end, firstly, we construct PersonalDialog, a large-scale multi-turn dialogue dataset containing various traits from a large number of speakers. The dataset consists of 20.83M sessions and 56.25M utterances from 8.47M speakers. Each utterance is associated with a speaker who is marked with traits like Age, Gender, Location, Interest Tags, etc. Several anonymization schemes are designed to protect the privacy of each speaker. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems