A Pre-training Based Personalized Dialogue Generation Model with Persona-sparse Data
Yinhe Zheng, Rongsheng Zhang, Xiaoxi Mao, Minlie Huang

TL;DR
This paper introduces a pre-training based personalized dialogue generation model that effectively handles persona-sparse data by integrating persona attributes into a pre-trained language model with an attention routing mechanism, improving response coherence and persona consistency.
Contribution
The paper proposes a novel method combining pre-trained language models with persona embeddings and an attention routing decoder to better utilize persona-sparse dialogue data.
Findings
Outperforms state-of-the-art methods in coherence and persona consistency
Effectively incorporates persona information from sparse data during training
Demonstrates improved response quality through automatic and manual evaluations
Abstract
Endowing dialogue systems with personas is essential to deliver more human-like conversations. However, this problem is still far from well explored due to the difficulties of both embodying personalities in natural languages and the persona sparsity issue observed in most dialogue corpora. This paper proposes a pre-training based personalized dialogue model that can generate coherent responses using persona-sparse dialogue data. In this method, a pre-trained language model is used to initialize an encoder and decoder, and personal attribute embeddings are devised to model richer dialogue contexts by encoding speakers' personas together with dialogue histories. Further, to incorporate the target persona in the decoding process and to balance its contribution, an attention routing structure is devised in the decoder to merge features extracted from the target persona and dialogue…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
