Empathetic Dialogue Generation with Pre-trained RoBERTa-GPT2 and   External Knowledge

Ye Liu; Wolfgang Maier; Wolfgang Minker; Stefan Ultes

arXiv:2109.03004·cs.CL·September 8, 2021·1 cites

Empathetic Dialogue Generation with Pre-trained RoBERTa-GPT2 and External Knowledge

Ye Liu, Wolfgang Maier, Wolfgang Minker, Stefan Ultes

PDF

Open Access

TL;DR

This paper introduces a novel empathetic dialogue generation model using pre-trained RoBERTa as encoder and GPT-2 as decoder, enhanced with external commonsense and emotional knowledge, achieving state-of-the-art emotion accuracy.

Contribution

It combines pre-trained encoder-decoder architecture with external knowledge extraction to improve empathetic response generation in dialogue systems.

Findings

01

Achieves new state-of-the-art emotion accuracy.

02

External knowledge improves empathetic response quality.

03

Pre-trained models enhance dialogue understanding and generation.

Abstract

One challenge for dialogue agents is to recognize feelings of the conversation partner and respond accordingly. In this work, RoBERTa-GPT2 is proposed for empathetic dialogue generation, where the pre-trained auto-encoding RoBERTa is utilised as encoder and the pre-trained auto-regressive GPT-2 as decoder. With the combination of the pre-trained RoBERTa and GPT-2, our model realizes a new state-of-the-art emotion accuracy. To enable the empathetic ability of RoBERTa-GPT2 model, we propose a commonsense knowledge and emotional concepts extractor, in which the commonsensible and emotional concepts of dialogue context are extracted for the GPT-2 decoder. The experiment results demonstrate that the empathetic dialogue generation benefits from both pre-trained encoder-decoder architecture and external knowledge.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · WordPiece · Linear Warmup With Linear Decay · BERT · Dropout · Softmax · Attention Dropout · Cosine Annealing