Empathetic Dialogue Generation with Pre-trained RoBERTa-GPT2 and External Knowledge
Ye Liu, Wolfgang Maier, Wolfgang Minker, Stefan Ultes

TL;DR
This paper introduces a novel empathetic dialogue generation model using pre-trained RoBERTa as encoder and GPT-2 as decoder, enhanced with external commonsense and emotional knowledge, achieving state-of-the-art emotion accuracy.
Contribution
It combines pre-trained encoder-decoder architecture with external knowledge extraction to improve empathetic response generation in dialogue systems.
Findings
Achieves new state-of-the-art emotion accuracy.
External knowledge improves empathetic response quality.
Pre-trained models enhance dialogue understanding and generation.
Abstract
One challenge for dialogue agents is to recognize feelings of the conversation partner and respond accordingly. In this work, RoBERTa-GPT2 is proposed for empathetic dialogue generation, where the pre-trained auto-encoding RoBERTa is utilised as encoder and the pre-trained auto-regressive GPT-2 as decoder. With the combination of the pre-trained RoBERTa and GPT-2, our model realizes a new state-of-the-art emotion accuracy. To enable the empathetic ability of RoBERTa-GPT2 model, we propose a commonsense knowledge and emotional concepts extractor, in which the commonsensible and emotional concepts of dialogue context are extracted for the GPT-2 decoder. The experiment results demonstrate that the empathetic dialogue generation benefits from both pre-trained encoder-decoder architecture and external knowledge.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · WordPiece · Linear Warmup With Linear Decay · BERT · Dropout · Softmax · Attention Dropout · Cosine Annealing
