DeepCopy: Grounded Response Generation with Hierarchical Pointer Networks
Semih Yavuz, Abhinav Rastogi, Guan-Lin Chao, Dilek Hakkani-Tur

TL;DR
DeepCopy introduces a hierarchical pointer network model that enhances dialogue response generation by grounding responses in external knowledge, leading to more accurate, informative, and engaging conversations.
Contribution
The paper proposes a novel hierarchical pointer network that enables response models to copy from external knowledge sources, improving grounded dialogue generation.
Findings
Outperforms baseline models on CONVAI2 dataset
Produces more accurate and engaging responses
Improves grounding in external knowledge
Abstract
Recent advances in neural sequence-to-sequence models have led to promising results for several language generation-based tasks, including dialogue response generation, summarization, and machine translation. However, these models are known to have several problems, especially in the context of chit-chat based dialogue systems: they tend to generate short and dull responses that are often too generic. Furthermore, these models do not ground conversational responses on knowledge and facts, resulting in turns that are not accurate, informative and engaging for the users. In this paper, we propose and experiment with a series of response generation models that aim to serve in the general scenario where in addition to the dialogue context, relevant unstructured external knowledge in the form of text is also assumed to be available for models to harness. Our proposed approach extends…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
