Achieving Conversational Goals with Unsupervised Post-hoc Knowledge   Injection

Bodhisattwa Prasad Majumder; Harsh Jhamtani; Taylor Berg-Kirkpatrick,; Julian McAuley

arXiv:2203.11399·cs.CL·March 23, 2022

Achieving Conversational Goals with Unsupervised Post-hoc Knowledge Injection

Bodhisattwa Prasad Majumder, Harsh Jhamtani, Taylor Berg-Kirkpatrick,, Julian McAuley

PDF

Open Access 1 Repo

TL;DR

This paper introduces an unsupervised post-hoc knowledge injection method for neural dialog models, enhancing response specificity and informativeness by incorporating external knowledge snippets during decoding, leading to more engaging and goal-oriented conversations.

Contribution

The paper presents a novel unsupervised technique for injecting external knowledge into dialog responses after initial generation, improving informativeness and goal achievement.

Findings

01

Responses are judged more engaging and informative by humans.

02

Knowledge augmentation increases success in achieving conversational goals.

03

Method outperforms prior dialog systems in experimental evaluations.

Abstract

A limitation of current neural dialog models is that they tend to suffer from a lack of specificity and informativeness in generated responses, primarily due to dependence on training data that covers a limited variety of scenarios and conveys limited knowledge. One way to alleviate this issue is to extract relevant knowledge from external sources at decoding time and incorporate it into the dialog response. In this paper, we propose a post-hoc knowledge-injection technique where we first retrieve a diverse set of relevant knowledge snippets conditioned on both the dialog history and an initial response from an existing dialog model. We construct multiple candidate responses, individually injecting each retrieved snippet into the initial response using a gradient-based decoding method, and then select the final response with an unsupervised ranking step. Our experiments in goal-oriented…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

majumderb/poki
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications