Sequential Latent Knowledge Selection for Knowledge-Grounded Dialogue
Byeongchang Kim, Jaewoo Ahn, Gunhee Kim

TL;DR
This paper introduces a sequential latent variable model called SKT for improved knowledge selection in multi-turn knowledge-grounded dialogue, leading to better response generation and state-of-the-art results on large-scale benchmarks.
Contribution
It is the first to model sequential knowledge selection with a latent variable approach, enhancing accuracy and dialogue quality.
Findings
Achieved state-of-the-art on Wizard of Wikipedia.
Improved knowledge selection accuracy.
Validated effectiveness on Holl-E dataset.
Abstract
Knowledge-grounded dialogue is a task of generating an informative response based on both discourse context and external knowledge. As we focus on better modeling the knowledge selection in the multi-turn knowledge-grounded dialogue, we propose a sequential latent variable model as the first approach to this matter. The model named sequential knowledge transformer (SKT) can keep track of the prior and posterior distribution over knowledge; as a result, it can not only reduce the ambiguity caused from the diversity in knowledge selection of conversation but also better leverage the response information for proper choice of knowledge. Our experimental results show that the proposed model improves the knowledge selection accuracy and subsequently the performance of utterance generation. We achieve the new state-of-the-art performance on Wizard of Wikipedia (Dinan et al., 2019) as one of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsWizard: Unsupervised goats tracking algorithm · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam
