KPT: Keyword-guided Pre-training for Grounded Dialog Generation
Qi Zhu, Fei Mi, Zheng Zhang, Yasheng Wang, Yitong Li, Xin Jiang, Qun, Liu, Xiaoyan Zhu, Minlie Huang

TL;DR
KPT is a self-supervised pre-training method that enhances grounded dialog generation by extracting keywords from dialogs, enabling models to better incorporate various types of external knowledge without additional annotations.
Contribution
This work introduces KPT, a novel keyword-guided pre-training approach that leverages dialog-internal keywords for improved knowledge-grounded response generation across multiple knowledge sources.
Findings
KPT outperforms state-of-the-art methods on multiple knowledge-grounded tasks.
It effectively handles diverse knowledge types like graphs, personas, and passages.
The method works well with large-scale, unlabeled dialog data.
Abstract
Incorporating external knowledge into the response generation process is essential to building more helpful and reliable dialog agents. However, collecting knowledge-grounded conversations is often costly, calling for a better pre-trained model for grounded dialog generation that generalizes well w.r.t. different types of knowledge. In this work, we propose KPT (Keyword-guided Pre-Training), a novel self-supervised pre-training method for grounded dialog generation without relying on extra knowledge annotation. Specifically, we use a pre-trained language model to extract the most uncertain tokens in the dialog as keywords. With these keywords, we construct two kinds of knowledge and pre-train a knowledge-grounded response generation model, aiming at handling two different scenarios: (1) the knowledge should be faithfully grounded; (2) it can be selectively used. For the former, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
