A Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation
Shilei Liu, Xiaofeng Zhao, Bochao Li, Feiliang Ren, Longhui Zhang,, Shujuan Yin

TL;DR
This paper introduces a three-stage weakly supervised learning framework for low-resource knowledge-grounded dialogue generation, leveraging unstructured knowledge bases and ungrounded dialogues to improve performance in limited data scenarios.
Contribution
It proposes a novel three-stage framework and a Transformer variant with decoupled decoder to enhance knowledge-grounded dialogue generation under low-resource conditions.
Findings
Outperforms state-of-the-art methods with less training data.
Performs well even in zero-resource scenarios.
Effective use of unstructured knowledge bases and ungrounded dialogues.
Abstract
Neural conversation models have shown great potentials towards generating fluent and informative responses by introducing external background knowledge. Nevertheless, it is laborious to construct such knowledge-grounded dialogues, and existing models usually perform poorly when transfer to new domains with limited training samples. Therefore, building a knowledge-grounded dialogue system under the low-resource setting is a still crucial issue. In this paper, we propose a novel three-stage learning framework based on weakly supervised learning which benefits from large scale ungrounded dialogues and unstructured knowledge base. To better cooperate with this framework, we devise a variant of Transformer with decoupled decoder which facilitates the disentangled learning of response generation and knowledge incorporation. Evaluation results on two benchmarks indicate that our approach can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dropout · Layer Normalization · Softmax · Label Smoothing · Byte Pair Encoding
