Thinking Clearly, Talking Fast: Concept-Guided Non-Autoregressive   Generation for Open-Domain Dialogue Systems

Yicheng Zou; Zhihua Liu; Xingwu Hu; Qi Zhang

arXiv:2109.04084·cs.CL·September 10, 2021

Thinking Clearly, Talking Fast: Concept-Guided Non-Autoregressive Generation for Open-Domain Dialogue Systems

Yicheng Zou, Zhihua Liu, Xingwu Hu, Qi Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a concept-guided non-autoregressive model for open-domain dialogue systems that improves response diversity, coherence, and inference speed by effectively managing multiple concepts during response generation.

Contribution

The paper presents a novel multi-concept planning module and a customized Insertion Transformer for non-autoregressive dialogue generation, enabling better concept management and faster responses.

Findings

01

Outperforms state-of-the-art baselines in automatic evaluations

02

Produces more diverse and coherent responses

03

Achieves substantially faster inference speed

Abstract

Human dialogue contains evolving concepts, and speakers naturally associate multiple concepts to compose a response. However, current dialogue models with the seq2seq framework lack the ability to effectively manage concept transitions and can hardly introduce multiple concepts to responses in a sequential decoding manner. To facilitate a controllable and coherent dialogue, in this work, we devise a concept-guided non-autoregressive model (CG-nAR) for open-domain dialogue generation. The proposed model comprises a multi-concept planning module that learns to identify multiple associated concepts from a concept graph and a customized Insertion Transformer that performs concept-guided non-autoregressive generation to complete a response. The experimental results on two public datasets show that CG-nAR can produce diverse and coherent responses, outperforming state-of-the-art baselines in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rowitzou/cg-nar
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Tanh Activation · Sigmoid Activation · Dropout · Layer Normalization · Softmax