Controllable Dialogue Generation with Disentangled Multi-grained Style   Specification and Attribute Consistency Reward

Zhe Hu; Zhiwei Cao; Hou Pong Chan; Jiachen Liu; Xinyan Xiao; Jinsong; Su; Hua Wu

arXiv:2109.06717·cs.CL·October 24, 2022·1 cites

Controllable Dialogue Generation with Disentangled Multi-grained Style Specification and Attribute Consistency Reward

Zhe Hu, Zhiwei Cao, Hou Pong Chan, Jiachen Liu, Xinyan Xiao, Jinsong, Su, Hua Wu

PDF

Open Access

TL;DR

This paper introduces a novel controllable dialogue generation model that uses multi-grained style specifications and an attribute consistency reward to produce responses that are both relevant and attribute-controlled.

Contribution

It proposes a two-stage decoder with style specification and a new attribute consistency reward for improved controllable dialogue generation.

Findings

01

Outperforms baselines in response quality

02

Enhances content diversity and controllability

03

Effectively maintains attribute fidelity

Abstract

Controllable text generation is an appealing but challenging task, which allows users to specify particular attributes of the generated outputs. In this paper, we propose a controllable dialogue generation model to steer response generation under multi-attribute constraints. Specifically, we define and categorize the commonly used control attributes into global and local ones, which possess different granularities of effects on response generation. Then, we significantly extend the conventional seq2seq framework by introducing a novel two-stage decoder, which first uses a multi-grained style specification layer to impose the stylistic constraints and determine word-level control states of responses based on the attributes, and then employs a response generation layer to generate final responses maintaining both semantic relevancy to the contexts and fidelity to the attributes.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence