Controllable Dialogue Generation with Disentangled Multi-grained Style Specification and Attribute Consistency Reward
Zhe Hu, Zhiwei Cao, Hou Pong Chan, Jiachen Liu, Xinyan Xiao, Jinsong, Su, Hua Wu

TL;DR
This paper introduces a novel controllable dialogue generation model that uses multi-grained style specifications and an attribute consistency reward to produce responses that are both relevant and attribute-controlled.
Contribution
It proposes a two-stage decoder with style specification and a new attribute consistency reward for improved controllable dialogue generation.
Findings
Outperforms baselines in response quality
Enhances content diversity and controllability
Effectively maintains attribute fidelity
Abstract
Controllable text generation is an appealing but challenging task, which allows users to specify particular attributes of the generated outputs. In this paper, we propose a controllable dialogue generation model to steer response generation under multi-attribute constraints. Specifically, we define and categorize the commonly used control attributes into global and local ones, which possess different granularities of effects on response generation. Then, we significantly extend the conventional seq2seq framework by introducing a novel two-stage decoder, which first uses a multi-grained style specification layer to impose the stylistic constraints and determine word-level control states of responses based on the attributes, and then employs a response generation layer to generate final responses maintaining both semantic relevancy to the contexts and fidelity to the attributes.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence
