Progressive Open-Domain Response Generation with Multiple Controllable Attributes
Haiqin Yang, Xiaoyuan Yao, Yiqun Duan, Jianping Shen, Jie Zhong, Kun, Zhang

TL;DR
This paper introduces PHED, a hierarchical model that progressively incorporates multiple controllable attributes into open-domain response generation, significantly improving diversity and attribute control over previous methods.
Contribution
The paper proposes a novel progressive training framework with CVAE and Transformer to effectively generate responses with multiple controllable attributes.
Findings
PHED outperforms state-of-the-art models in diversity and control.
The model effectively separates semantic and attribute information.
Extensive evaluations validate the approach's effectiveness.
Abstract
It is desirable to include more controllable attributes to enhance the diversity of generated responses in open-domain dialogue systems. However, existing methods can generate responses with only one controllable attribute or lack a flexible way to generate them with multiple controllable attributes. In this paper, we propose a Progressively trained Hierarchical Encoder-Decoder (PHED) to tackle this task. More specifically, PHED deploys Conditional Variational AutoEncoder (CVAE) on Transformer to include one aspect of attributes at one stage. A vital characteristic of the CVAE is to separate the latent variables at each stage into two types: a global variable capturing the common semantic features and a specific variable absorbing the attribute information at that stage. PHED then couples the CVAE latent variables with the Transformer encoder and is trained by minimizing a newly derived…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Adam · Dropout · Layer Normalization · Multi-Head Attention · Label Smoothing
