Predict and Use Latent Patterns for Short-Text Conversation
Hung-Ting Chen, Yu-Chieh Chao, Ta-Hsuan Chao, Wei-Yun Ma

TL;DR
This paper introduces a method to incorporate detailed latent semantic patterns, such as latent responses and POS sequences, to improve the diversity, informativeness, and quality of short-text conversational responses.
Contribution
It proposes using complex semantic forms as controllable signals for response generation, enhancing response diversity and quality over prior simpler latent word approaches.
Findings
Richer semantic forms improve response diversity.
Semantic guidance enhances fluency and coherence.
Method achieves higher response quality metrics.
Abstract
Many neural network models nowadays have achieved promising performances in Chit-chat settings. The majority of them rely on an encoder for understanding the post and a decoder for generating the response. Without given assigned semantics, the models lack the fine-grained control over responses as the semantic mapping between posts and responses is hidden on the fly within the end-to-end manners. Some previous works utilize sampled latent words as a controllable semantic form to drive the generated response around the work, but few works attempt to use more complex semantic patterns to guide the generation. In this paper, we propose to use more detailed semantic forms, including latent responses and part-of-speech sequences sampled from the corresponding distributions, as the controllable semantics to guide the generation. Our results show that the richer semantics are not only able to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
