HSCJN: A Holistic Semantic Constraint Joint Network for Diverse Response Generation
Yiru Wang, Pengda Si, Zeyang Lei, Guangxu Xun, Yujiu Yang

TL;DR
This paper introduces HSCJN, a novel neural network that enhances response diversity and semantic consistency in dialogue generation by incorporating holistic semantic constraints and entropy regularization.
Contribution
The paper proposes a generic, diversity-promoting joint network that improves semantic relevance and diversity in Seq2Seq dialogue models, applicable to various architectures.
Findings
Improves response diversity in dialogue generation.
Enhances semantic consistency of generated responses.
Outperforms existing methods on multiple dialogue datasets.
Abstract
The sequence-to-sequence (Seq2Seq) model generates target words iteratively given the previously observed words during decoding process, which results in the loss of the holistic semantics in the target response and the complete semantic relationship between responses and dialogue histories. In this paper, we propose a generic diversity-promoting joint network, called Holistic Semantic Constraint Joint Network (HSCJN), enhancing the global sentence information, and then regularizing the objective function with penalizing the low entropy output. Our network introduces more target information to improve diversity, and captures direct semantic information to better constrain the relevance simultaneously. Moreover, the proposed method can be easily applied to any Seq2Seq structure. Extensive experiments on several dialogue corpuses show that our method effectively improves both semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
