Posterior-GAN: Towards Informative and Coherent Response Generation with   Posterior Generative Adversarial Network

Shaoxiong Feng; Hongshen Chen; Kan Li; Dawei Yin

arXiv:2003.02020·cs.CL·March 5, 2020·1 cites

Posterior-GAN: Towards Informative and Coherent Response Generation with Posterior Generative Adversarial Network

Shaoxiong Feng, Hongshen Chen, Kan Li, Dawei Yin

PDF

Open Access

TL;DR

Posterior-GAN introduces a novel adversarial framework that leverages query-response-future turn triples to generate more informative and coherent responses in neural conversational models, addressing the issues of dull responses and loose coupling.

Contribution

The paper proposes a new encoder-decoder generative adversarial network with dual discriminators to improve response informativeness and coherence by considering future conversation context.

Findings

01

Enhanced response informativeness demonstrated in experiments

02

Improved coherence of generated responses verified by human evaluation

03

Outperforms baseline models on automatic metrics

Abstract

Neural conversational models learn to generate responses by taking into account the dialog history. These models are typically optimized over the query-response pairs with a maximum likelihood estimation objective. However, the query-response tuples are naturally loosely coupled, and there exist multiple responses that can respond to a given query, which leads the conversational model learning burdensome. Besides, the general dull response problem is even worsened when the model is confronted with meaningless response training instances. Intuitively, a high-quality response not only responds to the given query but also links up to the future conversations, in this paper, we leverage the query-response-future turn triples to induce the generated responses that consider both the given context and the future conversations. To facilitate the modeling of these triples, we further propose a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech Recognition and Synthesis · Multimodal Machine Learning Applications