Better Conversations by Modeling, Filtering, and Optimizing for Coherence and Diversity

Xinnuo Xu, Ondřej Dušek, Ioannis Konstas, Verena Rieser

arXiv:1809.06873 · cs.CL · November 22, 2018 · 5 cites

TL;DR

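This paper enhances open-domain conversational models in three ways: it introduces a coherence measure based on GloVe embedding similarity, filters the training data for topically coherent context-response pairs, and trains a conditional variational autoencoder that uses coherence as a latent variable. On the OpenSubtitles corpus, this yields substantial gains in BLEU, coherence, and diversity over competitive neural models.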

Contribution

The paper combines a coherence measure, coherence-based filtering of training data, and a conditional variational autoencoder with a context gate to improve the coherence and diversity of dialogue responses.

Findings

01 Improved BLEU scores over baseline models

02 Enhanced coherence and diversity metrics

03 Effective use of coherence as a latent variable

Abstract

We present three enhancements to existing encoder-decoder models for open-domain conversational agents, aimed at effectively modeling coherence and promoting output diversity: (1) We introduce a measure of coherence as the GloVe embedding similarity between the dialogue context and the generated response, (2) we filter our training corpora based on the measure of coherence to obtain topically coherent and lexically diverse context-response pairs, (3) we then train a response generator using a conditional variational autoencoder model that incorporates the measure of coherence as a latent variable and uses a context gate to guarantee topical consistency with the context and promote lexical diversity. Experiments on the OpenSubtitles corpus show a substantial improvement over competitive neural models in terms of BLEU score as well as metrics of coherence and diversity.
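To make enhancements (1) and (2) concrete, here is a minimal sketch, not the authors' implementation. It assumes the coherence measure is the cosine similarity between the averaged word vectors of the context and of the response, which the abstract does not spell out. The tiny `TOY_EMBEDDINGS` table stands in for real pretrained GloVe vectors, and the names `mean_vector`, `coherence`, and `filter_pairs`, as well as the threshold value, are hypothetical.

```python
import math

# Assumption: toy 3-dimensional vectors standing in for pretrained GloVe embeddings.
TOY_EMBEDDINGS = {
    "coffee": [0.9, 0.1, 0.0],
    "tea": [0.8, 0.2, 0.0],
    "drink": [0.7, 0.3, 0.1],
    "car": [0.0, 0.1, 0.9],
    "engine": [0.1, 0.0, 0.8],
}


def mean_vector(tokens, embeddings):
    """Average the vectors of in-vocabulary tokens; return None if none are known."""
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    if not vectors:
        return None
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def coherence(context, response, embeddings=TOY_EMBEDDINGS):
    """Cosine similarity between averaged context and response embeddings.

    Assumption: the paper's exact formula may differ from this averaging scheme.
    """
    c = mean_vector(context.lower().split(), embeddings)
    r = mean_vector(response.lower().split(), embeddings)
    if c is None or r is None:
        return 0.0  # no known words on one side: treat the pair as incoherent
    dot = sum(a * b for a, b in zip(c, r))
    norm = math.sqrt(sum(a * a for a in c)) * math.sqrt(sum(b * b for b in r))
    return dot / norm if norm else 0.0


def filter_pairs(pairs, threshold, embeddings=TOY_EMBEDDINGS):
    """Keep only context-response pairs whose coherence reaches the threshold."""
    return [(c, r) for c, r in pairs if coherence(c, r, embeddings) >= threshold]


if __name__ == "__main__":
    pairs = [("coffee drink", "tea"), ("coffee drink", "car engine")]
    for context, response in pairs:
        print(context, "|", response, "|", round(coherence(context, response), 3))
    # The threshold 0.5 is an arbitrary illustration, not a value from the paper.
    print(filter_pairs(pairs, threshold=0.5))
```

With these toy vectors the on-topic pair scores far higher than the off-topic one, so a threshold between the two scores keeps only the first pair. In practice the vectors would come from a pretrained GloVe file, and the threshold would need tuning against the corpus.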

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Speech and dialogue systems