Towards Efficiently Diversifying Dialogue Generation via Embedding Augmentation

Yu Cao; Liang Ding; Zhiliang Tian; Meng Fang

arXiv:2103.01534 · cs.CL · March 3, 2021

TL;DR

This paper introduces a novel embedding augmentation method with soft labels to enhance diversity in dialogue generation models, achieving more varied responses without sacrificing response quality.

Contribution

The paper proposes a new soft embedding augmentation technique combined with soft labels to improve diversity in neural dialogue generation models.

Findings

1. Generated responses are more diverse than those produced by the baseline models.
2. The method maintains similar n-gram accuracy, preserving response quality.
3. Experiments on two datasets support the effectiveness of the approach.

Abstract

Dialogue generation models face the challenge of producing generic and repetitive responses. Unlike previous augmentation methods, which mostly focus on token manipulation and, by using hard labels, ignore the essential variety within a single sample, we propose to promote the generation diversity of neural dialogue models via soft embedding augmentation along with soft labels. Specifically, we select some key input tokens and fuse their embeddings with the embeddings of their semantic-neighbor tokens. The new embeddings replace the original ones as the input of the model. In addition, soft labels are used in the loss calculation, resulting in multi-target supervision for a given input. Our experimental results on two datasets show that the proposed method generates more diverse responses than the raw models while maintaining a similar n-gram accuracy that…
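The two ideas in the abstract, fusing a token's embedding with its neighbors' embeddings and supervising with a matching soft label, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `keep` ratio, the neighbor weights, and the toy vocabulary are all assumptions made for illustration, and plain Python lists stand in for tensors. The abstract does not state how neighbors are selected or weighted, so those inputs are simply passed in.

```python
import math


def fuse_embedding(original, neighbors, weights, keep=0.6):
    """Mix a token embedding with the embeddings of its semantic neighbors.

    `keep` is the share retained by the original token; the remaining
    1 - keep is split across neighbors in proportion to `weights`.
    Both values are illustrative assumptions, not taken from the paper.
    """
    total = sum(weights)
    mixed = [keep * x for x in original]
    for vec, w in zip(neighbors, weights):
        share = (1.0 - keep) * w / total
        mixed = [m + share * v for m, v in zip(mixed, vec)]
    return mixed


def soft_label(vocab_size, target_id, neighbor_ids, weights, keep=0.6):
    """Build a target distribution that mirrors the mixing proportions."""
    total = sum(weights)
    dist = [0.0] * vocab_size
    dist[target_id] += keep
    for idx, w in zip(neighbor_ids, weights):
        dist[idx] += (1.0 - keep) * w / total
    return dist


def soft_cross_entropy(logits, target_dist):
    """Cross-entropy between a soft target and softmax(logits)."""
    peak = max(logits)  # subtract the max for numerical stability
    log_z = peak + math.log(sum(math.exp(l - peak) for l in logits))
    return -sum(p * (l - log_z) for p, l in zip(target_dist, logits))


# Toy vocabulary of three tokens with 2-dimensional embeddings (hypothetical).
embeddings = {0: [1.0, 0.0], 1: [0.0, 1.0], 2: [1.0, 1.0]}
neighbor_ids = [1, 2]          # assumed semantic neighbors of token 0
neighbor_weights = [1.0, 3.0]  # assumed similarity scores

fused = fuse_embedding(
    embeddings[0], [embeddings[i] for i in neighbor_ids], neighbor_weights
)
target = soft_label(3, 0, neighbor_ids, neighbor_weights)
loss = soft_cross_entropy([2.0, 0.5, 1.0], target)
print(fused, target, loss)
```

With a one-hot `target_dist`, `soft_cross_entropy` reduces to the usual hard-label cross-entropy, so in this sketch the soft label only redistributes part of the supervision onto the neighbor tokens.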

Code & Models

Repositories

caoyu-noob/embedding_augmentation (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Speech and dialogue systems · Natural Language Processing Techniques