Similarity Based Label Smoothing For Dialogue Generation

Sougata Saha; Souvik Das; Rohini Srihari

arXiv:2107.11481 · cs.CL · July 27, 2021 · 1 citation

Open Access

TL;DR

This paper proposes a data-dependent, similarity-based label smoothing technique for dialogue generation, replacing the uniform distribution of incorrect targets with a semantic similarity-informed distribution, leading to improved performance.

Contribution

It introduces a novel similarity-based weighting method for label smoothing in dialogue systems, enhancing training by incorporating semantic relations among words.

Findings

01. Significant performance improvements over standard label smoothing.

02. Effective incorporation of semantic similarity into label smoothing.

03. Validated on two open-domain dialogue datasets.

Abstract

Generative neural conversational systems are generally trained with the objective of minimizing the entropy loss between the training "hard" targets and the predicted logits. Often, performance gains and improved generalization can be achieved by using regularization techniques like label smoothing, which converts the training "hard" targets to "soft" targets. However, label smoothing enforces a data-independent uniform distribution on the incorrect training targets, which leads to an incorrect assumption of equiprobable incorrect targets for each correct target. In this paper we propose and experiment with incorporating data-dependent, word-similarity-based weighting methods to transform the uniform distribution of the incorrect target probabilities in label smoothing into a more natural distribution based on semantics. We introduce hyperparameters to control the incorrect target…
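
The contrast the abstract draws can be illustrated with a small sketch. It is not the paper's method: the abstract is truncated before it describes the weighting scheme or its hyperparameters, so the cosine-similarity weighting, the `temperature` parameter, the function names, and the toy embeddings below are all assumptions made for illustration. The first function is standard uniform label smoothing; the second redistributes the same smoothing mass `eps` over the incorrect tokens in proportion to a softmax of their similarity to the correct token.

```python
import numpy as np


def uniform_label_smoothing(target, vocab_size, eps=0.1):
    """Standard label smoothing: 1 - eps on the correct token,
    eps shared equally among all incorrect tokens."""
    dist = np.full(vocab_size, eps / (vocab_size - 1))
    dist[target] = 1.0 - eps
    return dist


def similarity_label_smoothing(target, embeddings, eps=0.1, temperature=1.0):
    """Hypothetical similarity-based variant (an assumption, not the paper's
    formula): eps is shared among incorrect tokens in proportion to a
    softmax over their cosine similarity to the correct token."""
    unit = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sims = unit @ unit[target]                 # cosine similarity to target
    incorrect = np.ones(len(embeddings), dtype=bool)
    incorrect[target] = False
    scores = sims[incorrect] / temperature     # temperature is illustrative
    weights = np.exp(scores - scores.max())    # numerically stable softmax
    weights /= weights.sum()
    dist = np.zeros(len(embeddings))
    dist[incorrect] = eps * weights
    dist[target] = 1.0 - eps
    return dist


def cross_entropy(soft_target, logits):
    """Cross-entropy between a soft target distribution and model logits."""
    shifted = logits - logits.max()
    log_probs = shifted - np.log(np.exp(shifted).sum())
    return -(soft_target * log_probs).sum()


# Toy vocabulary with made-up 2-d embeddings: good, great, bad, table.
embeddings = np.array([[1.0, 0.0], [0.9, 0.1], [-1.0, 0.0], [0.0, 1.0]])
uniform = uniform_label_smoothing(target=0, vocab_size=4)
similar = similarity_label_smoothing(target=0, embeddings=embeddings)
loss = cross_entropy(similar, logits=np.array([2.0, 1.0, -1.0, 0.0]))
```

With the correct token "good", the uniform target gives "great", "bad", and "table" the same probability, whereas the similarity-based target gives "great" the largest share of `eps` and "bad" the smallest. Either soft target can be passed to `cross_entropy` in place of a one-hot vector.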

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

Methods: Label Smoothing