# Improving Neural Response Diversity with Frequency-Aware Cross-Entropy   Loss

**Authors:** Shaojie Jiang, Pengjie Ren, Christof Monz, Maarten de Rijke

arXiv: 1902.09191 · 2019-02-26

## TL;DR

This paper introduces the Frequency-Aware Cross-Entropy (FACE) loss to enhance response diversity in Seq2Seq dialogue models by addressing the over-confidence on high-frequency tokens, leading to more varied and engaging responses.

## Contribution

The paper proposes a novel FACE loss function that adjusts token weighting based on frequency, improving diversity over standard CE loss in dialogue response generation.

## Key findings

- FACE significantly increases response diversity
- Improves both automatic and human evaluation metrics
- Outperforms existing methods on benchmark datasets

## Abstract

Sequence-to-Sequence (Seq2Seq) models have achieved encouraging performance on the dialogue response generation task. However, existing Seq2Seq-based response generation methods suffer from a low-diversity problem: they frequently generate generic responses, which make the conversation less interesting. In this paper, we address the low-diversity problem by investigating its connection with model over-confidence reflected in predicted distributions. Specifically, we first analyze the influence of the commonly used Cross-Entropy (CE) loss function, and find that the CE loss function prefers high-frequency tokens, which results in low-diversity responses. We then propose a Frequency-Aware Cross-Entropy (FACE) loss function that improves over the CE loss function by incorporating a weighting mechanism conditioned on token frequency. Extensive experiments on benchmark datasets show that the FACE loss function is able to substantially improve the diversity of existing state-of-the-art Seq2Seq response generation methods, in terms of both automatic and human evaluations.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.09191/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1902.09191/full.md

## References

20 references — full list in the complete paper: https://tomesphere.com/paper/1902.09191/full.md

---
Source: https://tomesphere.com/paper/1902.09191