Self-Attentional Models Application in Task-Oriented Dialogue Generation   Systems

Mansour Saffar Mehrjardi; Amine Trabelsi; Osmar R. Zaiane

arXiv:1909.05246·cs.CL·September 13, 2019

Self-Attentional Models Application in Task-Oriented Dialogue Generation Systems

Mansour Saffar Mehrjardi, Amine Trabelsi, Osmar R. Zaiane

PDF

Open Access

TL;DR

This paper explores the use of self-attentional models for training end-to-end task-oriented dialogue systems, demonstrating improved performance and efficiency over traditional recurrence-based models.

Contribution

It is the first to apply self-attentional models to task-oriented dialogue generation, showing their advantages in accuracy and efficiency.

Findings

01

Self-attentional models outperform recurrence-based models in dialogue tasks.

02

Self-attentional models achieve higher evaluation scores.

03

Models are trained efficiently on multiple datasets.

Abstract

Self-attentional models are a new paradigm for sequence modelling tasks which differ from common sequence modelling methods, such as recurrence-based and convolution-based sequence learning, in the way that their architecture is only based on the attention mechanism. Self-attentional models have been used in the creation of the state-of-the-art models in many NLP tasks such as neural machine translation, but their usage has not been explored for the task of training end-to-end task-oriented dialogue generation systems yet. In this study, we apply these models on the three different datasets for training task-oriented chatbots. Our finding shows that self-attentional models can be exploited to create end-to-end task-oriented chatbots which not only achieve higher evaluation scores compared to recurrence-based models, but also do so more efficiently.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems