End to End Dialogue Transformer

Ond\v{r}ej M\v{e}kota; Memduh G\"ok{\i}rmak; Petr Laitoch

arXiv:2008.10392·cs.CL·August 25, 2020

End to End Dialogue Transformer

Ond\v{r}ej M\v{e}kota, Memduh G\"ok{\i}rmak, Petr Laitoch

PDF

Open Access

TL;DR

This paper introduces an end-to-end dialogue system leveraging Transformer architecture, replacing RNN-based models like Sequicity, to improve conversational AI capabilities.

Contribution

It presents a novel Transformer-based dialogue system that operates in an end-to-end sequence-to-sequence manner, enhancing upon prior RNN-based approaches.

Findings

01

Transformer-based model achieves competitive dialogue generation.

02

End-to-end architecture simplifies the dialogue pipeline.

03

Potential improvements in response quality and coherence.

Abstract

Dialogue systems attempt to facilitate conversations between humans and computers, for purposes as diverse as small talk to booking a vacation. We are here inspired by the performance of the recurrent neural network-based model Sequicity, which when conducting a dialogue uses a sequence-to-sequence architecture to first produce a textual representation of what is going on in the dialogue, and in a further step use this along with database findings to produce a reply to the user. We here propose a dialogue system based on the Transformer architecture instead of Sequicity's RNN-based architecture, that works similarly in an end-to-end, sequence-to-sequence fashion.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Dropout · Label Smoothing · Multi-Head Attention · Residual Connection · Softmax