A Comparative Study on Language Models for Task-Oriented Dialogue   Systems

Vinsen Marselino Andreas; Genta Indra Winata; Ayu Purwarianti

arXiv:2201.08687·cs.CL·January 24, 2022

A Comparative Study on Language Models for Task-Oriented Dialogue Systems

Vinsen Marselino Andreas, Genta Indra Winata, Ayu Purwarianti

PDF

Open Access 1 Repo

TL;DR

This paper compares recent pretrained language models like BART and T5 for end-to-end task-oriented dialogue systems, demonstrating their superior performance and ability to generate accurate, fluent responses without dialogue state tracking.

Contribution

It provides a comprehensive comparison of pretrained models for ToD, highlighting their effectiveness and establishing new state-of-the-art results.

Findings

01

BART and T5 outperform GPT-based models in BLEU and F1 scores

02

Fine-tuning improves response fluency and accuracy

03

Models effectively incorporate knowledge to reduce hallucinations

Abstract

The recent development of language models has shown promising results by achieving state-of-the-art performance on various natural language tasks by fine-tuning pretrained models. In task-oriented dialogue (ToD) systems, language models can be used for end-to-end training without relying on dialogue state tracking to track the dialogue history but allowing the language models to generate responses according to the context given as input. This paper conducts a comparative study to show the effectiveness and strength of using recent pretrained models for fine-tuning, such as BART and T5, on endto-end ToD systems. The experimental results show substantial performance improvements after language model fine-tuning. The models produce more fluent responses after adding knowledge to the context that guides the model to avoid hallucination and generate accurate entities in the generated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sen33/end-to-end-dialogue-system
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Intelligent Tutoring Systems and Adaptive Learning

MethodsAttention Is All You Need · Linear Layer · Inverse Square Root Schedule · Residual Connection · Dropout · SentencePiece · Layer Normalization · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Attention Dropout