Task-Optimized Adapters for an End-to-End Task-Oriented Dialogue System

Namo Bang; Jeehyun Lee; Myoung-Wan Koo

arXiv:2305.02468·cs.CL·June 1, 2023·2 cites

Task-Optimized Adapters for an End-to-End Task-Oriented Dialogue System

Namo Bang, Jeehyun Lee, Myoung-Wan Koo

PDF

Open Access 2 Repos

TL;DR

This paper introduces task-optimized adapters for end-to-end task-oriented dialogue systems, enabling lightweight, independent task learning, and improves dialogue state tracking and response generation with reinforcement learning, achieving state-of-the-art results.

Contribution

The paper proposes a model-agnostic adapter-based approach for task-specific modules in dialogue systems, reducing complexity and enhancing performance without prompt-tuning.

Findings

01

Achieves state-of-the-art dialogue state tracking accuracy on MultiWOZ.

02

Demonstrates competitive overall performance on the MultiWOZ benchmark.

03

Enhances response generation quality through reinforcement learning.

Abstract

Task-Oriented Dialogue (TOD) systems are designed to carry out specific tasks by tracking dialogue states and generating appropriate responses to help users achieve defined goals. Recently, end-to-end dialogue models pre-trained based on large datasets have shown promising performance in the conversational system. However, they share the same parameters to train tasks of the dialogue system (NLU, DST, NLG), so debugging each task is challenging. Also, they require a lot of effort to fine-tune large parameters to create a task-oriented chatbot, making it difficult for non-experts to handle. Therefore, we intend to train relatively lightweight and fast models compared to PLM. In this paper, we propose an End-to-end TOD system with Task-Optimized Adapters which learn independently per task, adding only small number of parameters after fixed layers of pre-trained network. We also enhance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Topic Modeling · Speech Recognition and Synthesis

MethodsDynamic Sparse Training · Adapter