Task-Optimized Adapters for an End-to-End Task-Oriented Dialogue System
Namo Bang, Jeehyun Lee, Myoung-Wan Koo

TL;DR
This paper introduces task-optimized adapters for end-to-end task-oriented dialogue systems, enabling lightweight, independent task learning, and improves dialogue state tracking and response generation with reinforcement learning, achieving state-of-the-art results.
Contribution
The paper proposes a model-agnostic adapter-based approach for task-specific modules in dialogue systems, reducing complexity and enhancing performance without prompt-tuning.
Findings
Achieves state-of-the-art dialogue state tracking accuracy on MultiWOZ.
Demonstrates competitive overall performance on the MultiWOZ benchmark.
Enhances response generation quality through reinforcement learning.
Abstract
Task-Oriented Dialogue (TOD) systems are designed to carry out specific tasks by tracking dialogue states and generating appropriate responses to help users achieve defined goals. Recently, end-to-end dialogue models pre-trained based on large datasets have shown promising performance in the conversational system. However, they share the same parameters to train tasks of the dialogue system (NLU, DST, NLG), so debugging each task is challenging. Also, they require a lot of effort to fine-tune large parameters to create a task-oriented chatbot, making it difficult for non-experts to handle. Therefore, we intend to train relatively lightweight and fast models compared to PLM. In this paper, we propose an End-to-end TOD system with Task-Optimized Adapters which learn independently per task, adding only small number of parameters after fixed layers of pre-trained network. We also enhance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · Speech Recognition and Synthesis
MethodsDynamic Sparse Training · Adapter
