Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts

Ruolin Su; Biing-Hwang Juang

arXiv:2405.09744·cs.CL·May 17, 2024

TL;DR

This paper introduces SMETOD, a module-based mixture-of-experts approach for task-oriented dialogue systems in which specialized subcomponents each handle a subproblem. It reports better results than existing models on intent prediction, dialogue state tracking, and response generation while keeping inference efficient.

Contribution

The paper proposes a novel Soft Mixture-of-Experts framework for dialogue systems, enhancing scalability, flexibility, and inference efficiency while achieving state-of-the-art results.

Findings

01 SMETOD outperforms baseline models on intent prediction, dialogue state tracking, and response generation.

02 SMETOD maintains high inference efficiency with reduced computational costs.

03 Experimental results demonstrate superior accuracy and problem-solving ability.

Abstract

Task-oriented dialogue systems are broadly used in virtual assistants and other automated services, providing interfaces between users and machines to facilitate specific tasks. Nowadays, task-oriented dialogue systems have greatly benefited from pre-trained language models (PLMs). However, their task-solving performance is constrained by the inherent capacities of PLMs, and scaling these models is expensive and complex as the model size becomes larger. To address these challenges, we propose Soft Mixture-of-Expert Task-Oriented Dialogue system (SMETOD) which leverages an ensemble of Mixture-of-Experts (MoEs) to excel at subproblems and generate specialized outputs for task-oriented dialogues. SMETOD also scales up a task-oriented dialogue system with simplicity and flexibility while maintaining inference efficiency. We extensively evaluate our model on three benchmark functionalities:…
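To make the "soft" mixture-of-experts idea concrete, here is a minimal NumPy sketch of a generic soft-routing layer, in which every token contributes to every expert slot through softmax weights instead of being hard-assigned to one expert. It is an illustration under stated assumptions, not the authors' implementation: the function name `soft_moe_layer`, the slot parameterization, and the single-matrix `tanh` experts are all hypothetical choices made for brevity.

```python
import numpy as np


def softmax(x, axis):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def soft_moe_layer(tokens, slot_params, expert_weights):
    """Generic soft mixture-of-experts layer (illustrative sketch).

    tokens:         (n_tokens, d) input token representations
    slot_params:    (d, n_slots) learnable slot embeddings (hypothetical name)
    expert_weights: list of (d, d) matrices, one toy expert per entry
    """
    n_experts = len(expert_weights)
    n_slots = slot_params.shape[1]
    assert n_slots % n_experts == 0, "slots must divide evenly among experts"
    slots_per_expert = n_slots // n_experts

    logits = tokens @ slot_params          # (n_tokens, n_slots)
    dispatch = softmax(logits, axis=0)     # per slot: weights over tokens sum to 1
    combine = softmax(logits, axis=1)      # per token: weights over slots sum to 1

    # Each slot receives a weighted average of all tokens.
    slot_inputs = dispatch.T @ tokens      # (n_slots, d)

    # Each expert processes only its own group of slots.
    slot_outputs = np.empty_like(slot_inputs)
    for e, w in enumerate(expert_weights):
        lo, hi = e * slots_per_expert, (e + 1) * slots_per_expert
        slot_outputs[lo:hi] = np.tanh(slot_inputs[lo:hi] @ w)

    # Each token's output is a weighted average of all slot outputs.
    return combine @ slot_outputs          # (n_tokens, d)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    tokens = rng.normal(size=(5, 8))
    slot_params = rng.normal(size=(8, 6))                  # 3 experts x 2 slots
    experts = [rng.normal(size=(8, 8)) for _ in range(3)]
    print(soft_moe_layer(tokens, slot_params, experts).shape)
```

In this sketch the per-expert cost depends on the number of slots rather than the number of tokens, which is one way adding experts can increase capacity without a proportional increase in per-token compute. Whether SMETOD uses exactly this routing is not stated in the text above.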

Taxonomy

Topics: Speech and dialogue systems · Multi-Agent Systems and Negotiation · Context-Aware Activity Recognition Systems