Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt   Tuning

Mingyu Derek Ma; Jiun-Yu Kao; Shuyang Gao; Arpit Gupta; Di Jin,; Tagyoung Chung; Nanyun Peng

arXiv:2301.10915·cs.CL·May 31, 2023·1 cites

Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning

Mingyu Derek Ma, Jiun-Yu Kao, Shuyang Gao, Arpit Gupta, Di Jin,, Tagyoung Chung, Nanyun Peng

PDF

Open Access

TL;DR

This paper introduces a prompt tuning approach for dialogue state tracking that significantly reduces parameter requirements and improves performance in low-resource settings by using soft prompt token embeddings without fine-tuning the entire language model.

Contribution

It proposes a novel soft prompt tuning method for DST that minimizes parameter tuning and enhances low-resource domain adaptation.

Findings

01

Reduces parameters to less than 0.5% of traditional fine-tuning methods.

02

Achieves better performance in low-resource DST scenarios.

03

Requires no tuning of the entire language model.

Abstract

Dialogue state tracking (DST) is an important step in dialogue management to keep track of users' beliefs. Existing works fine-tune all language model (LM) parameters to tackle the DST task, which requires significant data and computing resources for training and hosting. The cost grows exponentially in the real-world deployment where dozens of fine-tuned LM are used for different domains and tasks. To reduce parameter size and better utilize cross-task shared information, we propose to use soft prompt token embeddings to learn task properties. Without tuning LM parameters, our method drastically reduces the number of parameters needed to less than 0.5% of prior works while achieves better low-resource DST performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Topic Modeling · Context-Aware Activity Recognition Systems

MethodsDynamic Sparse Training