Self-Training with Purpose Preserving Augmentation Improves Few-shot Generative Dialogue State Tracking

Jihyun Lee; Chaebin Lee; Yunsu Kim; Gary Geunbae Lee

arXiv:2211.09379·cs.CL·November 18, 2022

TL;DR

This paper introduces a self-training framework with Purpose Preserving Augmentation (PPAug) for few-shot generative dialogue state tracking (DST). The framework uses unlabeled data to reduce labeling effort and improves performance on MultiWOZ 2.1.

Contribution

It presents a novel self-training approach combined with PPAug for few-shot DST, addressing overfitting and improving accuracy with limited labeled data.

Findings

01 Increased performance by approximately 4% on MultiWOZ 2.1 in the few-shot setting with 10% labeled data

02 Improved slot-recall by 8.34% for unseen values compared to the baseline

03 Used PPAug to prevent overfitting during self-training

Abstract

In dialogue state tracking (DST), labeling the dataset involves considerable human labor. We propose a new self-training framework for few-shot generative DST that utilizes unlabeled data. Our self-training method iteratively improves the model by pseudo labeling and employs Purpose Preserving Augmentation (PPAug) to prevent overfitting. We increase performance in the 10% few-shot setting by approximately 4% on MultiWOZ 2.1 and enhance slot-recall by 8.34% for unseen values compared to the baseline.
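
The iterative pseudo-labeling loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the confidence threshold, the word-lookup toy model, the `augment_fn` placeholder (which stands in for PPAug and only assumes the label is left unchanged), and all function names are hypothetical choices made for illustration.

```python
from collections import Counter
from typing import Callable, Dict, List, Tuple

Example = Tuple[str, str]  # (dialogue text, dialogue-state string)
Model = Dict[str, str]     # toy model: word -> state label (assumption)


def self_train(
    labeled: List[Example],
    unlabeled: List[str],
    train_fn: Callable[[List[Example]], Model],
    predict_fn: Callable[[Model, str], Tuple[str, float]],
    augment_fn: Callable[[str], str],
    rounds: int = 2,
    threshold: float = 0.6,  # assumed confidence filter, not from the paper
) -> Tuple[Model, List[Example]]:
    """Generic self-training: train, pseudo-label, augment, retrain."""
    model = train_fn(labeled)
    pseudo: List[Example] = []
    for _ in range(rounds):
        # Pseudo-label the unlabeled dialogues with the current model.
        pseudo = []
        for text in unlabeled:
            state, confidence = predict_fn(model, text)
            if confidence >= threshold:
                pseudo.append((text, state))
        # Augment the inputs while keeping each label unchanged
        # (placeholder for a label-preserving augmentation such as PPAug).
        combined = labeled + pseudo
        augmented = [(augment_fn(text), state) for text, state in combined]
        model = train_fn(combined + augmented)
    return model, pseudo


def toy_train(data: List[Example]) -> Model:
    """Toy stand-in for a generative DST model: memorize word -> state."""
    return {word: state for text, state in data for word in text.split()}


def toy_predict(model: Model, text: str) -> Tuple[str, float]:
    """Predict the most frequent state among known words; confidence is coverage."""
    words = text.split()
    hits = [model[w] for w in words if w in model]
    if not hits:
        return "", 0.0
    return Counter(hits).most_common(1)[0][0], len(hits) / len(words)


labeled = [("book a cheap hotel", "hotel-pricerange=cheap")]
unlabeled = ["cheap hotel please", "what is the weather"]
model, pseudo = self_train(
    labeled, unlabeled, toy_train, toy_predict, lambda t: t + " thanks"
)
print(pseudo)
```

In this toy run only the utterance that overlaps with the labeled example clears the threshold and is added as a pseudo-labeled example; the unrelated utterance stays unlabeled. A real system would replace `toy_train` and `toy_predict` with a generative DST model and its decoding scores.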

Taxonomy

Topics: Speech and dialogue systems · Topic Modeling · Context-Aware Activity Recognition Systems

Methods: Dynamic Sparse Training