Robust Task-Oriented Dialogue Generation with Contrastive Pre-training   and Adversarial Filtering

Shiquan Yang; Xinting Huang; Jey Han Lau; Sarah Erfani

arXiv:2205.10363·cs.CL·May 24, 2022·1 cites

Robust Task-Oriented Dialogue Generation with Contrastive Pre-training and Adversarial Filtering

Shiquan Yang, Xinting Huang, Jey Han Lau, Sarah Erfani

PDF

Open Access

TL;DR

This paper addresses data artifacts in task-oriented dialogue datasets by proposing contrastive pre-training and adversarial filtering, leading to models that generalize better across domains and adversarial settings.

Contribution

It introduces a contrastive learning framework combined with adversarial filtering to reduce reliance on spurious data artifacts in dialogue models.

Findings

01

Models trained with the proposed methods outperform baselines in cross-domain tests.

02

The approach significantly reduces model reliance on frequent phrase shortcuts.

03

Robustness to adversarial and dataset shifts is substantially improved.

Abstract

Data artifacts incentivize machine learning models to learn non-transferable generalizations by taking advantage of shortcuts in the data, and there is growing evidence that data artifacts play a role for the strong results that deep learning models achieve in recent natural language processing benchmarks. In this paper, we focus on task-oriented dialogue and investigate whether popular datasets such as MultiWOZ contain such data artifacts. We found that by only keeping frequent phrases in the training examples, state-of-the-art models perform similarly compared to the variant trained with full data, suggesting they exploit these spurious correlations to solve the task. Motivated by this, we propose a contrastive learning based framework to encourage the model to ignore these cues and focus on learning generalisable patterns. We also experiment with adversarial filtering to remove…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

MethodsContrastive Learning