Robust Task-Oriented Dialogue Generation with Contrastive Pre-training and Adversarial Filtering
Shiquan Yang, Xinting Huang, Jey Han Lau, Sarah Erfani

TL;DR
This paper addresses data artifacts in task-oriented dialogue datasets by proposing contrastive pre-training and adversarial filtering, leading to models that generalize better across domains and adversarial settings.
Contribution
It introduces a contrastive learning framework combined with adversarial filtering to reduce reliance on spurious data artifacts in dialogue models.
Findings
Models trained with the proposed methods outperform baselines in cross-domain tests.
The approach significantly reduces model reliance on frequent phrase shortcuts.
Robustness to adversarial and dataset shifts is substantially improved.
Abstract
Data artifacts incentivize machine learning models to learn non-transferable generalizations by taking advantage of shortcuts in the data, and there is growing evidence that data artifacts play a role for the strong results that deep learning models achieve in recent natural language processing benchmarks. In this paper, we focus on task-oriented dialogue and investigate whether popular datasets such as MultiWOZ contain such data artifacts. We found that by only keeping frequent phrases in the training examples, state-of-the-art models perform similarly compared to the variant trained with full data, suggesting they exploit these spurious correlations to solve the task. Motivated by this, we propose a contrastive learning based framework to encourage the model to ignore these cues and focus on learning generalisable patterns. We also experiment with adversarial filtering to remove…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsContrastive Learning
