Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue   Models

Tong Niu; Mohit Bansal

arXiv:1809.02079·cs.CL·September 7, 2018

Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue Models

Tong Niu, Mohit Bansal

PDF

Open Access 1 Repo

TL;DR

This paper introduces adversarial strategies to identify and improve the robustness of dialogue models against subtle semantic changes, leading to enhanced performance and state-of-the-art results through adversarial training and subword modeling.

Contribution

It proposes novel adversarial strategies for evaluating and training dialogue models, significantly improving their robustness and performance, and introduces subword-based models for efficiency and resilience.

Findings

01

Adversarial training improves robustness and performance.

02

Combined strategies achieve state-of-the-art results.

03

Subword models are efficient and robust without adversarial training.

Abstract

We present two categories of model-agnostic adversarial strategies that reveal the weaknesses of several generative, task-oriented dialogue models: Should-Not-Change strategies that evaluate over-sensitivity to small and semantics-preserving edits, as well as Should-Change strategies that test if a model is over-stable against subtle yet semantics-changing modifications. We next perform adversarial training with each strategy, employing a max-margin approach for negative generative examples. This not only makes the target dialogue model more robust to the adversarial inputs, but also helps it perform significantly better on the original inputs. Moreover, training on all strategies combined achieves further improvements, achieving a new state-of-the-art performance on the original task (also verified via human evaluation). In addition to adversarial training, we also address the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

WolfNiu/AdversarialDialogue
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications