On the Robustness of Intent Classification and Slot Labeling in   Goal-oriented Dialog Systems to Real-world Noise

Sailik Sengupta; Jason Krone; Saab Mansour

arXiv:2104.07149·cs.CL·November 3, 2021

On the Robustness of Intent Classification and Slot Labeling in Goal-oriented Dialog Systems to Real-world Noise

Sailik Sengupta, Jason Krone, Saab Mansour

PDF

Open Access 1 Repo

TL;DR

This paper evaluates the robustness of intent classification and slot labeling models in goal-oriented dialogue systems against real-world noise, and proposes data augmentation techniques to improve their resilience across multiple noise types.

Contribution

It introduces a comprehensive noise test-suite and a novel data augmentation approach that enhances model robustness to diverse real-world noise in dialogue systems.

Findings

01

Noise significantly degrades model performance.

02

Data augmentation improves robustness by over 10% in accuracy.

03

First model to be robust across multiple noise types.

Abstract

Intent Classification (IC) and Slot Labeling (SL) models, which form the basis of dialogue systems, often encounter noisy data in real-word environments. In this work, we investigate how robust IC/SL models are to noisy data. We collect and publicly release a test-suite for seven common noise types found in production human-to-bot conversations (abbreviations, casing, misspellings, morphological variants, paraphrases, punctuation and synonyms). On this test-suite, we show that common noise types substantially degrade the IC accuracy and SL F1 performance of state-of-the-art BERT-based IC/SL models. By leveraging cross-noise robustness transfer -- training on one noise type to improve robustness on another noise type -- we design aggregate data-augmentation approaches that increase the model performance across all seven noise types by +10.8% for IC accuracy and +15 points for SL F1 on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amazon-research/real-world-noisy-benchmarks-for-natural-language-understanding
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Linear Decay · Weight Decay · WordPiece · Dropout