NATURE: Natural Auxiliary Text Utterances for Realistic Spoken Language Evaluation

David Alfonso-Hermelo; Ahmad Rashid; Abbas Ghaddar; Philippe Langlais; Mehdi Rezagholizadeh

arXiv:2111.05196 · cs.CL · January 31, 2022

TL;DR

This paper introduces NATURE, a set of transformations that simulate human spoken language variations to evaluate the robustness of slot-filling and intent detection models, revealing significant performance drops on standard benchmarks.

Contribution

NATURE provides a simple, effective method to test the generalization of spoken language understanding models under realistic spoken language variations.

Findings

01. Model accuracy drops by up to 40% with NATURE transformations.

02. Standard benchmarks may overestimate model robustness in real-world scenarios.

03. Simple perturbations reveal vulnerabilities in current spoken language models.

Abstract

Slot-filling and intent detection are the backbone of conversational agents such as voice assistants, and are active areas of research. Even though state-of-the-art techniques show impressive performance on publicly available benchmarks, their ability to generalize to realistic scenarios is yet to be demonstrated. In this work, we present NATURE, a set of simple spoken-language-oriented transformations, applied to the evaluation sets of datasets, that introduce human spoken language variations while preserving the semantics of an utterance. We apply NATURE to common slot-filling and intent detection benchmarks and demonstrate that simple NATURE perturbations of the standard evaluation set can deteriorate model performance significantly. Through our experiments we demonstrate that when NATURE operators are applied to the evaluation sets of popular benchmarks, model accuracy can drop by…
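
To make the idea of a semantics-preserving, spoken-language perturbation concrete, here is a minimal sketch in Python. It is not one of the paper's operators: the abstract does not list them, so the filler-word insertion, the `FILLERS` list, the BIO label scheme, and the function name `insert_filler` are all assumptions chosen for illustration. The sketch inserts one filler token into a slot-labelled utterance without splitting any slot span, so the gold slots and intent stay unchanged.

```python
import random
from typing import List, Tuple

# Hypothetical filler vocabulary; not taken from the paper.
FILLERS = ["uh", "um", "like"]


def insert_filler(
    tokens: List[str], labels: List[str], rng: random.Random
) -> Tuple[List[str], List[str]]:
    """Insert one filler token, labelled "O", without splitting a slot span.

    Assumes BIO slot labels (e.g. "B-artist", "I-artist", "O").
    """
    if len(tokens) != len(labels):
        raise ValueError("tokens and labels must have the same length")

    # A position is safe if the token that would follow the filler does not
    # continue a span (label starting with "I-"). The end is always safe.
    safe_positions = [
        i
        for i in range(len(tokens) + 1)
        if i == len(tokens) or not labels[i].startswith("I-")
    ]
    pos = rng.choice(safe_positions)
    filler = rng.choice(FILLERS)

    new_tokens = tokens[:pos] + [filler] + tokens[pos:]
    new_labels = labels[:pos] + ["O"] + labels[pos:]
    return new_tokens, new_labels


if __name__ == "__main__":
    tokens = "play jazz by miles davis".split()
    labels = ["O", "B-genre", "O", "B-artist", "I-artist"]
    perturbed_tokens, perturbed_labels = insert_filler(
        tokens, labels, random.Random(0)
    )
    print(list(zip(perturbed_tokens, perturbed_labels)))
```

Applying such a function to every utterance in an evaluation set, and then re-scoring a trained model on the perturbed copy, is one way to measure the kind of robustness gap the abstract describes. The training data is left untouched.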

Taxonomy

Topics: Topic Modeling · Speech and dialogue systems · Speech Recognition and Synthesis