SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding   Tasks

Suwon Shon; Siddhant Arora; Chyi-Jiunn Lin; Ankita Pasad; Felix Wu,; Roshan Sharma; Wei-Lun Wu; Hung-Yi Lee; Karen Livescu; Shinji Watanabe

arXiv:2212.10525·cs.CL·June 19, 2023

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks

Suwon Shon, Siddhant Arora, Chyi-Jiunn Lin, Ankita Pasad, Felix Wu,, Roshan Sharma, Wei-Lun Wu, Hung-Yi Lee, Karen Livescu, Shinji Watanabe

PDF

Open Access 2 Datasets

TL;DR

This paper introduces a new suite of diverse spoken language understanding benchmark tasks based on freely available data, filling gaps in existing SLU evaluation resources and enabling better model development and comparison.

Contribution

It presents four novel SLU tasks, provides annotated datasets and baseline models, and analyzes the impact of speech recognition accuracy on SLU performance.

Findings

01

Baseline models demonstrate competitive performance on new tasks.

02

Speech recognition accuracy significantly affects SLU pipeline performance.

03

Annotated datasets and benchmarks are publicly available for research use.

Abstract

Spoken language understanding (SLU) tasks have been studied for many decades in the speech research community, but have not received as much attention as lower-level tasks like speech and speaker recognition. In particular, there are not nearly as many SLU task benchmarks, and many of the existing ones use data that is not freely available to all researchers. Recent work has begun to introduce such benchmark datasets for several tasks. In this work, we introduce several new annotated SLU benchmark tasks based on freely available speech data, which complement existing benchmarks and address gaps in the SLU evaluation landscape. We contribute four tasks: question answering and summarization involve inference over longer speech sequences; named entity localization addresses the speech-specific task of locating the targeted content in the signal; dialog act classification identifies the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsTest