Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers

Loren Lugosch; Piyush Papreja; Mirco Ravanelli; Abdelwahab Heba; Titouan Parcollet

arXiv:2104.01604 · cs.CL · October 4, 2021

TL;DR

This paper presents Timers and Such, a new open source dataset of spoken English commands involving numbers, designed to improve spoken language understanding for voice control applications, along with baseline model experiments.

Contribution

The paper introduces a novel dataset, Timers and Such, filling a gap in spoken language understanding resources for number-related commands, and provides baseline model evaluations.

Findings

01. Baseline models achieve moderate accuracy on the dataset.

02. The dataset enables targeted evaluation of number understanding in speech models.

03. Open source code facilitates further research and development.

Abstract

This paper introduces Timers and Such, a new open source dataset of spoken English commands for common voice control use cases involving numbers. We describe the gap in existing spoken language understanding datasets that Timers and Such fills, the design and creation of the dataset, and experiments with a number of ASR-based and end-to-end baseline models, the code for which has been made available as part of the SpeechBrain toolkit.
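One common way to score spoken language understanding output on command data like this is exact-match accuracy over the full parse. The following is a minimal sketch of that idea only, not the evaluation code from the paper or from SpeechBrain. The parse format (a dictionary with `intent` and `slots` keys), the intent and slot names, and the function `exact_match_accuracy` are all assumptions made for illustration.

```python
from typing import Any, Dict, List

# Hypothetical parse format, assumed for illustration:
# {"intent": <str>, "slots": {<slot name>: <value>}}
Parse = Dict[str, Any]


def exact_match_accuracy(predictions: List[Parse], references: List[Parse]) -> float:
    """Return the fraction of utterances whose whole parse matches the reference.

    A prediction counts as correct only if the intent and every slot value
    are identical, so a single wrong number makes the utterance wrong.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0
    correct = sum(1 for pred, ref in zip(predictions, references) if pred == ref)
    return correct / len(references)


# Illustrative labels only; these intent and slot names are not taken from the dataset.
references = [
    {"intent": "SetTimer", "slots": {"minutes": 12}},
    {"intent": "SetAlarm", "slots": {"hour": 7, "minute": 30}},
]
predictions = [
    {"intent": "SetTimer", "slots": {"minutes": 12}},
    {"intent": "SetAlarm", "slots": {"hour": 7, "minute": 13}},  # misheard number
]

accuracy = exact_match_accuracy(predictions, references)
print(f"exact-match accuracy: {accuracy:.2f}")
```

Scoring the whole parse at once is a deliberately strict choice: for commands involving numbers, a correct intent with a wrong number is still a failed command.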

Code & Models

Models

speechbrain/slu-timers-and-such-direct-librispeech-asr (Hugging Face model; 63 downloads, 1 like)

Taxonomy

Topics: Speech Recognition and Synthesis · Speech and dialogue systems · Natural Language Processing Techniques