SLURP: A Spoken Language Understanding Resource Package
Emanuele Bastianelli, Andrea Vanzo, Pawel Swietojanski, Verena Rieser

TL;DR
SLURP is a comprehensive resource package for spoken language understanding, including a large diverse dataset, baseline models, and a new evaluation metric to advance research in audio-based semantic understanding.
Contribution
It introduces a new challenging English SLU dataset, competitive baselines, and a transparent metric for detailed error analysis, addressing limitations of existing resources.
Findings
The dataset covers 18 diverse domains, surpassing existing datasets in size and diversity.
Baseline models demonstrate competitive performance on the new dataset.
The new metric enables detailed error analysis for entity labelling improvements.
Abstract
Spoken Language Understanding infers semantic meaning directly from audio data, and thus promises to reduce error propagation and misunderstandings in end-user applications. However, publicly available SLU resources are limited. In this paper, we release SLURP, a new SLU package containing the following: (1) A new challenging dataset in English spanning 18 domains, which is substantially bigger and linguistically more diverse than existing datasets; (2) Competitive baselines based on state-of-the-art NLU and ASR systems; (3) A new transparent metric for entity labelling which enables a detailed error analysis for identifying potential areas of improvement. SLURP is available at https: //github.com/pswietojanski/slurp.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗speechbrain/SLU-direct-SLURP-hubert-encmodel· 20 dl· ♡ 420 dl♡ 4
- 🤗nvidia/slu_conformer_transformer_large_slurpmodel· 13 dl· ♡ 213 dl♡ 2
- 🤗alkiskoudounas/hubert-base-slurpmodel· 2 dl2 dl
- 🤗alkiskoudounas/wav2vec2-base-slurpmodel· 5 dl5 dl
- 🤗alkiskoudounas/wav2vec2-large-slurpmodel· 2 dl2 dl
- 🤗alkiskoudounas/hubert-large-slurpmodel· 2 dl2 dl
- 🤗alkiskoudounas/wavlm-base-plus-slurpmodel· 2 dl2 dl
- 🤗alkiskoudounas/hubert-base-unslurpmodel· 2 dl2 dl
- 🤗alkiskoudounas/wav2vec2-base-unslurpmodel· 2 dl2 dl
- 🤗alkiskoudounas/wav2vec2-base-unslurp-goldmodel· 3 dl3 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Music and Audio Processing · Speech Recognition and Synthesis
