Out-of-distribution generalisation in spoken language understanding

Dejan Porjazovski; Anssi Moisio; Mikko Kurimo

arXiv:2407.07425·cs.CL·September 30, 2024

Out-of-distribution generalisation in spoken language understanding

Dejan Porjazovski, Anssi Moisio, Mikko Kurimo

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new dataset split for evaluating out-of-distribution generalisation in spoken language understanding, revealing current models' limited ability to generalise and highlighting the need for novel techniques.

Contribution

It presents SLURPFOOD, a modified SLU dataset with OOD splits, and analyses the generalisation challenges faced by models using interpretability methods.

Findings

01

End-to-end SLU models show limited OOD generalisation.

02

Model interpretability reveals factors affecting generalisation.

03

Two techniques improve some OOD split performances.

Abstract

Test data is said to be out-of-distribution (OOD) when it unexpectedly differs from the training data, a common challenge in real-world use cases of machine learning. Although OOD generalisation has gained interest in recent years, few works have focused on OOD generalisation in spoken language understanding (SLU) tasks. To facilitate research on this topic, we introduce a modified version of the popular SLU dataset SLURP, featuring data splits for testing OOD generalisation in the SLU task. We call our modified dataset SLURP For OOD generalisation, or SLURPFOOD. Utilising our OOD data splits, we find end-to-end SLU models to have limited capacity for generalisation. Furthermore, by employing model interpretability techniques, we shed light on the factors contributing to the generalisation difficulties of the models. To improve the generalisation, we experiment with two techniques,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aalto-speech/slurpfood
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech and dialogue systems