Where are we in semantic concept extraction for Spoken Language   Understanding?

Sahar Ghannay; Antoine Caubri\`ere; Salima Mdhaffar; Ga\"elle; Laperri\`ere; Bassam Jabaian; Yannick Est\`eve

arXiv:2106.13045·cs.CL·October 12, 2022

Where are we in semantic concept extraction for Spoken Language Understanding?

Sahar Ghannay, Antoine Caubri\`ere, Salima Mdhaffar, Ga\"elle, Laperri\`ere, Bassam Jabaian, Yannick Est\`eve

PDF

TL;DR

This paper reviews recent advances in semantic concept extraction for Spoken Language Understanding, highlighting the shift to end-to-end neural models and demonstrating significant performance improvements on the French MEDIA benchmark.

Contribution

It provides an overview of recent progress in SLU, especially in end-to-end neural approaches, and reports new results that outperform current state-of-the-art systems.

Findings

01

End-to-end neural models outperform cascade approaches in SLU.

02

Self-supervised training enhances speech and semantic extraction.

03

Achieved a Concept Error Rate of 11.2%, surpassing previous best.

Abstract

Spoken language understanding (SLU) topic has seen a lot of progress these last three years, with the emergence of end-to-end neural approaches. Spoken language understanding refers to natural language processing tasks related to semantic extraction from speech signal, like named entity recognition from speech or slot filling task in a context of human-machine dialogue. Classically, SLU tasks were processed through a cascade approach that consists in applying, firstly, an automatic speech recognition process, followed by a natural language processing module applied to the automatic transcriptions. These three last years, end-to-end neural approaches, based on deep neural networks, have been proposed in order to directly extract the semantics from speech signal, by using a single neural model. More recent works on self-supervised training with unlabeled data open new perspectives in term…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.