Multilingual and Cross-Lingual Intent Detection from Spoken Data

Daniela Gerz; Pei-Hao Su; Razvan Kusztos; Avishek Mondal; Micha{\l}; Lis; Eshan Singhal; Nikola Mrk\v{s}i\'c; Tsung-Hsien Wen; Ivan Vuli\'c

arXiv:2104.08524·cs.CL·April 20, 2021

Multilingual and Cross-Lingual Intent Detection from Spoken Data

Daniela Gerz, Pei-Hao Su, Razvan Kusztos, Avishek Mondal, Micha{\l}, Lis, Eshan Singhal, Nikola Mrk\v{s}i\'c, Tsung-Hsien Wen, Ivan Vuli\'c

PDF

4 Datasets

TL;DR

This paper introduces MInDS-14, a new multilingual spoken intent detection dataset, and demonstrates that combining machine translation with multilingual encoders improves intent detection across 14 languages, advancing inclusive multilingual spoken language understanding.

Contribution

The paper presents MInDS-14, the first resource for spoken intent detection across 14 languages, and analyzes the effectiveness of translation and multilingual encoders for this task.

Findings

01

Combining machine translation with LaBSE improves intent detection accuracy.

02

Zero-shot and few-shot learning approaches are effective across languages.

03

Speech recognition impacts intent detection performance.

Abstract

We present a systematic study on multilingual and cross-lingual intent detection from spoken data. The study leverages a new resource put forth in this work, termed MInDS-14, a first training and evaluation resource for the intent detection task with spoken data. It covers 14 intents extracted from a commercial system in the e-banking domain, associated with spoken examples in 14 diverse language varieties. Our key results indicate that combining machine translation models with state-of-the-art multilingual sentence encoders (e.g., LaBSE) can yield strong intent detectors in the majority of target languages covered in MInDS-14, and offer comparative analyses across different axes: e.g., zero-shot versus few-shot learning, translation direction, and impact of speech recognition. We see this work as an important step towards more inclusive development and evaluation of multilingual intent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.