Does language matter for spoken word classification? A multilingual generative meta-learning approach

Batsirayi Mupamhi Ziki; Louise Beyers; Ruan van der Merwe

arXiv:2605.13084·cs.CL·May 15, 2026

Does language matter for spoken word classification? A multilingual generative meta-learning approach

Batsirayi Mupamhi Ziki, Louise Beyers, Ruan van der Merwe

PDF

TL;DR

This paper explores the effectiveness of a generative meta-learning approach for multilingual spoken word classification, highlighting data exposure over language diversity.

Contribution

It applies a generative meta-continual learning algorithm to multilingual spoken word classification, demonstrating its viability and analyzing factors influencing performance.

Findings

01

Multilingual models perform best overall.

02

Differences in model performance are surprisingly small.

03

Training data volume correlates more with performance than language diversity.

Abstract

Meta-learning has been shown to have better performance than supervised learning for few-shot monolingual spoken word classification. However, the meta-learning approach remains under-explored in multilingual spoken word classification. In this paper, we apply the Generative Meta-Continual Learning algorithm to spoken word classification. The generative nature of this algorithm makes it viable for use in application, and the meta-learning aspect promotes generalisation, which is crucial in a multilingual setting. We train monolingual models on English, German, French, and Catalan, a bilingual model on English and German, and a multilingual model on all four languages. We find that although the multilingual model performs best, the differences between model performance is unexpectedly low. We also find that the hours of unique data seen during training seems to be a stronger performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.