Does language matter for spoken word classification? A multilingual generative meta-learning approach
Batsirayi Mupamhi Ziki, Louise Beyers, Ruan van der Merwe

TL;DR
This paper explores the effectiveness of a generative meta-learning approach for multilingual spoken word classification, highlighting data exposure over language diversity.
Contribution
It applies a generative meta-continual learning algorithm to multilingual spoken word classification, demonstrating its viability and analyzing factors influencing performance.
Findings
Multilingual models perform best overall.
Differences in model performance are surprisingly small.
Training data volume correlates more with performance than language diversity.
Abstract
Meta-learning has been shown to have better performance than supervised learning for few-shot monolingual spoken word classification. However, the meta-learning approach remains under-explored in multilingual spoken word classification. In this paper, we apply the Generative Meta-Continual Learning algorithm to spoken word classification. The generative nature of this algorithm makes it viable for use in application, and the meta-learning aspect promotes generalisation, which is crucial in a multilingual setting. We train monolingual models on English, German, French, and Catalan, a bilingual model on English and German, and a multilingual model on all four languages. We find that although the multilingual model performs best, the differences between model performance is unexpectedly low. We also find that the hours of unique data seen during training seems to be a stronger performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
