Multilingual Transfer Learning for Code-Switched Language and Speech Neural Modeling
Genta Indra Winata

TL;DR
This paper proposes novel multilingual transfer learning methods, including meta-learning, meta-embeddings, multi-task syntactic learning, and data augmentation, to improve code-switching language and speech modeling without relying on linguistic theory.
Contribution
It introduces a suite of innovative transfer learning techniques that effectively model code-switching, outperforming traditional linguistic theory-based approaches.
Findings
Meta-transfer learning adapts quickly from monolingual data to code-switching.
Multilingual meta-embeddings efficiently represent code-switching data across languages.
The proposed model outperforms linguistic theory-based models in code-switching tasks.
Abstract
In this thesis, we address the data scarcity and limitations of linguistic theory by proposing language-agnostic multi-task training methods. First, we introduce a meta-learning-based approach, meta-transfer learning, in which information is judiciously extracted from high-resource monolingual speech data to the code-switching domain. The meta-transfer learning quickly adapts the model to the code-switching task from a number of monolingual tasks by learning to learn in a multi-task learning fashion. Second, we propose a novel multilingual meta-embeddings approach to effectively represent code-switching data by acquiring useful knowledge learned in other languages, learning the commonalities of closely related languages and leveraging lexical composition. The method is far more efficient compared to contextualized pre-trained multilingual models. Third, we introduce multi-task learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Topic Modeling
