Effects of Language Relatedness for Cross-lingual Transfer Learning in Character-Based Language Models
Mittul Singh, Peter Smit, Sami Virpioja, Mikko Kurimo

TL;DR
This paper investigates how the relatedness of source languages affects cross-lingual transfer learning in character-based neural language models for low-resource speech recognition, showing related languages improve performance.
Contribution
It demonstrates that for character-based NNLMs, pretraining with related languages enhances ASR performance, unlike prior findings for multi-character models, especially in low-resource settings.
Findings
Related source languages improve transfer learning in character NNLMs.
Unrelated languages can deteriorate ASR performance.
Benefits are larger with less target data.
Abstract
Character-based Neural Network Language Models (NNLM) have the advantage of smaller vocabulary and thus faster training times in comparison to NNLMs based on multi-character units. However, in low-resource scenarios, both the character and multi-character NNLMs suffer from data sparsity. In such scenarios, cross-lingual transfer has improved multi-character NNLM performance by allowing information transfer from a source to the target language. In the same vein, we propose to use cross-lingual transfer for character NNLMs applied to low-resource Automatic Speech Recognition (ASR). However, applying cross-lingual transfer to character NNLMs is not as straightforward. We observe that relatedness of the source language plays an important role in cross-lingual pretraining of character NNLMs. We evaluate this aspect on ASR tasks for two target languages: Finnish (with English and Estonian as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Topic Modeling
