Massively Parallel Cross-Lingual Learning in Low-Resource Target Language Translation
Zhong Zhou, Matthias Sperber, Alex Waibel

TL;DR
This paper presents a scalable cross-lingual translation system that improves low-resource language translation by leveraging multiple European language families, addressing data scarcity, transfer methods, and variable-binding issues.
Contribution
It introduces a novel multi-family transfer approach with family labels, an ablation study on data requirements, and an order-preserving model for variable-binding in low-resource translation.
Findings
+9.9 BLEU score improvement for English-Swedish translation
Training on two neighboring language families suffices
Achieved 60.6% accuracy in variable-binding task
Abstract
We work on translation from rich-resource languages to low-resource languages. The main challenges we identify are the lack of low-resource language data, effective methods for cross-lingual transfer, and the variable-binding problem that is common in neural systems. We build a translation system that addresses these challenges using eight European language families as our test ground. Firstly, we add the source and the target family labels and study intra-family and inter-family influences for effective cross-lingual transfer. We achieve an improvement of +9.9 in BLEU score for English-Swedish translation using eight families compared to the single-family multi-source multi-target baseline. Moreover, we find that training on two neighboring families closest to the low-resource language is often enough. Secondly, we construct an ablation study and find that reasonably good results can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
