Competence-based Curriculum Learning for Multilingual Machine Translation
Mingliang Zhang, Fandong Meng, Yunhai Tong, Jie Zhou

TL;DR
This paper introduces CCL-M, a curriculum learning approach that balances language competencies in multilingual machine translation, significantly improving performance especially for low-resource languages.
Contribution
It proposes a novel competence-based curriculum learning method with dynamic sampling to address imbalance in multilingual translation models.
Findings
Achieves significant performance gains over state-of-the-art methods.
Effectively balances learning across high-resource and low-resource languages.
Improves translation quality on the TED talks dataset.
Abstract
Currently, multilingual machine translation is receiving more and more attention since it brings better performance for low resource languages (LRLs) and saves more space. However, existing multilingual machine translation models face a severe challenge: imbalance. As a result, the translation performance of different languages in multilingual translation models are quite different. We argue that this imbalance problem stems from the different learning competencies of different languages. Therefore, we focus on balancing the learning competencies of different languages and propose Competence-based Curriculum Learning for Multilingual Machine Translation, named CCL-M. Specifically, we firstly define two competencies to help schedule the high resource languages (HRLs) and the low resource languages: 1) Self-evaluated Competence, evaluating how well the language itself has been learned;…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
