Leveraging Auxiliary Domain Parallel Data in Intermediate Task   Fine-tuning for Low-resource Translation

Shravan Nayak; Surangika Ranathunga; Sarubi Thillainathan; Rikki Hung,; Anthony Rinaldi; Yining Wang; Jonah Mackey; Andrew Ho; En-Shiun Annie Lee

arXiv:2306.01382·cs.CL·September 26, 2023·1 cites

Leveraging Auxiliary Domain Parallel Data in Intermediate Task Fine-tuning for Low-resource Translation

Shravan Nayak, Surangika Ranathunga, Sarubi Thillainathan, Rikki Hung,, Anthony Rinaldi, Yining Wang, Jonah Mackey, Andrew Ho, En-Shiun Annie Lee

PDF

Open Access 1 Repo

TL;DR

This paper explores how intermediate-task fine-tuning of pre-trained multilingual models can improve low-resource, domain-specific neural machine translation, especially for under-represented languages and limited data scenarios.

Contribution

It demonstrates that intermediate-task fine-tuning enhances domain adaptation in low-resource NMT and mitigates domain divergence effects for under-represented languages.

Findings

01

ITFT improves translation quality in low-resource settings

02

ITFT reduces domain divergence impact

03

Under-represented languages benefit from auxiliary data

Abstract

NMT systems trained on Pre-trained Multilingual Sequence-Sequence (PMSS) models flounder when sufficient amounts of parallel data is not available for fine-tuning. This specifically holds for languages missing/under-represented in these models. The problem gets aggravated when the data comes from different domains. In this paper, we show that intermediate-task fine-tuning (ITFT) of PMSS models is extremely beneficial for domain-specific NMT, especially when target domain data is limited/unavailable and the considered languages are missing or under-represented in the PMSS model. We quantify the domain-specific results variations using a domain-divergence test, and show that ITFT can mitigate the impact of domain divergence to some extent.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

LRLNMT/BibleCorpus
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification