SemiAdapt and SemiLoRA: Efficient Domain Adaptation for Transformer-based Low-Resource Language Translation with a Case Study on Irish
Josh McGiff, Nikola S. Nikolov

TL;DR
This paper introduces SemiAdapt and SemiLoRA, innovative semi-supervised, parameter-efficient methods that significantly improve domain adaptation in low-resource neural machine translation, especially for Irish, while reducing computational costs.
Contribution
The paper presents SemiAdapt and SemiLoRA, novel semi-supervised approaches that enhance domain adaptation in NMT using PEFT, outperforming traditional fine-tuning methods in low-resource settings.
Findings
SemiAdapt outperforms full-domain fine-tuning.
SemiLoRA matches or exceeds full-model fine-tuning performance.
Embedding-based inference excels on larger, noisier datasets.
Abstract
Fine-tuning is widely used to tailor large language models for specific tasks such as neural machine translation (NMT). However, leveraging transfer learning is computationally expensive when fine-tuning large multilingual models with billions of parameters, thus creating a barrier to entry for researchers working on low-resource domains such as Irish translation. Parameter-efficient fine-tuning (PEFT) bridges this gap by training on a fraction of the original model parameters, with the Low-Rank Adaptation (LoRA) approach introducing small, trainable adapter layers. We introduce SemiAdapt and SemiLoRA as semi-supervised inference-efficient approaches that strengthen domain adaptation and lead to improved overall performance in NMT. We demonstrate that SemiAdapt can outperform full-domain fine-tuning, while most notably, SemiLoRA can propel PEFT methods to match or even outperform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Domain Adaptation and Few-Shot Learning
