Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach
Salvador Carri\'on, Francisco Casacuberta

TL;DR
This paper introduces a low-rank adaptation framework for neural machine translation that enables efficient continual learning, real-time domain/style adjustments, and mitigates catastrophic forgetting with minimal parameter updates.
Contribution
It proposes a novel low-rank adaptation method with an interactive, user-controllable module and a gradient-based regularization strategy for continual NMT learning.
Findings
LoRA-based fine-tuning matches full-parameter performance with fewer parameters.
The interactive adaptation method allows real-time, user-controlled domain/style changes.
The gradient-based regularization effectively preserves prior knowledge during continual learning.
Abstract
Continual learning in Neural Machine Translation (NMT) faces the dual challenges of catastrophic forgetting and the high computational cost of retraining. This study establishes Low-Rank Adaptation (LoRA) as a parameter-efficient framework to address these challenges in dedicated NMT architectures. We first demonstrate that LoRA-based fine-tuning adapts NMT models to new languages and domains with performance on par with full-parameter techniques, while utilizing only a fraction of the parameter space. Second, we propose an interactive adaptation method using a calibrated linear combination of LoRA modules. This approach functions as a gate-free mixture of experts, enabling real-time, user-controllable adjustments to domain and style without retraining. Finally, to mitigate catastrophic forgetting, we introduce a novel gradient-based regularization strategy specifically designed for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Natural Language Processing Techniques · Topic Modeling
