Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach

Salvador Carri\'on; Francisco Casacuberta

arXiv:2512.09910·cs.CL·December 11, 2025

Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach

Salvador Carri\'on, Francisco Casacuberta

PDF

Open Access

TL;DR

This paper introduces a low-rank adaptation framework for neural machine translation that enables efficient continual learning, real-time domain/style adjustments, and mitigates catastrophic forgetting with minimal parameter updates.

Contribution

It proposes a novel low-rank adaptation method with an interactive, user-controllable module and a gradient-based regularization strategy for continual NMT learning.

Findings

01

LoRA-based fine-tuning matches full-parameter performance with fewer parameters.

02

The interactive adaptation method allows real-time, user-controlled domain/style changes.

03

The gradient-based regularization effectively preserves prior knowledge during continual learning.

Abstract

Continual learning in Neural Machine Translation (NMT) faces the dual challenges of catastrophic forgetting and the high computational cost of retraining. This study establishes Low-Rank Adaptation (LoRA) as a parameter-efficient framework to address these challenges in dedicated NMT architectures. We first demonstrate that LoRA-based fine-tuning adapts NMT models to new languages and domains with performance on par with full-parameter techniques, while utilizing only a fraction of the parameter space. Second, we propose an interactive adaptation method using a calibrated linear combination of LoRA modules. This approach functions as a gate-free mixture of experts, enabling real-time, user-controllable adjustments to domain and style without retraining. Finally, to mitigate catastrophic forgetting, we introduce a novel gradient-based regularization strategy specifically designed for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Natural Language Processing Techniques · Topic Modeling