Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting

Jing Xu; Minglin Wu; Xueyuan Chen; Xixin Wu; Helen Meng

arXiv:2602.12746·cs.CL·February 16, 2026

Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting

Jing Xu, Minglin Wu, Xueyuan Chen, Xixin Wu, Helen Meng

PDF

Open Access

TL;DR

Lamer-SSL is a parameter-efficient framework that enables continual multilingual expansion of self-supervised speech models by balancing shared and language-specific representations, while mitigating forgetting through replay strategies.

Contribution

It introduces a layer-aware mixture of LoRA experts combined with replay to effectively extend models to new languages without losing prior knowledge.

Findings

01

Effective multilingual expansion on ASR and LID tasks.

02

Maintains performance on previous languages with minimal parameter updates.

03

Only 2.14% of parameters are trained during adaptation.

Abstract

Despite their impressive performance, self-supervised speech models often struggle to generalize to new languages and tend to forget previously acquired knowledge during continual training. To address this, we propose Lamer-SSL, a parameter-efficient framework that integrates a Layer-Aware MixturE of LoRA Experts (Lamer) module with a replay strategy. The Lamer module enables flexible balancing between shared and language-specific representations, while layer-aware expert allocation assigns more experts to deeper layers where semantic information is richer. Meanwhile, the replay strategy retains prior knowledge using minimal data, mitigating forgetting during continual training. Experiments on automatic speech recognition (ASR) and language identification (LID) demonstrate that Lamer-SSL extends self-supervised models to new languages effectively while maintaining strong performance on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Domain Adaptation and Few-Shot Learning · Topic Modeling