Zipper-LoRA: Dynamic Parameter Decoupling for Speech-LLM based Multilingual Speech Recognition
Yuxiang Mei, Delai Qiu, Shengping Liu, Jiaen Liang, Yanhua Long

TL;DR
Zipper-LoRA introduces a dynamic, fine-grained parameter decoupling method for multilingual speech recognition, effectively balancing shared knowledge and language-specific adaptation to improve low-resource performance.
Contribution
It proposes Zipper-LoRA, a novel rank-level decoupling framework with dynamic control, and a two-stage training strategy to enhance multilingual speech recognition under data imbalance.
Findings
Outperforms fully shared and independent baselines in low-resource settings
Robust across chunked and non-chunked encoder configurations
Accelerates convergence with Initial-B warm start
Abstract
Speech Large Language Models (Speech-LLMs) have emerged as a powerful approach for automatic speech recognition (ASR) by aligning speech encoders with large language models. However, adapting these systems to multilingual settings with imbalanced data distributions remains challenging. In such scenarios, a stability-plasticity dilemma often arises: fully shared Parameter-Efficient Fine-Tuning (PEFT) can cause negative inter-lingual interference for under-represented languages, while fully language-specific tuning limits the beneficial cross-lingual knowledge transfer needed for low-resource tasks. To address this, we propose Zipper-LoRA, a novel rank-level decoupling framework with three variants (Static, Hard, and Soft) that dynamically synthesizes LoRA updates from shared and language-specific subspaces. By using a lightweight language-conditioned router, Zipper-LoRA dynamically…
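The abstract is cut off before it gives the exact formulation, so the following is only a rough sketch of what rank-level mixing of shared and language-specific LoRA components could look like. Everything in it is an assumption made for illustration: the function names (`gates`, `zipper_delta`), the per-language logit table `router_logits`, the language codes, the convex per-rank mix, and the reading of Static as a fixed mask, Hard as a thresholded gate, and Soft as a sigmoid gate. It is not the authors' implementation.

```python
import math

def outer(u, v):
    """Rank-1 matrix u v^T as a nested list."""
    return [[ui * vj for vj in v] for ui in u]

def add_scaled(acc, m, s):
    """In-place acc += s * m for equally shaped nested lists."""
    for i, row in enumerate(m):
        for j, x in enumerate(row):
            acc[i][j] += s * x

def gates(logits, mode):
    """Per-rank gate in [0, 1]; 1 selects the shared component (assumed convention)."""
    soft = [1.0 / (1.0 + math.exp(-z)) for z in logits]
    if mode == "soft":
        return soft
    if mode == "hard":
        return [1.0 if g >= 0.5 else 0.0 for g in soft]
    raise ValueError(f"unknown mode: {mode}")

def zipper_delta(shared, specific, g):
    """Mix rank-1 LoRA components per rank.

    shared, specific: one (b_column, a_row) pair per rank.
    g: one gate value per rank; the update for rank i is
       g[i] * shared_i + (1 - g[i]) * specific_i  (hypothetical form).
    """
    d_out, d_in = len(shared[0][0]), len(shared[0][1])
    delta = [[0.0] * d_in for _ in range(d_out)]
    for (b_s, a_s), (b_l, a_l), gi in zip(shared, specific, g):
        add_scaled(delta, outer(b_s, a_s), gi)
        add_scaled(delta, outer(b_l, a_l), 1.0 - gi)
    return delta

# Toy rank-2 adapters for a 2x2 weight matrix (made-up numbers).
shared = [([1.0, 0.0], [1.0, 1.0]), ([0.0, 1.0], [1.0, -1.0])]
specific = [([2.0, 0.0], [1.0, 0.0]), ([0.0, 2.0], [0.0, 1.0])]

# Hypothetical router output: one logit per rank for each language.
router_logits = {"en": [4.0, -4.0], "sw": [-4.0, -4.0]}

delta_hard = zipper_delta(shared, specific, gates(router_logits["en"], "hard"))
delta_soft = zipper_delta(shared, specific, gates(router_logits["sw"], "soft"))
delta_static = zipper_delta(shared, specific, [1.0, 1.0])  # fixed mask: all shared
```

In this toy setup the "en" router keeps rank 0 shared and makes rank 1 language-specific, while the "sw" logits push both ranks toward the language-specific side. A fully shared adapter and a fully independent adapter fall out as the two extremes of the gate vector, which is the trade-off the abstract describes.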
Taxonomy
Topics: Speech Recognition and Synthesis · Speech and Audio Processing · Natural Language Processing Techniques
