Bayesian Low-Rank Factorization for Robust Model Adaptation

Enes Yavuz Ugan; Ngoc-Quan Pham; Alexander Waibel

arXiv:2510.18723·cs.CL·October 22, 2025

Bayesian Low-Rank Factorization for Robust Model Adaptation

Enes Yavuz Ugan, Ngoc-Quan Pham, Alexander Waibel

PDF

Open Access

TL;DR

This paper introduces Bayesian factorized adapters for speech models, enabling effective domain adaptation like code-switching with minimal loss of general performance and reduced catastrophic forgetting.

Contribution

It proposes a Bayesian approach to low-rank adaptation matrices that improves domain adaptation in speech models while preserving their broad capabilities.

Findings

01

Achieves 54% backward gain over LoRA

02

Only 4% performance drop on new domain

03

Minimal adaptation loss and reduced forgetting

Abstract

Large speech foundation models achieve strong performance across many domains, but they often require adaptation to handle local needs such as code-switching, where speakers mix languages within the same utterance. Direct fine-tuning of these models risks overfitting to the target domain and overwriting the broad capabilities of the base model. To address this challenge, we explore Bayesian factorized adapters for speech foundation models, which place priors near zero to achieve sparser adaptation matrices and thereby retain general performance while adapting to specific domains. We apply our approach to the Whisper model and evaluate on different multilingual code-switching scenarios. Our results show only minimal adaptation loss while significantly reducing catastrophic forgetting of the base model. Compared to LoRA, our method achieves a backward gain of 54% with only a 4% drop on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Domain Adaptation and Few-Shot Learning · Natural Language Processing Techniques