Parameter-Efficient Fine-Tuning of State Space Models

Kevin Galim; Wonjun Kang; Yuchen Zeng; Hyung Il Koo; Kangwook Lee

arXiv:2410.09016·cs.LG·June 10, 2025

Parameter-Efficient Fine-Tuning of State Space Models

Kevin Galim, Wonjun Kang, Yuchen Zeng, Hyung Il Koo, Kangwook Lee

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates parameter-efficient fine-tuning methods for deep State Space Models, finds limitations of existing methods, and proposes a new tailored approach called Sparse Dimension Tuning (SDT) that improves performance.

Contribution

It introduces Sparse Dimension Tuning (SDT), a novel PEFT method specifically designed for SSM modules, enhancing fine-tuning efficiency and effectiveness.

Findings

01

LoRA outperforms other PEFT methods on SSMs

02

LoRA fails on SSM modules but still surpasses alternatives

03

SDT combined with LoRA achieves state-of-the-art results

Abstract

Deep State Space Models (SSMs), such as Mamba (Gu & Dao, 2024), have become powerful tools for language modeling, offering high performance and linear scalability with sequence length. However, the application of parameter-efficient fine-tuning (PEFT) methods to SSM-based models remains largely underexplored. We start by investigating two fundamental questions on existing PEFT methods: (i) How do they perform on SSM-based models? (ii) Which parameters should they target for optimal results? Our analysis shows that LoRA and its variants consistently outperform all other PEFT methods. While LoRA is effective for linear projection matrices, it fails on SSM modules-yet still outperforms other methods applicable to SSMs, indicating their limitations. This underscores the need for a specialized SSM tuning approach. To address this, we propose Sparse Dimension Tuning (SDT), a PEFT method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

furiosa-ai/ssm-peft
pytorchOfficial

Videos

Parameter-Efficient Fine-Tuning of State Space Models· slideslive

Taxonomy

TopicsNeural Networks and Applications · Control Systems and Identification · Fault Detection and Control Systems

MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces