Bilinear Input Modulation for Mamba: Koopman Bilinear Forms for Memory Retention and Multiplicative Computation
Hiroki Fujii, Masaki Yamakita

TL;DR
This paper introduces a factorized bilinear input modulation for Selective State Space Models, enhancing memory retention and bilinear computation by leveraging Koopman bilinear forms and various implementation strategies.
Contribution
It proposes a novel bilinear input modulation method with multiple implementations that improves memory and bilinear computation in SSMs, demonstrating robustness and scalability.
Findings
GM improves memory retention but not bilinear computation.
Seq-BIM and p-BIM enhance both memory and bilinear computation.
Bilinear variants outperform others and benefit from larger state dimensions.
Abstract
Selective State Space Models (SSMs), notably Mamba, employ diagonal state transitions that limit both memory retention and bilinear computational capacity. We propose a factorized bilinear input modulation that augments the SSM with a state-input product, interpretable as a finite-dimensional Koopman bilinear form. After introducing a shared state across channels (Coupled SSM), the modulation admits three implementations. Coupled Bilinear Input Modulation (seq-BIM) retains the full bilinear product on the input side at the cost of sequential computation, Coupled Gated Modulation (GM) linearizes it into a gate modulation that is compatible with the parallel scan, and Parallel Bilinear Input Modulation (p-BIM) places the same bilinear product on the state transition while remaining parallel-scannable. Experiments on a multiple input-delay pendulum (memory retention) and NARMA-10 (bilinear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
