FMamba: Mamba based on Fast-attention for Multivariate Time-series Forecasting
Shusen Ma, Yu Kang, Peng Bai, and Yun-Bo Zhao

TL;DR
FMamba is a novel multivariate time-series forecasting model that combines fast-attention with Mamba, achieving state-of-the-art accuracy with low computational cost by effectively capturing temporal and inter-variable dependencies.
Contribution
This paper introduces FMamba, integrating fast-attention with Mamba to model variable relationships and temporal features efficiently in multivariate time-series forecasting.
Findings
FMamba outperforms existing models on eight public datasets.
FMamba maintains low computational overhead while achieving high accuracy.
The model effectively captures both temporal and inter-variable dependencies.
Abstract
In multivariate time-series forecasting (MTSF), extracting the temporal correlations of the input sequences is crucial. While popular Transformer-based predictive models can perform well, their quadratic computational complexity results in inefficiency and high overhead. The recently emerged Mamba, a selective state space model, has shown promising results in many fields due to its strong temporal feature extraction capabilities and linear computational complexity. However, due to the unilateral nature of Mamba, channel-independent predictive models based on Mamba cannot attend to the relationships among all variables in the manner of Transformer-based models. To address this issue, we combine fast-attention with Mamba to introduce a novel framework named FMamba for MTSF. Technically, we first extract the temporal features of the input variables through an embedding layer, then compute…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting
MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces
