Sequential Order-Robust Mamba for Time Series Forecasting
Seunghan Lee, Juri Hong, Kibok Lee, Taeyoung Park

TL;DR
The paper introduces SOR-Mamba, a novel time series forecasting method that improves robustness to channel order variations and enhances channel dependency modeling by combining regularization and correlation preservation techniques.
Contribution
It proposes SOR-Mamba, which removes order bias and incorporates channel correlation modeling, advancing time series forecasting with improved robustness and dependency capture.
Findings
Enhanced robustness to channel order variations
Effective preservation of channel correlations
Improved forecasting accuracy in experiments
Abstract
Mamba has recently emerged as a promising alternative to Transformers, offering near-linear complexity in processing sequential data. However, while channels in time series (TS) data have no specific order in general, recent studies have adopted Mamba to capture channel dependencies (CD) in TS, introducing a sequential order bias. To address this issue, we propose SOR-Mamba, a TS forecasting method that 1) incorporates a regularization strategy to minimize the discrepancy between two embedding vectors generated from data with reversed channel orders, thereby enhancing robustness to channel order, and 2) eliminates the 1D-convolution originally designed to capture local information in sequential data. Furthermore, we introduce channel correlation modeling (CCM), a pretraining task aimed at preserving correlations between channels from the data space to the latent space in order to…
Peer Reviews
Decision·Submitted to ICLR 2025
* This paper uses a single unidirectional Mamba with regularization and non-1D convolutions to capture channel correlations. By modifying the traditional Mamba module, it can better address the issue of sequential order bias. The research problem is clearly defined, and the solution is reasonable. And the pretraining task channel correlation modeling also enhances the model's generalization and performance. * The details of the model are mostly clear, with each module explained and supported by
* The proposed method is mostly constructed on existing models -- reverse modeling has been widely used in literature and removing the 1d-conv is trival. * The experimental results show that the model's performance improvement across various datasets is not significant, mostly in the thousandth. * The removal of 1D-conv negatively impacts PEMS dataset in tale 7 and figure 5. The necessity to remove 1D-conv is uncertain. * The model uses the Mamba backbone, but the baseline only includes a single
1. Clear problem identification regarding Mamba's limitations in handling unordered channels 2. Comprehensive experiments 3. Improved efficiency compared to other approaches
**Limited Technical Novelty**: - The regularization strategy is overly simplistic that minimizing the distance between embeddings from different channel orders - The removal of 1D-conv lacks theoretical justification and appears to be an ad-hoc solution, As shown in Figure 5 of the paper, its removal may negatively impact datasets with ordered channels such as PEMS datasets
The proposed method addresses the sequential order bias in channel dependencies (CD). This bias can degrade performance when channels lack inherent order in multi-channel time series data. The SOR-Mamba method mitigates this bias by introducing a regularization technique that minimizes the discrepancy between embeddings generated from reversed channel orders, thus improving robustness. Additionally, the paper proposes Channel Correlation Modeling (CCM) as a pretraining task, which preserves ch
The paper’s approach is relatively complex, involving several architectural changes (e.g., removing 1D-convolutions, applying specific regularization, and adding CCM pretraining) that may complicate replication and limit accessibility. While it emphasizes efficiency improvements, the paper could provide more detailed explanations regarding the trade-offs involved in removing the 1D-convolution, especially on datasets where channel order may have some inherent structure (e.g., traffic data). Th
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Advanced Control Systems Optimization
MethodsSpatio-temporal stability analysis · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
