MambaCSP: Hybrid-Attention State Space Models for Hardware-Efficient Channel State Prediction
Aladin Djuhera, Haris Gacanin, Holger Boche

TL;DR
MambaCSP introduces a hybrid-attention state space model for efficient and accurate channel state prediction, outperforming LLM-based methods in speed and resource usage for wireless networks.
Contribution
The paper presents MambaCSP, a novel hybrid-attention SSM architecture that enhances CSI prediction efficiency and accuracy over traditional LLM-based models.
Findings
MambaCSP improves prediction accuracy by 9-12% over LLM-based approaches.
It achieves up to 3.0x higher throughput and 2.9x faster inference.
It reduces VRAM usage by 2.6x in simulations.
Abstract
Recent works have demonstrated that attention-based transformer and large language model (LLM) architectures can achieve strong channel state prediction (CSP) performance by capturing long-range temporal dependencies across channel state information (CSI) sequences. However, these models suffer from quadratic scaling in sequence length, leading to substantial computational cost, memory consumption, and inference latency, which limits their applicability in real-time and resource-constrained wireless deployments. In this paper, we investigate whether selective state space models (SSMs) can serve as a hardware-efficient alternative for CSI prediction. We propose MambaCSP, a hybrid-attention SSM architecture that replaces LLM-based prediction backbones with a linear-time Mamba model. To overcome the local-only dependencies of pure SSMs, we introduce lightweight patch-mixer attention layers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
