CausalMamba: Scalable Conditional State Space Models for Neural Causal Inference
Sangyoon Bae, Jiook Cha

TL;DR
CausalMamba is a scalable framework that improves neural causal inference from fMRI data by decomposing the problem into BOLD deconvolution and causal graph inference, outperforming existing methods in accuracy and fidelity.
Contribution
It introduces a novel two-stage approach combining BOLD deconvolution and a Conditional Mamba architecture for scalable neural causal inference from fMRI data.
Findings
Achieves 37% higher accuracy than DCM on simulated data.
Recovers neural pathways with 88% fidelity in real fMRI data.
Detects dynamic brain hub reconfiguration during working memory tasks.
Abstract
We introduce CausalMamba, a scalable framework that addresses fundamental limitations in fMRI-based causal inference: the ill-posed nature of inferring neural causality from hemodynamically distorted BOLD signals and the computational intractability of existing methods like Dynamic Causal Modeling (DCM). Our approach decomposes this complex inverse problem into two tractable stages: BOLD deconvolution to recover latent neural activity, followed by causal graph inference using a novel Conditional Mamba architecture. On simulated data, CausalMamba achieves 37% higher accuracy than DCM. Critically, when applied to real task fMRI data, our method recovers well-established neural pathways with 88% fidelity, whereas conventional approaches fail to identify these canonical circuits in over 99% of subjects. Furthermore, our network analysis of working memory data reveals that the brain…
Peer Reviews
Decision·Submitted to ICLR 2026
a nice use of MAMBA
The authors would like to report that we could use MAMBA to do causal inference on fMRI BOLD data. Although this would have been clear, it is nice to see that it works. However, the authors claim that this is a massive progress in the field because it is scalable and works better than the DCM family. This would have been a great method in 2020 (though MAMBA was not there yet), but now we have much better methodologies that do the same thing, and I suspect they will outperform the technique prop
The problem of improving causal inference is well motivated.
I found the paper hard to read, imprecise, and confusing. The methodology lacks a mathematically precise description. The description and the dimensionality of variables is not sufficient to understand what is going on in Figure 1. The figure wastes a lot of white space and could have a more compact representation. From my reading, the dimension variables are never defined in the text. The algorithm for conditional Mamba is very hard to parse as a commented pseudo code. Evidence is limited
1. The two-stage decomposition strategy is well-motivated, breaking down the complex BOLD-to-neural inversion into learnable hemodynamic deconvolution and causal inference, thereby significantly reducing modeling difficulty. 2. The Conditional Mamba architecture effectively integrates global temporal modeling with region-specific modulation, making it suitable for capturing brain network characteristics. 3. Experimental results show that CausalMamba outperforms all compared baselines across a
1. The two-stage decomposition in CausalMamba introduces strong modeling assumptions. In Stage 1, the model assumes a fixed double-gamma HRF form and a simple LFP-like neural event model. While biologically motivated, this may not capture the full diversity of real neural or hemodynamic responses. Since Stage 1 is trained solely on synthetic data, its deconvolution may not generalize well if real HRFs or neural patterns deviate from the assumed forms. 2. In Stage 2, temporal features are averag
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFunctional Brain Connectivity Studies · Explainable Artificial Intelligence (XAI) · Embodied and Extended Cognition
