BrainMT: A Hybrid Mamba-Transformer Architecture for Modeling Long-Range Dependencies in Functional MRI Data
Arunkumar Kannan, Martin A. Lindquist, and Brian Caffo

TL;DR
BrainMT is a hybrid deep learning framework combining Mamba blocks and transformers to effectively model long-range spatiotemporal dependencies in fMRI data, leading to superior phenotypic prediction performance.
Contribution
This paper introduces BrainMT, a novel hybrid architecture that efficiently captures long-range dependencies in fMRI data using a two-stage process with Mamba and transformer blocks.
Findings
Achieves state-of-the-art results on sex and intelligence prediction tasks.
Outperforms existing methods significantly on large-scale datasets.
Demonstrates effective modeling of complex fMRI relationships.
Abstract
Recent advances in deep learning have made it possible to predict phenotypic measures directly from functional magnetic resonance imaging (fMRI) brain volumes, sparking significant interest in the neuroimaging community. However, existing approaches, primarily based on convolutional neural networks or transformer architectures, often struggle to model the complex relationships inherent in fMRI data, limited by their inability to capture long-range spatial and temporal dependencies. To overcome these shortcomings, we introduce BrainMT, a novel hybrid framework designed to efficiently learn and integrate long-range spatiotemporal attributes in fMRI data. Our framework operates in two stages: (1) a bidirectional Mamba block with a temporal-first scanning mechanism to capture global temporal interactions in a computationally efficient manner; and (2) a transformer block leveraging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFunctional Brain Connectivity Studies · Machine Learning in Healthcare · Face Recognition and Perception
