Dual Stream Graph Transformer Fusion Networks for Enhanced Brain Decoding
Lucas Goene, Siamak Mehrkanoon

TL;DR
This paper introduces a dual stream graph-transformer architecture for classifying task-based MEG data, combining spatial graph attention networks with temporal transformers to improve brain decoding accuracy.
Contribution
The novel DS-GTF architecture integrates spatial graph attention and temporal transformers for MEG classification, with new adjacency initialization methods.
Findings
Improved classification accuracy over existing models
Reduced standard deviation across test subjects
Effective fusion of spatial and temporal features
Abstract
This paper presents the novel Dual Stream Graph-Transformer Fusion (DS-GTF) architecture designed specifically for classifying task-based Magnetoencephalography (MEG) data. In the spatial stream, inputs are initially represented as graphs, which are then passed through graph attention networks (GAT) to extract spatial patterns. Two methods, TopK and Thresholded Adjacency are introduced for initializing the adjacency matrix used in the GAT. In the temporal stream, the Transformer Encoder receives concatenated windowed input MEG data and learns new temporal representations. The learned temporal and spatial representations from both streams are fused before reaching the output layer. Experimental results demonstrate an enhancement in classification performance and a reduction in standard deviation across multiple test subjects compared to other examined models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Label Smoothing · Position-Wise Feed-Forward Layer · Dense Connections · Residual Connection · Dropout · Layer Normalization · Adam · Byte Pair Encoding
