Biologically Plausible Brain Graph Transformer
Ciyuan Peng, Yuelong Huang, Qichao Dong, Shuo Yu, Feng Xia, Chengqi, Zhang, Yaochu Jin

TL;DR
This paper introduces BioBGT, a novel brain graph transformer that encodes the brain's small-world architecture and functional modules, improving biological plausibility and performance in brain analysis tasks.
Contribution
The paper proposes a biologically plausible brain graph transformer with network entanglement-based node importance encoding and functional module-aware self-attention.
Findings
BioBGT outperforms existing models on benchmark datasets.
It better captures the small-world architecture of brain graphs.
Enhances brain graph representations for analytical tasks.
Abstract
State-of-the-art brain graph analysis methods fail to fully encode the small-world architecture of brain graphs (accompanied by the presence of hubs and functional modules), and therefore lack biological plausibility to some extent. This limitation hinders their ability to accurately represent the brain's structural and functional properties, thereby restricting the effectiveness of machine learning models in tasks such as brain disorder detection. In this work, we propose a novel Biologically Plausible Brain Graph Transformer (BioBGT) that encodes the small-world architecture inherent in brain graphs. Specifically, we present a network entanglement-based node importance encoding technique that captures the structural importance of nodes in global information propagation during brain graph communication, highlighting the biological properties of the brain structure. Furthermore, we…
Peer Reviews
Decision·ICLR 2025 Poster
- As emphasized by the authors, their method highlights the biological properties of the brain structure via biological plausibility. - Overall, the presentations are clear and easy to understand their framework and results. - The experiments are well-structured with multiple brain datasets, and the proposed model demonstrates strong performance in most cases.
- The authors propose three main methods: network entanglement-based node importance encoding to reflect the different importance of each node like the hub node, community contrastive strategy-based functional module extractor and an updated self-attention mechanism for functional module-aware self-attention mechanism. However, I couldn’t find much difference from the original works of each methods the authors mentioned they were inspired by. On top of that, the equation 6 and its proof in the a
- Motivation to incorporate prior knowledge of the brain into the framework - The proposed method shows clear improvement over the presented baselines
- The authors’ emphasis on biological plausibility is weakly supported both theoretically and experimentally. - The baseline model performance in the experiments is generally lower than expected, so verification may be necessary.
I found this paper compelling and I believe it has strong potential for ICLR. The writing is clear and well-organized, making it highly readable. The paper’s originality lies in its combination of concepts from another field, applied thoughtfully to the fMRI domain, with evidence that this approach yields positive outcomes. In this sense, the results in tables 1 and 2 are quite impressive, while those in Table 6, though somewhat less robust, still demonstrate solid performance. The authors leve
1. No experiment is conducted with a smaller parcellation. The results using the ADNI dataset - where a smaller parcellation was used - are not as strong, which in my opinion are an indication that a smaller number of ROIs could impact the performance of this method. Given the focus on community structure and node importance of this work, I think it would have been important to understand whether the size of the parcellations are an important factor in this models performance. 2. Even though the
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeuroscience and Neural Engineering · Photoreceptor and optogenetics research · Molecular Communication and Nanonetworks
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam · Softmax · Absolute Position Encodings · Laplacian EigenMap · Dropout · Laplacian Positional Encodings
