FCN-LLM: Empower LLM for Brain Functional Connectivity Network Understanding via Graph-level Multi-task Instruction Tuning
Xingcan Hu, Wei Wang, Li Xiao

TL;DR
This paper introduces FCN-LLM, a novel framework that enables large language models to understand brain functional connectivity networks through multi-task instruction tuning, improving zero-shot generalization in neuroscience applications.
Contribution
The paper presents a multi-scale FCN encoder and multi-paradigm instruction tasks to align FCNs with LLMs, a new approach for integrating brain networks with language models.
Findings
FCN-LLM outperforms traditional models on unseen datasets
Achieves strong zero-shot generalization in brain network understanding
Demonstrates flexible integration of brain data with LLMs
Abstract
Large Language Models have achieved remarkable success in language understanding and reasoning, and their multimodal extensions enable comprehension of images, video, and audio. Inspired by this, foundation models for brain functional connectivity networks derived from resting-state fMRI have shown promise in clinical tasks. However, existing methods do not align FCNs with the text modality, limiting the ability of LLMs to directly understand FCNs. To address this, we propose FCN-LLM, a framework that enables LLMs to understand FCNs through graph-level, multi-task instruction tuning. Our approach employs a multi-scale FCN encoder capturing brain-region, functional subnetwork, and whole-brain features, projecting them into the semantic space of LLM. We design multi-paradigm instruction tasks covering 19 subject-specific attributes across demographics, phenotypes, and psychiatric…
Peer Reviews
Decision·Submitted to ICLR 2026
- **Novel cross-modal alignment of brain networks and LLMs.** The paper is the first to directly align FCN representations with the language modality, enabling semantic reasoning and text-based interaction with brain network data—an original and conceptually appealing contribution. - **Hierarchical multi-scale encoder design.** The proposed ROI / subnetwork / global-level architecture effectively captures both fine-grained and global connectivity structures, improving representational r
1. **Lack of ablation on pretraining stage** Stage 1 relies on time-window–specific FCNs for data augmentation, yet no analysis is provided on how the window length \(L\) or stride \(P\) affect alignment or downstream performance. Since fMRI datasets differ in temporal resolution (TR), fixing \(L{=}100\) may yield inconsistent temporal coverage and introduce noise. A sensitivity study on \(L\) and \(P\) would clarify whether the augmentation benefits outweigh potential degradation of repr
1. The paper proposes to bridge fMRI-based brain FCNs with large language models through graph-level instruction tuning. The FCN-LLM introduces a text-aligned multimodal interface, enabling LLMs to reason about neural connectivity in a semantically grounded way. 2. The author uses the multi-scale FCN encoder to jointly represent region-level, subnetwork-level, and global-level connectivity patterns. This hierarchical formulation mirrors neuroscientific organization principles and leads to more
1. The framework relies heavily on functional connectivity graphs derived from fMRI, which are known to be noisy, non-stationary, and highly sensitive to preprocessing choices. The paper does not sufficiently address how this inherent variability might affect graph quality and, consequently, the overall performance of FCN-LLM. Without explicit denoising, robustness checks, or uncertainty modeling, it is difficult to determine whether the observed improvements stem from the proposed graph–languag
• The proposed FCN-LLM framework is conceptually simple yet effective, providing a clear and interpretable approach for integrating FCN representations into LLMs. • The authors have collected a large-scale, multi-site FCN dataset for alignment learning, which is valuable for advancing domain-specific representations in neuroimaging.
• The evaluation protocol for disease classification lacks rigor. Specifically, the definition of “healthy controls” may not be consistent across different disorders. For example, healthy controls used for ADHD may not serve as valid controls for OCD or schizophrenia. Therefore, FCN-LLM should be evaluated on each dataset separately rather than combining them. • For the ABIDE binary classification task, the reported zero-shot performance of FCN-LLM is close to random guessing, which makes the r
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFunctional Brain Connectivity Studies · EEG and Brain-Computer Interfaces · Machine Learning in Healthcare
