Accelerating Benchmarking of Functional Connectivity Modeling via Structure-aware Core-set Selection
Ling Zhan, Zhen Li, Junjie Huang, Tao Jia

TL;DR
This paper introduces a novel self-supervised framework, SCLCS, for selecting small, representative core-sets that preserve the performance ranking of functional connectivity models, enabling efficient large-scale benchmarking in neuroscience.
Contribution
It formalizes core-set selection for FC benchmarking and proposes SCLCS, which effectively preserves model rankings using structural stability and diversity strategies.
Findings
SCLCS maintains ground-truth model ranking with only 10% data.
Outperforms state-of-the-art core-set methods by up to 23.2% in ranking consistency.
First formalization of core-set selection for FC operator benchmarking.
Abstract
Benchmarking the hundreds of functional connectivity (FC) modeling methods on large-scale fMRI datasets is critical for reproducible neuroscience. However, the combinatorial explosion of model-data pairings makes exhaustive evaluation computationally prohibitive, preventing such assessments from becoming a routine pre-analysis step. To break this bottleneck, we reframe the challenge of FC benchmarking by selecting a small, representative core-set whose sole purpose is to preserve the relative performance ranking of FC operators. We formalize this as a ranking-preserving subset selection problem and propose Structure-aware Contrastive Learning for Core-set Selection (SCLCS), a self-supervised framework to select these core-sets. SCLCS first uses an adaptive Transformer to learn each sample's unique FC structure. It then introduces a novel Structural Perturbation Score (SPS) to quantify…
Peer Reviews
Decision·ICLR 2026 Poster
While the concept of core-set selection is not new, to my knowledge, this is the first work that is undertaking this problem for ranking functional connectivity performance. This is the main strength of the paper. The issue of core-set selection has been mostly ignored in neuroimaging, mainly because the datasets are generally small. However, as the number of datasets are growing in size, the number of algorithms are also getting contributed and tested, the problem of benchmarking is suddenly
The main weakness in the paper is that the ranking is based on Statistical Pairwise Interactions (SPIs). Although the authors test 130 SPIs, these are functional connectivity estimators and not end to end predictive machine learning models. SPIs are generally used in classical fMRI analyses and not widely used in modern ML based approaches including deep learning. In modern data-driven models (graph neural nets, attention-based architectures), the functional network connectivity structure is l
- **Well-motivated objective and metricization.** Focusing on cross-model *ranking* (not accuracy) matches the real need in SPI selection; nDCG@k is a sensible measure. - **Principled structure-aware selection.** SPS is theoretically grounded (mixture-driven perturbation; stationarity/ergodicity) and operationally tied to the encoder’s attention dynamics; density-balanced sampling addresses the known failure mode of top-k. - **Comprehensive empirical study and practicality.** The s
1. **External validity limited by parcellation and preprocessing choices.** Experiments are confined to 33 DMN ROIs (Dosenbach-160) with global signal regression; it remains unclear whether conclusions (e.g., SPS behavior, density effects, ranking stability) hold for whole-brain parcellations (e.g., Schaefer, AAL), alternative TRs, or non-GSR pipelines. 2. **Ground-truth ranking restricted to *fast* SPIs.** To keep evaluation tractable, the “full-set” ranking is computed on a subset of S
- Raises an unaddressed, important research question in the field of neuroimage analysis (efficient comparison between different SPIs) - Theoretical rigor is well aligned with the research motivation.
- Ultimately, the work could provide further insight to the neuroscience community if SCLCS could be used to derive experimental suggestions on which SPIs are suggested to be representative for constructing the functional connectivity matrices. The experiments had a limited scope on validating that the efficiency and robustness of the core-set selection of SCLCS. - Connectivity patterns constructed from different SPIs can be significantly susceptible to different pre/post-processing of the BOLD
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFunctional Brain Connectivity Studies · Advanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning
