Defining and Benchmarking a Data-Centric Design Space for Brain Graph Construction
Qinwen Ge, Roza G. Bayrak, Anwar Said, Catie Chang, Xenofon Koutsoukos, Tyler Derr

TL;DR
This paper systematically explores how data-centric choices in brain graph construction from fMRI data affect downstream neuroimaging classification performance, emphasizing the importance of upstream data decisions over model-centric approaches.
Contribution
It defines and benchmarks a comprehensive data-centric design space for brain graph construction, evaluating the impact of various data processing choices on neuroimaging classification.
Findings
Data-centric configurations improve classification accuracy.
Thoughtful data choices outperform standard pipelines.
Systematic exploration highlights critical upstream decisions.
Abstract
The construction of brain graphs from functional Magnetic Resonance Imaging (fMRI) data plays a crucial role in enabling graph machine learning for neuroimaging. However, current practices often rely on rigid pipelines that overlook critical data-centric choices in how brain graphs are constructed. In this work, we adopt a Data-Centric AI perspective and systematically define and benchmark a data-centric design space for brain graph construction, constrasting with primarily model-centric prior work. We organize this design space into three stages: temporal signal processing, topology extraction, and graph featurization. Our contributions lie less in novel components and more in evaluating how combinations of existing and modified techniques influence downstream performance. Specifically, we study high-amplitude BOLD signal filtering, sparsification and unification strategies for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Advanced Graph Neural Networks · Semantic Web and Ontologies
