EEG-FM-Bench: A Comprehensive Benchmark for the Systematic Evaluation of EEG Foundation Models
Wei Xiong, Jiangtong Li, Jie Li, Kun Zhu, Changjun Jiang

TL;DR
EEG-FM-Bench provides a standardized, comprehensive evaluation framework for EEG foundation models, enabling fair comparison, reproducibility, and deeper understanding of model behaviors across diverse datasets and experimental settings.
Contribution
It introduces a unified benchmark with diverse datasets and analysis tools, addressing evaluation inconsistencies and revealing key insights into EEG model scaling and training.
Findings
Multi-task learning reduces overfitting in data-scarce scenarios.
Gradient conflicts limit pre-training efficiency.
Compact, domain-specific models outperform larger models.
Abstract
Electroencephalography foundation models (EEG-FMs) have advanced brain signal analysis, but the lack of standardized evaluation benchmarks impedes model comparison and scientific progress. Current evaluations rely on inconsistent protocols that render cross-model comparisons unreliable, while a lack of diagnostic analyses obscures the internal mechanisms driving transfer efficiency and scaling behaviors. To address this, we introduce \textbf{EEG-FM-Bench}, a unified system for the standardized evaluation of EEG-FMs. The benchmark integrates 14 datasets across 10 paradigms and incorporates diverse experimental settings, including multiple fine-tuning strategies, task organizations, and classifier configurations, supported by tools for gradient and representation analysis. Our experiments and analysis reveal several critical insights: (1) multi-task learning acts as a critical regularizer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
