Beyond Accuracy: Robustness, Interpretability and Expressiveness of EEG Foundation Models
Urban \v{S}irca, Maryam Alimardani, Stefanos Zafeiriou, Konstantinos Barmpas

TL;DR
This paper systematically evaluates EEG foundation models on robustness, interpretability, and expressiveness, revealing strengths and vulnerabilities beyond just accuracy.
Contribution
It introduces comprehensive benchmarks and analyses for EEG-FMs, including robustness tests, interpretability methods, and probing techniques, filling critical gaps in understanding their capabilities.
Findings
No single model dominates all failure modes.
Attention-Aware Layer-Wise Relevance Propagation shows models focus on task-relevant brain regions.
Early blocks in models contain task-related information, and pooling affects performance.
Abstract
EEG foundation models (EEG-FMs) have been evaluated predominantly on clean, in-distribution accuracy, leaving their robustness, interpretability and representational quality largely unexamined. This study addresses these gaps by benchmarking six EEG-FMs against a baseline deep learning model across eight datasets. Beyond clean accuracy, we conduct three layers of analysis: (i) Robustness: we apply test-time perturbations including additive noise, random and region-based channel dropout and region-specific noise injection. Our analyses show that no single model dominates all failure modes. The most noise-robust model is among the most fragile under channel dropout and much of the dropout fragility disappears when channels are removed rather than zero-padded. (ii) Interpretability: we present the first application of Attention-Aware Layer-Wise Relevance Propagation (AttnLRP) to EEG-FMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
