Beyond Accuracy: Robustness, Interpretability and Expressiveness of EEG Foundation Models

Urban \v{S}irca; Maryam Alimardani; Stefanos Zafeiriou; Konstantinos Barmpas

arXiv:2605.17562·cs.LG·May 19, 2026

Beyond Accuracy: Robustness, Interpretability and Expressiveness of EEG Foundation Models

Urban \v{S}irca, Maryam Alimardani, Stefanos Zafeiriou, Konstantinos Barmpas

PDF

TL;DR

This paper systematically evaluates EEG foundation models on robustness, interpretability, and expressiveness, revealing strengths and vulnerabilities beyond just accuracy.

Contribution

It introduces comprehensive benchmarks and analyses for EEG-FMs, including robustness tests, interpretability methods, and probing techniques, filling critical gaps in understanding their capabilities.

Findings

01

No single model dominates all failure modes.

02

Attention-Aware Layer-Wise Relevance Propagation shows models focus on task-relevant brain regions.

03

Early blocks in models contain task-related information, and pooling affects performance.

Abstract

EEG foundation models (EEG-FMs) have been evaluated predominantly on clean, in-distribution accuracy, leaving their robustness, interpretability and representational quality largely unexamined. This study addresses these gaps by benchmarking six EEG-FMs against a baseline deep learning model across eight datasets. Beyond clean accuracy, we conduct three layers of analysis: (i) Robustness: we apply test-time perturbations including additive noise, random and region-based channel dropout and region-specific noise injection. Our analyses show that no single model dominates all failure modes. The most noise-robust model is among the most fragile under channel dropout and much of the dropout fragility disappears when channels are removed rather than zero-padded. (ii) Interpretability: we present the first application of Attention-Aware Layer-Wise Relevance Propagation (AttnLRP) to EEG-FMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.