Concept-based explainability for an EEG transformer model

Anders Gj{\o}lbye; William Lehn-Schi{\o}ler; \'Ashildur J\'onsd\'ottir; Bergd\'is Arnard\'ottir; Lars Kai Hansen

arXiv:2307.12745·cs.LG·May 26, 2025·1 cites

Concept-based explainability for an EEG transformer model

Anders Gj{\o}lbye, William Lehn-Schi{\o}ler, \'Ashildur J\'onsd\'ottir, Bergd\'is Arnard\'ottir, Lars Kai Hansen

PDF

Open Access 1 Repo

TL;DR

This paper explores concept-based explainability for EEG transformer models by adapting Concept Activation Vectors, introducing novel anatomically defined concepts, and demonstrating their effectiveness in understanding EEG model representations.

Contribution

It applies concept-based explainability to EEG data, introducing anatomically defined concepts and evaluating their utility in understanding transformer models.

Findings

01

Both concept formation methods provide valuable insights.

02

Anatomically defined concepts are effective for EEG explainability.

03

Externally labeled datasets aid in grounding concepts.

Abstract

Deep learning models are complex due to their size, structure, and inherent randomness in training procedures. Additional complexity arises from the selection of datasets and inductive biases. Addressing these challenges for explainability, Kim et al. (2018) introduced Concept Activation Vectors (CAVs), which aim to understand deep models' internal states in terms of human-aligned concepts. These concepts correspond to directions in latent space, identified using linear discriminants. Although this method was first applied to image classification, it was later adapted to other domains, including natural language processing. In this work, we attempt to apply the method to electroencephalogram (EEG) data for explainability in Kostas et al.'s BENDR (2021), a large-scale transformer model. A crucial part of this endeavor involves defining the explanatory concepts and selecting relevant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andersgmadsen/tcav-bendr
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Materials Science · EEG and Brain-Computer Interfaces

MethodsFocus