Concept-based explainability for an EEG transformer model
Anders Gj{\o}lbye, William Lehn-Schi{\o}ler, \'Ashildur J\'onsd\'ottir, Bergd\'is Arnard\'ottir, Lars Kai Hansen

TL;DR
This paper explores concept-based explainability for EEG transformer models by adapting Concept Activation Vectors, introducing novel anatomically defined concepts, and demonstrating their effectiveness in understanding EEG model representations.
Contribution
It applies concept-based explainability to EEG data, introducing anatomically defined concepts and evaluating their utility in understanding transformer models.
Findings
Both concept formation methods provide valuable insights.
Anatomically defined concepts are effective for EEG explainability.
Externally labeled datasets aid in grounding concepts.
Abstract
Deep learning models are complex due to their size, structure, and inherent randomness in training procedures. Additional complexity arises from the selection of datasets and inductive biases. Addressing these challenges for explainability, Kim et al. (2018) introduced Concept Activation Vectors (CAVs), which aim to understand deep models' internal states in terms of human-aligned concepts. These concepts correspond to directions in latent space, identified using linear discriminants. Although this method was first applied to image classification, it was later adapted to other domains, including natural language processing. In this work, we attempt to apply the method to electroencephalogram (EEG) data for explainability in Kostas et al.'s BENDR (2021), a large-scale transformer model. A crucial part of this endeavor involves defining the explanatory concepts and selecting relevant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Materials Science · EEG and Brain-Computer Interfaces
MethodsFocus
