SPEED: Scalable Preprocessing of EEG Data for Self-Supervised Learning
Anders Gj{\o}lbye, Lina Skerath, William Lehn-Schi{\o}ler, Nicolas Langer, Lars Kai Hansen

TL;DR
This paper introduces SPEED, a Python-based EEG preprocessing pipeline optimized for self-supervised learning, which improves data quality and model performance on large-scale EEG datasets.
Contribution
The paper presents a scalable, optimized preprocessing pipeline specifically designed to enhance self-supervised learning on EEG data, addressing limitations of existing methods.
Findings
Preprocessing with SPEED stabilizes SSL training.
Enhanced downstream task performance with SPEED preprocessing.
Efficient handling of large EEG datasets.
Abstract
Electroencephalography (EEG) research typically focuses on tasks with narrowly defined objectives, but recent studies are expanding into the use of unlabeled data within larger models, aiming for a broader range of applications. This addresses a critical challenge in EEG research. For example, Kostas et al. (2021) show that self-supervised learning (SSL) outperforms traditional supervised methods. Given the high noise levels in EEG data, we argue that further improvements are possible with additional preprocessing. Current preprocessing methods often fail to efficiently manage the large data volumes required for SSL, due to their lack of optimization, reliance on subjective manual corrections, and validation processes or inflexible protocols that limit SSL. We propose a Python-based EEG preprocessing pipeline optimized for self-supervised learning, designed to efficiently process…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · EEG and Brain-Computer Interfaces
