SPEED: Scalable Preprocessing of EEG Data for Self-Supervised Learning

Anders Gj{\o}lbye; Lina Skerath; William Lehn-Schi{\o}ler; Nicolas Langer; Lars Kai Hansen

arXiv:2408.08065·eess.SP·May 26, 2025

SPEED: Scalable Preprocessing of EEG Data for Self-Supervised Learning

Anders Gj{\o}lbye, Lina Skerath, William Lehn-Schi{\o}ler, Nicolas Langer, Lars Kai Hansen

PDF

Open Access 1 Repo

TL;DR

This paper introduces SPEED, a Python-based EEG preprocessing pipeline optimized for self-supervised learning, which improves data quality and model performance on large-scale EEG datasets.

Contribution

The paper presents a scalable, optimized preprocessing pipeline specifically designed to enhance self-supervised learning on EEG data, addressing limitations of existing methods.

Findings

01

Preprocessing with SPEED stabilizes SSL training.

02

Enhanced downstream task performance with SPEED preprocessing.

03

Efficient handling of large EEG datasets.

Abstract

Electroencephalography (EEG) research typically focuses on tasks with narrowly defined objectives, but recent studies are expanding into the use of unlabeled data within larger models, aiming for a broader range of applications. This addresses a critical challenge in EEG research. For example, Kostas et al. (2021) show that self-supervised learning (SSL) outperforms traditional supervised methods. Given the high noise levels in EEG data, we argue that further improvements are possible with additional preprocessing. Current preprocessing methods often fail to efficiently manage the large data volumes required for SSL, due to their lack of optimization, reliance on subjective manual corrections, and validation processes or inflexible protocols that limit SSL. We propose a Python-based EEG preprocessing pipeline optimized for self-supervised learning, designed to efficiently process…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andersgmadsen/speed
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · EEG and Brain-Computer Interfaces