Standing on the Shoulders of Giants: Rethinking EEG Foundation Model Pretraining via Multi-Teacher Distillation
Chenqi Li, Yu Liu, Shuo Zhang, Timothy Denison, Tingting Zhu

TL;DR
This paper introduces a multi-teacher distillation framework that leverages well-established models from other modalities to pretrain EEG foundation models, overcoming data scarcity and noise challenges.
Contribution
It proposes the MTDP framework with a two-stage distillation process, significantly improving EEG model performance with less pretraining data.
Findings
MTDP outperforms self-supervised methods across multiple tasks
Achieves comparable performance with only 25% of pretraining data
Mainstream models from vision and time series transfer well to EEG domain
Abstract
Pretraining for electroencephalogram (EEG) foundation models has predominantly relied on self-supervised masked reconstruction, a paradigm largely adapted from and inspired by the success of vision and language foundation models. However, unlike images and text, EEG datasets are notoriously expensive to collect and characterized by low signal-to-noise ratio. These challenges introduce difficulties in scaling the EEG foundation models and capturing the underlying neural semantics through reconstruction. In this work, we ask the question: can we stand on the shoulders of well-established foundation models from well-represented modalities to bootstrap the pretraining of EEG foundation models? We first demonstrate that mainstream foundation models, such as those from vision and time series, transfer surprisingly well to EEG domain. To this end, we propose the Multi-Teacher Distillation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Advanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices
