EEG-Language Pretraining for Highly Label-Efficient Clinical Phenotyping

Sam Gijsen; Kerstin Ritter

arXiv:2409.07480·eess.SP·August 12, 2025·2 cites

EEG-Language Pretraining for Highly Label-Efficient Clinical Phenotyping

Sam Gijsen, Kerstin Ritter

PDF

Open Access

TL;DR

This paper introduces EEG-language models trained on clinical reports and EEG data, achieving significant improvements in clinical phenotyping tasks and enabling zero-shot classification and retrieval of neural signals and reports.

Contribution

It pioneers multimodal EEG-language modeling for clinical phenotyping, combining alignment, cropping, segmentation, and multiple instance learning to handle data misalignment.

Findings

01

Significant improvement over EEG-only models in four clinical evaluations.

02

First demonstration of zero-shot classification and retrieval in this domain.

03

Potential for advancing clinical applications with multimodal EEG-language models.

Abstract

Multimodal language modeling has enabled breakthroughs for representation learning, yet remains unexplored in the realm of functional brain data for clinical phenotyping. This paper pioneers EEG-language models (ELMs) trained on clinical reports and 15000 EEGs. We propose to combine multimodal alignment in this novel domain with timeseries cropping and text segmentation, enabling an extension based on multiple instance learning to alleviate misalignment between irrelevant EEG or text segments. Our multimodal models significantly improve over EEG-only models across four clinical evaluations and for the first time enable zero-shot classification as well as retrieval of both neural signals and reports. In sum, these results highlight the potential of ELMs, representing significant progress for clinical applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces · Brain Tumor Detection and Classification

MethodsContrastive Learning