Tokenizing Single-Channel EEG with Time-Frequency Motif Learning

Jathurshan Pradeepkumar; Xihao Piao; Zheng Chen; Jimeng Sun

arXiv:2502.16060·cs.LG·May 15, 2026

Tokenizing Single-Channel EEG with Time-Frequency Motif Learning

Jathurshan Pradeepkumar, Xihao Piao, Zheng Chen, Jimeng Sun

PDF

1 Repo 1 Models 1 Video

TL;DR

The paper introduces TFM-Tokenizer, a novel EEG tokenization framework that learns time-frequency motifs to improve foundation model performance across diverse EEG tasks and devices.

Contribution

It proposes a dual-path architecture with time-frequency masking for robust motif learning, supporting various foundation models and operating at single-channel level.

Findings

01

Achieves up to 11% improvement in Cohen's Kappa on EEG benchmarks.

02

Boosts performance of foundation models like BIOT and LaBraM.

03

Outperforms baselines by 14% on ear-EEG sleep staging.

Abstract

Foundation models are reshaping EEG analysis, yet an important problem of EEG tokenization remains a challenge. This paper presents TFM-Tokenizer, a novel tokenization framework that learns a vocabulary of time-frequency motifs from single-channel EEG signals and encodes them into discrete tokens. We propose a dual-path architecture with time-frequency masking to capture robust motif representations, and it is model-agnostic, supporting both lightweight transformers and existing foundation models for downstream tasks. Our study demonstrates three key benefits: Accuracy: Experiments on four diverse EEG benchmarks demonstrate consistent performance gains across both single- and multi-dataset pretraining settings, achieving up to $11%$ improvement in Cohen's Kappa over strong baselines. Generalization: Moreover, as a plug-and-play component, it consistently boosts the performance of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Jathurshan0330/TFM-Tokenizer
github

Models

🤗
Jathurshan/TFM-Tokenizer
model· 18 dl
18 dl

Videos

Tokenizing Single-Channel EEG with Time-Frequency Motif Learning· slideslive