EEG-VLM: A Hierarchical Vision-Language Model with Multi-Level Feature Alignment and Visually Enhanced Language-Guided Reasoning for EEG Image-Based Sleep Stage Prediction
Xihe Qiu, Gengchen Ma, Haoyu Wang, Chen Zhan, Xiaoyu Tan, and Shuo Li

TL;DR
This paper introduces EEG-VLM, a hierarchical vision-language model that enhances EEG sleep stage classification by integrating multi-level feature alignment and reasoning, leading to improved accuracy and interpretability.
Contribution
The paper presents a novel hierarchical vision-language framework with multi-level feature alignment and visually enhanced reasoning for EEG sleep stage prediction, addressing limitations of existing models.
Findings
Significantly improved sleep stage classification accuracy.
Enhanced interpretability through logical reasoning steps.
Demonstrated potential for clinical EEG analysis.
Abstract
Sleep stage classification based on electroencephalography (EEG) is fundamental for assessing sleep quality and diagnosing sleep-related disorders. However, most traditional machine learning methods rely heavily on prior knowledge and handcrafted features, while existing deep learning models still struggle to jointly capture fine-grained time-frequency patterns and achieve clinical interpretability. Recently, vision-language models (VLMs) have made significant progress in the medical domain, yet their performance remains constrained when applied to physiological waveform data, especially EEG signals, due to their limited visual understanding and insufficient reasoning capability. To address these challenges, we propose EEG-VLM, a hierarchical vision-language framework that integrates multi-level feature alignment with visually enhanced language-guided reasoning for interpretable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Sleep and Wakefulness Research · Sleep and related disorders
