Interpretable EEG-to-Image Generation with Semantic Prompts

Arshak Rezvani; Ali Akbari; Kosar Sanjar Arani; Maryam Mirian; Emad Arasteh; Martin J. McKeown

arXiv:2507.07157·cs.CV·July 11, 2025

Interpretable EEG-to-Image Generation with Semantic Prompts

Arshak Rezvani, Ali Akbari, Kosar Sanjar Arani, Maryam Mirian, Emad Arasteh, Martin J. McKeown

PDF

Open Access

TL;DR

This paper introduces a novel EEG-to-image decoding framework that uses semantic prompts from language models to improve interpretability and accuracy in visual reconstruction from brain signals.

Contribution

The study presents a new method that aligns EEG signals with semantic captions to generate images, bypassing direct EEG-to-image mapping and enhancing interpretability.

Findings

01

Achieved state-of-the-art visual decoding accuracy on EEG data.

02

Revealed semantic topography across the scalp via saliency maps.

03

Demonstrated alignment with neurocognitive pathways through interpretability analyses.

Abstract

Decoding visual experience from brain signals offers exciting possibilities for neuroscience and interpretable AI. While EEG is accessible and temporally precise, its limitations in spatial detail hinder image reconstruction. Our model bypasses direct EEG-to-image generation by aligning EEG signals with multilevel semantic captions -- ranging from object-level to abstract themes -- generated by a large language model. A transformer-based EEG encoder maps brain activity to these captions through contrastive learning. During inference, caption embeddings retrieved via projection heads condition a pretrained latent diffusion model for image generation. This text-mediated framework yields state-of-the-art visual decoding on the EEGCVPR dataset, with interpretable alignment to known neurocognitive pathways. Dominant EEG-caption associations reflected the importance of different semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · EEG and Brain-Computer Interfaces · Face Recognition and Perception