CATVis: Context-Aware Thought Visualization

Tariq Mehmood; Hamza Ahmad; Muhammad Haroon Shakeel; Murtaza Taj

arXiv:2507.11522·cs.CV·July 16, 2025

CATVis: Context-Aware Thought Visualization

Tariq Mehmood, Hamza Ahmad, Muhammad Haroon Shakeel, Murtaza Taj

PDF

Open Access

TL;DR

This paper introduces CATVis, a novel framework that decodes visual representations from EEG signals to generate images, leveraging cross-modal alignment and re-ranking to improve semantic accuracy and image quality.

Contribution

It presents a new 5-stage EEG-to-image generation framework that outperforms existing methods in accuracy and image quality through innovative cross-modal techniques.

Findings

01

Outperforms SOTA in classification accuracy by 13.43%

02

Achieves 15.21% higher generation accuracy

03

Reduces Fréchet Inception Distance by 36.61%

Abstract

EEG-based brain-computer interfaces (BCIs) have shown promise in various applications, such as motor imagery and cognitive state monitoring. However, decoding visual representations from EEG signals remains a significant challenge due to their complex and noisy nature. We thus propose a novel 5-stage framework for decoding visual representations from EEG signals: (1) an EEG encoder for concept classification, (2) cross-modal alignment of EEG and text embeddings in CLIP feature space, (3) caption refinement via re-ranking, (4) weighted interpolation of concept and caption embeddings for richer semantics, and (5) image generation using a pre-trained Stable Diffusion model. We enable context-aware EEG-to-image generation through cross-modal alignment and re-ranking. Experimental results demonstrate that our method generates high-quality images aligned with visual stimuli, outperforming…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Visualization and Analytics · Scientific Computing and Data Management

MethodsContrastive Language-Image Pre-training · Diffusion