SYNAPSE: Synergizing an Adapter and Finetuning for High-Fidelity EEG Synthesis from a CLIP-Aligned Encoder
Jeyoung Lee, Hochul Kang

TL;DR
SYNAPSE is a novel two-stage framework that combines a CLIP-aligned EEG autoencoder with a lightweight adaptation of Stable Diffusion to generate high-fidelity, semantically coherent images from EEG signals, outperforming prior models.
Contribution
It introduces a semantically structured EEG autoencoder aligned with CLIP and a minimal-parameter adaptation of diffusion models for improved EEG-to-image synthesis.
Findings
Achieves state-of-the-art perceptual fidelity on CVPR40 dataset.
Demonstrates effective generalization across subjects.
Preserves visual semantics even with reduced class agreement.
Abstract
Recent progress in diffusion-based generative models has enabled high-quality image synthesis conditioned on diverse modalities. Extending such models to brain signals could deepen our understanding of human perception and mental representations. However,electroencephalography (EEG) presents major challenges for image generation due to high noise, low spatial resolution, and strong inter-subject variability. Existing approaches,such as DreamDiffusion, BrainVis, and GWIT, primarily adapt EEG features to pre-trained Stable Diffusion models using complex alignment or classification pipelines, often resulting in large parameter counts and limited interpretability. We introduce SYNAPSE, a two-stage framework that bridges EEG signal representation learning and high-fidelity image synthesis. In Stage1, a CLIP-aligned EEG autoencoder learns a semantically structured latent representation by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Face Recognition and Perception · Generative Adversarial Networks and Image Synthesis
