Visual Image Reconstruction from Brain Activity via Latent Representation

Yukiyasu Kamitani; Misato Tanaka; and Ken Shirakawa

arXiv:2505.08429·cs.CV·June 23, 2025

Visual Image Reconstruction from Brain Activity via Latent Representation

Yukiyasu Kamitani, Misato Tanaka, and Ken Shirakawa

PDF

TL;DR

This paper reviews recent advances in reconstructing visual images from brain activity using deep neural networks, highlighting progress, challenges, and future directions for more accurate and generalizable models.

Contribution

It provides a comprehensive overview of the evolution, current state, and future challenges in neural decoding for visual reconstruction using deep learning techniques.

Findings

01

Progress in detailed image reconstruction from brain signals

02

Identification of key challenges like zero-shot generalization

03

Discussion on ethical considerations and future applications

Abstract

Visual image reconstruction, the decoding of perceptual content from brain activity into images, has advanced significantly with the integration of deep neural networks (DNNs) and generative models. This review traces the field's evolution from early classification approaches to sophisticated reconstructions that capture detailed, subjective visual experiences, emphasizing the roles of hierarchical latent representations, compositional strategies, and modular architectures. Despite notable progress, challenges remain, such as achieving true zero-shot generalization for unseen images and accurately modeling the complex, subjective aspects of perception. We discuss the need for diverse datasets, refined evaluation metrics aligned with human perceptual judgments, and compositional representations that strengthen model robustness and generalizability. Ethical issues, including privacy,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.