Visual Image Reconstruction from Brain Activity via Latent Representation
Yukiyasu Kamitani, Misato Tanaka, and Ken Shirakawa

TL;DR
This paper reviews recent advances in reconstructing visual images from brain activity using deep neural networks, highlighting progress, challenges, and future directions for more accurate and generalizable models.
Contribution
It provides a comprehensive overview of the evolution, current state, and future challenges in neural decoding for visual reconstruction using deep learning techniques.
Findings
Progress in detailed image reconstruction from brain signals
Identification of key challenges like zero-shot generalization
Discussion on ethical considerations and future applications
Abstract
Visual image reconstruction, the decoding of perceptual content from brain activity into images, has advanced significantly with the integration of deep neural networks (DNNs) and generative models. This review traces the field's evolution from early classification approaches to sophisticated reconstructions that capture detailed, subjective visual experiences, emphasizing the roles of hierarchical latent representations, compositional strategies, and modular architectures. Despite notable progress, challenges remain, such as achieving true zero-shot generalization for unseen images and accurately modeling the complex, subjective aspects of perception. We discuss the need for diverse datasets, refined evaluation metrics aligned with human perceptual judgments, and compositional representations that strengthen model robustness and generalizability. Ethical issues, including privacy,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
