Describing Semantic Representations of Brain Activity Evoked by Visual Stimuli
Eri Matsuo, Ichiro Kobayashi, Shinji Nishimoto, Satoshi Nishida,, Hideki Asoh

TL;DR
This paper presents a method to generate natural language descriptions from human brain activity evoked by visual stimuli, using a deep learning model trained on brain activity and image features.
Contribution
It introduces a novel approach combining regression models and pre-trained image-captioning networks to decode semantic content from brain activity.
Findings
The model successfully generates natural language descriptions from brain activity.
Semantic information is distributed across the entire cortex.
The approach works with limited brain data.
Abstract
Quantitative modeling of human brain activity based on language representations has been actively studied in systems neuroscience. However, previous studies examined word-level representation, and little is known about whether we could recover structured sentences from brain activity. This study attempts to generate natural language descriptions of semantic contents from human brain activity evoked by visual stimuli. To effectively use a small amount of available brain activity data, our proposed method employs a pre-trained image-captioning network model using a deep learning framework. To apply brain activity to the image-captioning network, we train regression models that learn the relationship between brain activity and deep-layer image features. The results demonstrate that the proposed model can decode brain activity and generate descriptions using natural language sentences. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning
