Looking through the mind's eye via multimodal encoder-decoder networks
Arman Afrasiyabi, Erica Busch, Rahul Singh, Dhananjay Bhaskar, Laurent, Caplette, Nicholas Turk-Browne, Smita Krishnaswamy

TL;DR
This paper presents a multimodal encoder-decoder approach to decode visual mental imagery from fMRI data by aligning brain activity with video representations using textual prompts, demonstrating effective mapping and decoding.
Contribution
The work introduces a novel method to decode mental imagery from fMRI signals by aligning fMRI embeddings with video representations via textual prompts, expanding dataset coverage.
Findings
Successful mapping between fMRI signals and video representations
Effective decoding of mental imagery from brain activity
Enhanced dataset improves model robustness
Abstract
In this work, we explore the decoding of mental imagery from subjects using their fMRI measurements. In order to achieve this decoding, we first created a mapping between a subject's fMRI signals elicited by the videos the subjects watched. This mapping associates the high dimensional fMRI activation states with visual imagery. Next, we prompted the subjects textually, primarily with emotion labels which had no direct reference to visual objects. Then to decode visual imagery that may have been in a person's mind's eye, we align a latent representation of these fMRI measurements with a corresponding video-fMRI based on textual labels given to the videos themselves. This alignment has the effect of overlapping the video fMRI embedding with the text-prompted fMRI embedding, thus allowing us to use our fMRI-to-video mapping to decode. Additionally, we enhance an existing fMRI dataset,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCognitive Science and Education Research · Anomaly Detection Techniques and Applications · Emotion and Mood Recognition
MethodsALIGN
