Animate Your Thoughts: Decoupled Reconstruction of Dynamic Natural Vision from Slow Brain Activity
Yizhuo Lu, Changde Du, Chong Wang, Xuanliu Zhu, Liuyun, Jiang, Xujin Li, Huiguang He

TL;DR
This paper introduces Mind-Animator, a two-stage model that decouples semantic, structural, and motion features from fMRI data to improve the reconstruction of dynamic natural vision, achieving state-of-the-art results.
Contribution
The paper presents a novel two-stage framework that decouples features from fMRI signals and integrates them into videos using inflation of Stable Diffusion, addressing previous limitations in video reconstruction from brain activity.
Findings
Achieves state-of-the-art performance on multiple datasets.
Effectively decouples semantic, structure, and motion features from fMRI.
Provides neurobiologically interpretable visualizations.
Abstract
Reconstructing human dynamic vision from brain activity is a challenging task with great scientific significance. Although prior video reconstruction methods have made substantial progress, they still suffer from several limitations, including: (1) difficulty in simultaneously reconciling semantic (e.g. categorical descriptions), structure (e.g. size and color), and consistent motion information (e.g. order of frames); (2) low temporal resolution of fMRI, which poses a challenge in decoding multiple frames of video dynamics from a single fMRI frame; (3) reliance on video generation models, which introduces ambiguity regarding whether the dynamics observed in the reconstructed videos are genuinely derived from fMRI data or are hallucinations from generative model. To overcome these limitations, we propose a two-stage model named Mind-Animator. During the fMRI-to-feature stage, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Neural dynamics and brain function
MethodsContrastive Learning · Diffusion
