NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction
Zixuan Gong, Guangyin Bao, Qi Zhang, Zhongwei Wan, Duoqian Miao,, Shoujin Wang, Lei Zhu, Changwei Wang, Rongtao Xu, Liang Hu, Ke Liu, Yu Zhang

TL;DR
NeuroClips introduces a novel framework that decodes high-fidelity, smooth videos from fMRI data by combining semantic and perceptual reconstructions, significantly outperforming existing models in quality and temporal consistency.
Contribution
The paper presents NeuroClips, a new method that effectively reconstructs continuous videos from fMRI by integrating semantic keyframes and low-level perception flows with a diffusion model.
Findings
Achieves up to 6 seconds of video at 8 FPS with high fidelity.
Improves SSIM by 128% over previous models.
Enhances spatiotemporal reconstruction metrics by 81%.
Abstract
Reconstruction of static visual stimuli from non-invasion brain activity fMRI achieves great success, owning to advanced deep learning models such as CLIP and Stable Diffusion. However, the research on fMRI-to-video reconstruction remains limited since decoding the spatiotemporal perception of continuous visual experiences is formidably challenging. We contend that the key to addressing these challenges lies in accurately decoding both high-level semantics and low-level perception flows, as perceived by the brain in response to video stimuli. To the end, we propose NeuroClips, an innovative framework to decode high-fidelity and smooth video from fMRI. NeuroClips utilizes a semantics reconstructor to reconstruct video keyframes, guiding semantic accuracy and consistency, and employs a perception reconstructor to capture low-level perceptual details, ensuring video smoothness. During…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeuroscience and Neural Engineering · Cell Image Analysis Techniques · Advanced Fluorescence Microscopy Techniques
MethodsContrastive Language-Image Pre-training · Diffusion
