Reanimating Images using Neural Representations of Dynamic Stimuli
Jacob Yeung, Andrew F. Luo, Gabriel Sarch, Margaret M. Henderson, Deva, Ramanan, Michael J. Tarr

TL;DR
This paper introduces BrainNRDS, a method that uses neural representations and brain activity to decode and reanimate dynamic visual stimuli, advancing understanding of motion perception and improving video prediction models.
Contribution
The paper presents a novel approach combining brain imaging and video diffusion models to decode and generate dynamic visual stimuli from neural activity.
Findings
Decoded fine-grained optical flow from brain activity.
Video encoders outperform image-based models in predicting brain responses.
Enabled realistic video reanimation from initial frames.
Abstract
While computer vision models have made incredible strides in static image recognition, they still do not match human performance in tasks that require the understanding of complex, dynamic motion. This is notably true for real-world scenarios where embodied agents face complex and motion-rich environments. Our approach, BrainNRDS (Brain-Neural Representations of Dynamic Stimuli), leverages state-of-the-art video diffusion models to decouple static image representation from motion generation, enabling us to utilize fMRI brain activity for a deeper understanding of human responses to dynamic visual stimuli. Conversely, we also demonstrate that information about the brain's representation of motion can enhance the prediction of optical flow in artificial systems. Our novel approach leads to four main findings: (1) Visual motion, represented as fine-grained, object-level resolution optical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual perception and processing mechanisms
MethodsDiffusion
