Reanimating Images using Neural Representations of Dynamic Stimuli

Jacob Yeung; Andrew F. Luo; Gabriel Sarch; Margaret M. Henderson; Deva; Ramanan; Michael J. Tarr

arXiv:2406.02659·q-bio.NC·April 9, 2025

Reanimating Images using Neural Representations of Dynamic Stimuli

Jacob Yeung, Andrew F. Luo, Gabriel Sarch, Margaret M. Henderson, Deva, Ramanan, Michael J. Tarr

PDF

Open Access

TL;DR

This paper introduces BrainNRDS, a method that uses neural representations and brain activity to decode and reanimate dynamic visual stimuli, advancing understanding of motion perception and improving video prediction models.

Contribution

The paper presents a novel approach combining brain imaging and video diffusion models to decode and generate dynamic visual stimuli from neural activity.

Findings

01

Decoded fine-grained optical flow from brain activity.

02

Video encoders outperform image-based models in predicting brain responses.

03

Enabled realistic video reanimation from initial frames.

Abstract

While computer vision models have made incredible strides in static image recognition, they still do not match human performance in tasks that require the understanding of complex, dynamic motion. This is notably true for real-world scenarios where embodied agents face complex and motion-rich environments. Our approach, BrainNRDS (Brain-Neural Representations of Dynamic Stimuli), leverages state-of-the-art video diffusion models to decouple static image representation from motion generation, enabling us to utilize fMRI brain activity for a deeper understanding of human responses to dynamic visual stimuli. Conversely, we also demonstrate that information about the brain's representation of motion can enhance the prediction of optical flow in artificial systems. Our novel approach leads to four main findings: (1) Visual motion, represented as fine-grained, object-level resolution optical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual perception and processing mechanisms

MethodsDiffusion