IE2Video: Adapting Pretrained Diffusion Models for Event-Based Video Reconstruction
Dmitrii Torbunov, Onur Okuducu, Yi Huang, Odera Dim, Rebecca Coles, Yonggang Cui, Yihui Ren

TL;DR
This paper introduces IE2Video, a hybrid event-based video reconstruction method that combines sparse RGB keyframes with event streams, utilizing diffusion models to produce high-quality videos with lower power consumption.
Contribution
It proposes a novel task of reconstructing RGB videos from a single frame and event data, and explores two architectural strategies including a diffusion-based approach that outperforms autoregressive models.
Findings
Diffusion-based approach achieves 33% better perceptual quality than autoregressive baseline.
Method demonstrates robust cross-dataset generalization across multiple event camera datasets.
Reconstruction maintains high quality over varying sequence lengths and unseen configurations.
Abstract
Continuous video monitoring in surveillance, robotics, and wearable systems faces a fundamental power constraint: conventional RGB cameras consume substantial energy through fixed-rate capture. Event cameras offer sparse, motion-driven sensing with low power consumption, but produce asynchronous event streams rather than RGB video. We propose a hybrid capture paradigm that records sparse RGB keyframes alongside continuous event streams, then reconstructs full RGB video offline -- reducing capture power consumption while maintaining standard video output for downstream applications. We introduce the Image and Event to Video (IE2Video) task: reconstructing RGB video sequences from a single initial frame and subsequent event camera data. We investigate two architectural strategies: adapting an autoregressive model (HyperE2VID) for RGB generation, and injecting event representations into a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Age of Information Optimization · Human Pose and Action Recognition
