Spatio-Temporal Difference Guided Motion Deblurring with the Complementary Vision Sensor
Yapeng Meng, Lin Yang, Yuguo Chen, Xiangru Chen, Taoyi Wang, Lijian Wang, Zheyu Yang, Yihan Lin, Rong Zhao

TL;DR
This paper introduces STGDNet, a novel neural network that leverages the complementary vision sensor's structural and motion data to significantly improve motion deblurring in extreme dynamic scenes.
Contribution
The paper proposes a recurrent multi-branch architecture that effectively fuses structural and motion cues from the CVS to enhance RGB deblurring, outperforming existing methods.
Findings
Outperforms current RGB and event-based deblurring approaches.
Demonstrates strong generalization across over 100 real-world scenarios.
Effectively restores details in extreme motion scenes.
Abstract
Motion blur arises when rapid scene changes occur during the exposure period, collapsing rich intra-exposure motion into a single RGB frame. Without explicit structural or temporal cues, RGB-only deblurring is highly ill-posed and often fails under extreme motion. Inspired by the human visual system, brain-inspired vision sensors introduce temporally dense information to alleviate this problem. However, event cameras still suffer from event rate saturation under rapid motion, while the event modality entangles edge features and motion cues, which limits their effectiveness. As a recent breakthrough, the complementary vision sensor (CVS), Tianmouc, captures synchronized RGB frames together with high-frame-rate, multi-bit spatial difference (SD, encoding structural edges) and temporal difference (TD, encoding motion cues) data within a single RGB exposure, offering a promising solution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
