DynamicStereo: Consistent Dynamic Depth from Stereo Videos
Nikita Karaev, Ignacio Rocco, Benjamin Graham, Natalia Neverova,, Andrea Vedaldi, Christian Rupprecht

TL;DR
DynamicStereo is a transformer-based model that improves temporal consistency in depth estimation from stereo videos, addressing flickering issues crucial for immersive AR/VR experiences.
Contribution
It introduces a novel architecture with divided attention layers for efficient processing and a new dataset, Dynamic Replica, for training and benchmarking dynamic stereo depth estimation.
Findings
Enhanced temporal consistency in depth predictions.
Improved accuracy using the Dynamic Replica dataset.
Effective processing of stereo videos with divided attention layers.
Abstract
We consider the problem of reconstructing a dynamic scene observed from a stereo camera. Most existing methods for depth from stereo treat different stereo frames independently, leading to temporally inconsistent depth predictions. Temporal consistency is especially important for immersive AR or VR scenarios, where flickering greatly diminishes the user experience. We propose DynamicStereo, a novel transformer-based architecture to estimate disparity for stereo videos. The network learns to pool information from neighboring frames to improve the temporal consistency of its predictions. Our architecture is designed to process stereo videos efficiently through divided attention layers. We also introduce Dynamic Replica, a new benchmark dataset containing synthetic videos of people and animals in scanned environments, which provides complementary training and evaluation data for dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Enhancement Techniques
