NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation
Yiran Wang, Min Shi, Jiaqi Li, Chaoyi Hong, Zihao Huang, Juewen Peng, Zhiguo Cao, Jianming Zhang, Ke Xian, Guosheng Lin

TL;DR
NVDS+ is a versatile, efficient neural stabilizer that improves video depth estimation consistency across various models and applications, supported by a large-scale dataset and bidirectional inference strategy.
Contribution
Introduces NVDS+, a plug-and-play stabilizer for video depth estimation, along with the largest natural-scene video depth dataset VDW, and extends to multiple downstream tasks.
Findings
Significant improvements in depth consistency and accuracy
Effective across various single-image models and applications
Provides a large-scale dataset for training and evaluation
Abstract
Video depth estimation aims to infer temporally consistent depth. One approach is to finetune a single-image model on each video with geometry constraints, which proves inefficient and lacks robustness. An alternative is learning to enforce consistency from data, which requires well-designed models and sufficient video depth data. To address both challenges, we introduce NVDS+ that stabilizes inconsistent depth estimated by various single-image models in a plug-and-play manner. We also elaborate a large-scale Video Depth in the Wild (VDW) dataset, which contains 14,203 videos with over two million frames, making it the largest natural-scene video depth dataset. Additionally, a bidirectional inference strategy is designed to improve consistency by adaptively fusing forward and backward predictions. We instantiate a model family ranging from small to large scales for different…
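The abstract describes the bidirectional inference strategy only as adaptively fusing forward and backward predictions, without saying how. As an illustration of the general idea, not the authors' method, the sketch below fuses two depth maps with a per-pixel weighted average. The function name `fuse_bidirectional` and the confidence maps `conf_fwd` and `conf_bwd` are hypothetical; they stand in for whatever adaptive weighting the paper actually uses.

```python
import numpy as np


def fuse_bidirectional(depth_fwd, depth_bwd, conf_fwd, conf_bwd, eps=1e-8):
    """Fuse forward- and backward-pass depth maps for one frame.

    Illustrative assumption: each pass comes with a non-negative per-pixel
    confidence map, and the fused depth is their confidence-weighted average.
    This is NOT taken from the NVDS+ paper.
    """
    depth_fwd = np.asarray(depth_fwd, dtype=np.float64)
    depth_bwd = np.asarray(depth_bwd, dtype=np.float64)
    conf_fwd = np.asarray(conf_fwd, dtype=np.float64)
    conf_bwd = np.asarray(conf_bwd, dtype=np.float64)

    total = conf_fwd + conf_bwd
    # Forward weight in [0, 1]; fall back to a plain average (0.5) where
    # both confidences are effectively zero.
    w_fwd = np.where(total > eps, conf_fwd / np.maximum(total, eps), 0.5)
    return w_fwd * depth_fwd + (1.0 - w_fwd) * depth_bwd


if __name__ == "__main__":
    fwd = np.array([[1.0, 2.0]])
    bwd = np.array([[3.0, 6.0]])
    # Equal confidence reduces to a simple average of the two passes.
    print(fuse_bidirectional(fwd, bwd, np.ones_like(fwd), np.ones_like(bwd)))
```

With equal confidences this reduces to a fixed average of the two directions. Letting the weights vary per pixel is one plausible reading of "adaptive", under the assumptions stated above.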
Taxonomy
Topics: Advanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image Processing Techniques
