SPIdepth: Strengthened Pose Information for Self-supervised Monocular Depth Estimation
Mykola Lavreniuk

TL;DR
SPIdepth enhances pose network capabilities in self-supervised monocular depth estimation, achieving state-of-the-art results on multiple datasets by leveraging strengthened pose information for better scene understanding.
Contribution
The paper introduces SPIdepth, a novel method that emphasizes strengthening pose information to significantly improve depth estimation accuracy.
Findings
Achieves state-of-the-art results on KITTI, Cityscapes, and Make3D datasets.
Surpasses previous methods in depth estimation metrics without using motion masks.
Outperforms video-based methods using only a single image for inference.
Abstract
Self-supervised monocular depth estimation has garnered considerable attention for its applications in autonomous driving and robotics. While recent methods have made strides in leveraging techniques like the Self Query Layer (SQL) to infer depth from motion, they often overlook the potential of strengthening pose information. In this paper, we introduce SPIdepth, a novel approach that prioritizes enhancing the pose network for improved depth estimation. Building upon the foundation laid by SQL, SPIdepth emphasizes the importance of pose information in capturing fine-grained scene structures. By enhancing the pose network's capabilities, SPIdepth achieves remarkable advancements in scene understanding and depth estimation. Experimental results on benchmark datasets such as KITTI, Cityscapes, and Make3D showcase SPIdepth's state-of-the-art performance, surpassing previous methods by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Image Processing Techniques and Applications
MethodsLinear Layer · Multi-Head Attention · Attention Is All You Need · Transformer
