SPIdepth: Strengthened Pose Information for Self-supervised Monocular   Depth Estimation

Mykola Lavreniuk

arXiv:2404.12501·cs.CV·September 4, 2024·2 cites

SPIdepth: Strengthened Pose Information for Self-supervised Monocular Depth Estimation

Mykola Lavreniuk

PDF

Open Access 1 Repo 1 Models

TL;DR

SPIdepth enhances pose network capabilities in self-supervised monocular depth estimation, achieving state-of-the-art results on multiple datasets by leveraging strengthened pose information for better scene understanding.

Contribution

The paper introduces SPIdepth, a novel method that emphasizes strengthening pose information to significantly improve depth estimation accuracy.

Findings

01

Achieves state-of-the-art results on KITTI, Cityscapes, and Make3D datasets.

02

Surpasses previous methods in depth estimation metrics without using motion masks.

03

Outperforms video-based methods using only a single image for inference.

Abstract

Self-supervised monocular depth estimation has garnered considerable attention for its applications in autonomous driving and robotics. While recent methods have made strides in leveraging techniques like the Self Query Layer (SQL) to infer depth from motion, they often overlook the potential of strengthening pose information. In this paper, we introduce SPIdepth, a novel approach that prioritizes enhancing the pose network for improved depth estimation. Building upon the foundation laid by SQL, SPIdepth emphasizes the importance of pose information in capturing fine-grained scene structures. By enhancing the pose network's capabilities, SPIdepth achieves remarkable advancements in scene understanding and depth estimation. Experimental results on benchmark datasets such as KITTI, Cityscapes, and Make3D showcase SPIdepth's state-of-the-art performance, surpassing previous methods by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Lavreniuk/SPIdepth
pytorchOfficial

Models

🤗
MykolaL/SPIdepth
model· ♡ 2
♡ 2

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Image Processing Techniques and Applications

MethodsLinear Layer · Multi-Head Attention · Attention Is All You Need · Transformer