MonoIndoor++:Towards Better Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments
Runze Li, Pan Ji, Yi Xu, Bir Bhanu

TL;DR
MonoIndoor++ advances self-supervised monocular depth estimation for indoor scenes by addressing scale variation and rotational motion challenges with a depth factorization module, residual pose estimation, and coordinate encoding, achieving state-of-the-art results.
Contribution
Introduces a novel framework with a depth scale factor, residual pose estimation, and coordinate encoding to improve indoor monocular depth estimation.
Findings
Achieves state-of-the-art performance on multiple indoor datasets.
Effectively handles depth scale variation and rotational motions.
Outperforms previous methods in indoor depth estimation accuracy.
Abstract
Self-supervised monocular depth estimation has seen significant progress in recent years, especially in outdoor environments. However, depth prediction results are not satisfying in indoor scenes where most of the existing data are captured with hand-held devices. As compared to outdoor environments, estimating depth of monocular videos for indoor environments, using self-supervised methods, results in two additional challenges: (i) the depth range of indoor video sequences varies a lot across different frames, making it difficult for the depth network to induce consistent depth cues for training; (ii) the indoor sequences recorded with handheld devices often contain much more rotational motions, which cause difficulties for the pose network to predict accurate relative camera poses. In this work, we propose a novel framework-MonoIndoor++ by giving special considerations to those…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Image Processing Techniques and Applications
