GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth

Aurélien Cecille; Stefan Duffner; Franck Davoine; Thibault Neveu; Rémi Agier

arXiv:2409.14850·cs.CV·September 24, 2024

TL;DR

This paper introduces GroCo, a ground constraint method for self-supervised monocular depth estimation that improves scale recovery and model generalization across diverse datasets and camera poses.

Contribution

We propose a novel ground constraint mechanism tailored for self-supervised monocular depth estimation, enhancing scale recovery and cross-dataset generalization.

Findings

01. Outperforms existing scale recovery methods on KITTI.

02. Enhances robustness across diverse camera rotations.

03. Improves zero-shot generalization to unseen datasets.

Abstract

Monocular depth estimation has improved greatly in recent years, but models predicting metric depth still struggle to generalize across diverse camera poses and datasets. While recent supervised methods mitigate this issue by leveraging ground prior information at inference, their adaptability to self-supervised settings is limited by the additional challenge of scale recovery. To address this gap, we propose a novel constraint on ground areas designed specifically for the self-supervised paradigm. This mechanism not only allows the scale to be recovered accurately but also ensures coherence between the depth prediction and the ground prior. Experimental results show that our method surpasses existing scale recovery techniques on the KITTI benchmark and significantly enhances model generalization capabilities. This improvement can be observed by its more robust…
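To make the idea of a ground prior more concrete, here is a minimal sketch of how a flat-ground depth value can be derived from camera geometry and used to estimate a metric scale factor. It is not the constraint proposed in the paper: it assumes a pinhole camera with zero roll, a perfectly planar ground, and a known camera height and pitch. The function names `flat_ground_depth` and `ground_scale`, and the median-ratio rule, are illustrative choices and do not come from the abstract.

```python
import math
from statistics import median


def flat_ground_depth(v, fy, cy, cam_height, pitch=0.0):
    """Depth along the optical axis of a flat ground plane seen at image row v.

    Assumptions (hypothetical setup, not taken from the paper): pinhole camera
    with axes x right, y down, z forward; zero roll; camera tilted down by
    `pitch` radians and mounted `cam_height` metres above a planar ground.
    Returns None for rows at or above the horizon.
    """
    y_norm = (v - cy) / fy
    # Gravity direction in camera coordinates is (0, cos(pitch), sin(pitch)),
    # so the ground plane satisfies z * (cos(pitch) * y_norm + sin(pitch)) = h.
    denom = math.cos(pitch) * y_norm + math.sin(pitch)
    if denom <= 1e-9:
        return None
    return cam_height / denom


def ground_scale(pred_depths, rows, fy, cy, cam_height, pitch=0.0):
    """Median ratio between the ground prior and unscaled predicted depths.

    `pred_depths` and `rows` describe pixels assumed to lie on the ground.
    """
    ratios = []
    for depth, v in zip(pred_depths, rows):
        prior = flat_ground_depth(v, fy, cy, cam_height, pitch)
        if prior is not None and depth > 0:
            ratios.append(prior / depth)
    if not ratios:
        raise ValueError("no valid ground pixels")
    return median(ratios)
```

With illustrative values `fy=700.0`, `cy=180.0`, and `cam_height=1.65`, a ground pixel 70 rows below the principal point maps to a prior depth of about 16.5 m. Multiplying an up-to-scale depth map by the value returned from `ground_scale` gives a metric estimate under the same flat-ground assumption.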

Code & Models

Repositories

Visual-Behavior/GroCo
PyTorch · Official