Learning Depth Estimation for Transparent and Mirror Surfaces

Alex Costanzino; Pierluigi Zama Ramirez; Matteo Poggi; Fabio Tosi; Stefano Mattoccia; Luigi Di Stefano

arXiv:2307.15052·cs.CV·July 28, 2023

TL;DR

This paper introduces a novel method for depth estimation of transparent and mirror surfaces using pseudo labels generated through in-painting, enabling neural networks to learn without ground-truth annotations.

Contribution

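The authors propose a simple, annotation-free pipeline that improves depth estimation for transparent or mirror (ToM) surfaces: reliable pseudo labels are generated by in-painting ToM objects and running a monocular depth model on the result, and existing models are then fine-tuned on those labels.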

Findings

01 Significant accuracy improvements on the Booster dataset.

02 Effective pseudo label generation without ground-truth annotations.

03 Applicable to both monocular and stereo depth estimation models.

Abstract

Inferring the depth of transparent or mirror (ToM) surfaces represents a hard challenge for sensors, algorithms, and deep networks alike. We propose a simple pipeline for learning to estimate depth properly for such surfaces with neural networks, without requiring any ground-truth annotation. We unveil how to obtain reliable pseudo labels by in-painting ToM objects in images and processing them with a monocular depth estimation model. These labels can be used to fine-tune existing monocular or stereo networks, to let them learn how to deal with ToM surfaces. Experimental results on the Booster dataset show the dramatic improvements enabled by our remarkably simple proposal.
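The pseudo-labeling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the flat-color fill used as a stand-in for in-painting, the per-pixel median over several fill colors, and the toy depth model are all assumptions made for illustration. In practice `depth_model` would be a pretrained monocular depth network, and the ToM mask would come from a segmentation step.

```python
import numpy as np

def inpaint_flat(image, mask, color):
    """Paint masked (ToM) pixels with one flat color.

    A deliberately simple stand-in for in-painting (assumption of this sketch).
    """
    out = image.copy()          # leave the input image untouched
    out[mask] = color           # mask is HxW bool, color is a length-3 tuple
    return out

def pseudo_label(image, mask, depth_model, colors):
    """Build a depth pseudo label from in-painted copies of the image.

    depth_model: any callable mapping an HxWx3 image to an HxW depth map
    (hypothetical interface). Predictions for the different fill colors
    are merged with a per-pixel median (assumption of this sketch).
    """
    preds = [depth_model(inpaint_flat(image, mask, c)) for c in colors]
    return np.median(np.stack(preds, axis=0), axis=0)

def toy_depth_model(image):
    """Hypothetical placeholder: mean intensity per pixel, NOT a real depth network."""
    return image.astype(np.float32).mean(axis=-1)

if __name__ == "__main__":
    image = np.zeros((4, 4, 3), dtype=np.uint8)
    mask = np.zeros((4, 4), dtype=bool)
    mask[1:3, 1:3] = True       # pretend the center is a mirror
    fills = [(10, 10, 10), (20, 20, 20), (30, 30, 30)]
    print(pseudo_label(image, mask, toy_depth_model, fills))
```

The resulting map would then serve as the supervision target when fine-tuning a monocular or stereo network, as the abstract describes; the fine-tuning loop itself is omitted here.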

Code & Models

Repositories

cvlab-unibo/depth4tom-code
pytorch

Taxonomy

Topics: Advanced Vision and Imaging · Optical measurement and interference techniques · 3D Surveying and Cultural Heritage