Matching-Free Depth Recovery from Structured Light

Zhuohang Yu; Kai Wang; Kun Huang; Juyong Zhang

arXiv:2501.07113·cs.CV·June 26, 2025

Matching-Free Depth Recovery from Structured Light

Zhuohang Yu, Kai Wang, Kun Huang, Juyong Zhang

PDF

TL;DR

This paper presents a new matching-free depth recovery method from structured light images using a density voxel grid and self-supervised volume rendering, achieving faster convergence and improved accuracy over existing techniques.

Contribution

It introduces a novel matching-free depth estimation approach employing a density voxel grid and self-supervised differentiable volume rendering, enhancing speed and geometric accuracy.

Findings

01

Achieves ~30% reduction in depth errors compared to matching-based methods.

02

Approximately three times faster training than previous implicit matching-free methods.

03

Outperforms existing techniques in synthetic and real-world scenes.

Abstract

We introduce a novel approach for depth estimation using images obtained from monocular structured light systems. In contrast to many existing methods that depend on image matching, our technique employs a density voxel grid to represent scene geometry. This grid is trained through self-supervised differentiable volume rendering. Our method leverages color fields derived from the projected patterns in structured light systems during the rendering process, facilitating the isolated optimization of the geometry field. This innovative approach leads to faster convergence and high-quality results. Additionally, we integrate normalized device coordinates (NDC), a distortion loss, and a distinctive surface-based color loss to enhance geometric fidelity. Experimental results demonstrate that our method outperforms current matching-based techniques in terms of geometric performance in few-shot…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings