Depth Prompting for Sensor-Agnostic Depth Estimation
Jin-Hwi Park, Chanhwi Jeong, Junoh Lee, Hae-Gon Jeon

TL;DR
This paper introduces a depth prompt module that enhances sensor-agnostic depth estimation by mitigating biases and enabling foundation models to produce accurate, scale-aware depth maps across various sensors and scene configurations.
Contribution
The paper proposes a novel depth prompt module that disentangles input modalities, allowing foundation models to generalize better across different sensors and scene setups.
Findings
Improves depth estimation accuracy across multiple sensor types.
Enables foundation models to produce absolute scale depth maps.
Demonstrates effectiveness through extensive evaluations.
Abstract
Dense depth maps have been used as a key element of visual perception tasks. There have been tremendous efforts to enhance the depth quality, ranging from optimization-based to learning-based methods. Despite the remarkable progress for a long time, their applicability in the real world is limited due to systematic measurement biases such as density, sensing pattern, and scan range. It is well-known that the biases make it difficult for these methods to achieve their generalization. We observe that learning a joint representation for input modalities (e.g., images and depth), which most recent methods adopt, is sensitive to the biases. In this work, we disentangle those modalities to mitigate the biases with prompt engineering. For this, we design a novel depth prompt module to allow the desirable feature representation according to new depth distributions from either sensor types or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection
