DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation
Wenzhou Lyu, Jialing Lin, Wenqi Ren, Ruihao Xia, Feng Qian, Yang Tang

TL;DR
DidSee introduces a diffusion-based depth completion framework that effectively handles non-Lambertian objects, improves generalization, and enhances downstream robotic perception tasks through novel bias mitigation and semantic integration techniques.
Contribution
The paper presents DidSee, a novel diffusion-based depth completion method with bias correction and semantic enhancement, improving accuracy and robustness for non-Lambertian objects in robotic perception.
Findings
Achieves state-of-the-art results on multiple benchmarks.
Demonstrates robust real-world generalization.
Enhances downstream tasks like pose estimation and grasping.
Abstract
Commercial RGB-D cameras often produce noisy, incomplete depth maps for non-Lambertian objects. Traditional depth completion methods struggle to generalize due to the limited diversity and scale of training data. Recent advances exploit visual priors from pre-trained text-to-image diffusion models to enhance generalization in dense prediction tasks. However, we find that biases arising from training-inference mismatches in the vanilla diffusion framework significantly impair depth completion performance. Additionally, the lack of distinct visual features in non-Lambertian regions further hinders precise prediction. To address these issues, we propose \textbf{DidSee}, a diffusion-based framework for depth completion on non-Lambertian objects. First, we integrate a rescaled noise scheduler enforcing a zero terminal signal-to-noise ratio to eliminate signal leakage bias. Second, we devise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Industrial Vision Systems and Defect Detection · Manufacturing Process and Optimization
MethodsDiffusion
