UAMD-Net: A Unified Adaptive Multimodal Neural Network for Dense Depth Completion
Guancheng Chen, Junli Lin, Huabiao Qin

TL;DR
UAMD-Net is a novel neural network that fuses stereo matching and sparse point cloud data for dense depth completion, featuring an adaptive training strategy to handle modal dependence and improve robustness.
Contribution
The paper introduces UAMD-Net, a multimodal neural network with a new Modal-dropout training strategy, enabling robust depth completion under various modal input conditions.
Findings
Outperforms state-of-the-art methods on KITTI benchmark.
Demonstrates robustness with various modal input combinations.
Achieves accurate dense depth maps in autonomous driving scenarios.
Abstract
Depth prediction is a critical problem in robotics applications especially autonomous driving. Generally, depth prediction based on binocular stereo matching and fusion of monocular image and laser point cloud are two mainstream methods. However, the former usually suffers from overfitting while building cost volume, and the latter has a limited generalization due to the lack of geometric constraint. To solve these problems, we propose a novel multimodal neural network, namely UAMD-Net, for dense depth completion based on fusion of binocular stereo matching and the weak constrain from the sparse point clouds. Specifically, the sparse point clouds are converted to sparse depth map and sent to the multimodal feature encoder (MFE) with binocular image, constructing a cross-modal cost volume. Then, it will be further processed by the multimodal feature aggregator (MFA) and the depth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Image Processing Techniques and Applications
