TL;DR
The paper introduces ACMNet, a novel depth completion model that uses adaptive graph propagation and multi-modal fusion to effectively recover dense depth maps from sparse data, outperforming existing methods.
Contribution
It proposes a new adaptive graph propagation approach combined with multi-modal fusion and a symmetric gated strategy for improved depth completion.
Findings
Achieves state-of-the-art results on KITTI and NYU-v2 benchmarks.
Uses fewer parameters than previous models.
Effectively models observed spatial contexts with graph-based methods.
Abstract
Depth completion aims to recover a dense depth map from the sparse depth data and the corresponding single RGB image. The observed pixels provide the significant guidance for the recovery of the unobserved pixels' depth. However, due to the sparsity of the depth data, the standard convolution operation, exploited by most of existing methods, is not effective to model the observed contexts with depth values. To address this issue, we propose to adopt the graph propagation to capture the observed spatial contexts. Specifically, we first construct multiple graphs at different scales from observed pixels. Since the graph structure varies from sample to sample, we then apply the attention mechanism on the propagation, which encourages the network to model the contextual information adaptively. Furthermore, considering the mutli-modality of input data, we exploit the graph propagation on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
