CHADET: Cross-Hierarchical-Attention for Depth-Completion Using Unsupervised Lightweight Transformer

Kevin Christiansen Marsim; Jinwoo Jeon; Yeeun Kim; Myeongwoo Jeong; Hyun Myung

arXiv:2507.15189·cs.RO·July 22, 2025

CHADET: Cross-Hierarchical-Attention for Depth-Completion Using Unsupervised Lightweight Transformer

Kevin Christiansen Marsim, Jinwoo Jeon, Yeeun Kim, Myeongwoo Jeong, Hyun Myung

PDF

TL;DR

CHADET introduces a lightweight transformer-based depth completion method that enhances depth map accuracy and efficiency for real-time robotic applications by utilizing a novel cross-hierarchical-attention mechanism.

Contribution

It proposes a novel cross-hierarchical-attention module within a lightweight transformer framework for improved depth completion from RGB and sparse depth data.

Findings

01

Improves depth map quality on KITTI, NYUv2, and VOID datasets.

02

Reduces memory usage compared to existing methods.

03

Achieves real-time inference suitable for robotic tasks.

Abstract

Depth information which specifies the distance between objects and current position of the robot is essential for many robot tasks such as navigation. Recently, researchers have proposed depth completion frameworks to provide dense depth maps that offer comprehensive information about the surrounding environment. However, existing methods show significant trade-offs between computational efficiency and accuracy during inference. The substantial memory and computational requirements make them unsuitable for real-time applications, highlighting the need to improve the completeness and accuracy of depth information while improving processing speed to enhance robot performance in various tasks. To address these challenges, in this paper, we propose CHADET(cross-hierarchical-attention depth-completion transformer), a lightweight depth-completion network that can generate accurate dense depth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.