Robust 6DoF Pose Estimation Against Depth Noise and a Comprehensive Evaluation on a Mobile Dataset

Zixun Huang; Keling Yao; Seth Z. Zhao; Chuanyu Pan; Allen Y. Yang

arXiv:2309.13570·cs.CV·June 4, 2025·1 cites

Robust 6DoF Pose Estimation Against Depth Noise and a Comprehensive Evaluation on a Mobile Dataset

Zixun Huang, Keling Yao, Seth Z. Zhao, Chuanyu Pan, Allen Y. Yang

PDF

Open Access 3 Repos 2 Datasets

TL;DR

This paper introduces DTTDNet, a transformer-based 6DoF pose estimation method that is highly robust to depth noise, validated on a new mobile dataset, and significantly outperforms existing approaches in accuracy and noise resilience.

Contribution

We propose DTTDNet, a novel transformer-based 6DoF pose estimation approach with a geometric filtering module and Chamfer loss, and introduce the DTTD-Mobile dataset for evaluating robustness against depth noise.

Findings

01

DTTDNet outperforms state-of-the-art methods by at least 4.32 in ADD metrics.

02

DTTDNet achieves up to 60.74 points improvement in accuracy.

03

Our approach demonstrates superior robustness to depth measurement noise.

Abstract

Robust 6DoF pose estimation with mobile devices is the foundation for applications in robotics, augmented reality, and digital twin localization. In this paper, we extensively investigate the robustness of existing RGBD-based 6DoF pose estimation methods against varying levels of depth sensor noise. We highlight that existing 6DoF pose estimation methods suffer significant performance discrepancies due to depth measurement inaccuracies. In response to the robustness issue, we present a simple and effective transformer-based 6DoF pose estimation approach called DTTDNet, featuring a novel geometric feature filtering module and a Chamfer distance loss for training. Moreover, we advance the field of robust 6DoF pose estimation and introduce a new dataset -- Digital Twin Tracking Dataset Mobile (DTTD-Mobile), tailored for digital twin object tracking with noisy depth data from the mobile…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVirtual Reality Applications and Impacts · Augmented Reality Applications · Surgical Simulation and Training