Robust 6DoF Pose Estimation Against Depth Noise and a Comprehensive Evaluation on a Mobile Dataset
Zixun Huang, Keling Yao, Seth Z. Zhao, Chuanyu Pan, Allen Y. Yang

TL;DR
This paper introduces DTTDNet, a transformer-based 6DoF pose estimation method that is highly robust to depth noise, validated on a new mobile dataset, and significantly outperforms existing approaches in accuracy and noise resilience.
Contribution
We propose DTTDNet, a novel transformer-based 6DoF pose estimation approach with a geometric filtering module and Chamfer loss, and introduce the DTTD-Mobile dataset for evaluating robustness against depth noise.
Findings
DTTDNet outperforms state-of-the-art methods by at least 4.32 in ADD metrics.
DTTDNet achieves up to 60.74 points improvement in accuracy.
Our approach demonstrates superior robustness to depth measurement noise.
Abstract
Robust 6DoF pose estimation with mobile devices is the foundation for applications in robotics, augmented reality, and digital twin localization. In this paper, we extensively investigate the robustness of existing RGBD-based 6DoF pose estimation methods against varying levels of depth sensor noise. We highlight that existing 6DoF pose estimation methods suffer significant performance discrepancies due to depth measurement inaccuracies. In response to the robustness issue, we present a simple and effective transformer-based 6DoF pose estimation approach called DTTDNet, featuring a novel geometric feature filtering module and a Chamfer distance loss for training. Moreover, we advance the field of robust 6DoF pose estimation and introduce a new dataset -- Digital Twin Tracking Dataset Mobile (DTTD-Mobile), tailored for digital twin object tracking with noisy depth data from the mobile…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVirtual Reality Applications and Impacts · Augmented Reality Applications · Surgical Simulation and Training
