Multi-Modal Dataset Acquisition for Photometrically Challenging Object

HyunJun Jung; Patrick Ruhkamp; Nassir Navab; Benjamin Busam

arXiv:2308.10621·cs.CV·August 22, 2023

Open Access

TL;DR

This paper introduces a novel multi-modal dataset acquisition pipeline that improves the accuracy, realism, and coverage of 3D perception datasets for photometrically challenging objects, using robotic and freehand methods.

Contribution

It presents a new annotation and data collection pipeline combining robotic kinematics, infrared tracking, and calibration for high-quality 3D datasets.

Findings

01

Enhanced dataset accuracy and realism

02

Wider viewpoint coverage achieved with freehand procedure

03

Improved 3D object and camera pose annotations

Abstract

This paper addresses the limitations of current datasets for 3D vision tasks in terms of accuracy, size, realism, and suitable imaging modalities for photometrically challenging objects. We propose a novel annotation and acquisition pipeline that enhances existing 3D perception and 6D object pose datasets. Our approach integrates robotic forward-kinematics, external infrared trackers, and improved calibration and annotation procedures. We present a multi-modal sensor rig, mounted on a robotic end-effector, and demonstrate how it is integrated into the creation of highly accurate datasets. Additionally, we introduce a freehand procedure for wider viewpoint coverage. Both approaches yield high-quality 3D data with accurate object and camera pose annotations. Our methods overcome the limitations of existing datasets and provide valuable resources for 3D vision research.
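As a rough illustration of how robotic forward kinematics and calibration can combine into a pose annotation, the sketch below chains rigid transforms to express an object pose in the camera frame. It is not the authors' implementation: the frame names (`base`, `ee`, `cam`, `obj`), the helper functions, and all numeric values are assumptions made for this example, and it presumes the end-effector pose, the end-effector-to-camera calibration, and the object pose in the robot base frame are already known.

```python
import numpy as np


def make_pose(rotation, translation):
    """Build a 4x4 homogeneous transform from a 3x3 rotation and a 3-vector."""
    T = np.eye(4)
    T[:3, :3] = rotation
    T[:3, 3] = translation
    return T


def rot_z(angle_rad):
    """Rotation matrix for a rotation about the z-axis."""
    c, s = np.cos(angle_rad), np.sin(angle_rad)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])


def invert_pose(T):
    """Invert a rigid transform using the transpose of its rotation block."""
    R, t = T[:3, :3], T[:3, 3]
    T_inv = np.eye(4)
    T_inv[:3, :3] = R.T
    T_inv[:3, 3] = -R.T @ t
    return T_inv


def object_pose_in_camera(T_base_ee, T_ee_cam, T_base_obj):
    """Chain forward kinematics and calibration to get the object pose in the camera frame."""
    T_base_cam = T_base_ee @ T_ee_cam  # camera pose in the robot base frame
    return invert_pose(T_base_cam) @ T_base_obj


# Hypothetical values chosen only for illustration (metres, radians).
T_base_ee = make_pose(rot_z(np.pi / 2), [0.5, 0.0, 0.4])  # from forward kinematics
T_ee_cam = make_pose(np.eye(3), [0.0, 0.0, 0.1])          # from hand-eye calibration
T_base_obj = make_pose(np.eye(3), [0.5, 0.3, 0.0])        # object pose in the base frame

T_cam_obj = object_pose_in_camera(T_base_ee, T_ee_cam, T_base_obj)
print(np.round(T_cam_obj, 3))
```

With these made-up inputs the object sits at roughly (0.3, 0, -0.5) in the camera frame. In a real acquisition setup each of the three input transforms carries its own measurement error, which is why calibration quality directly bounds annotation accuracy.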

Taxonomy

Topics: Robotics and Sensor-Based Localization · Advanced Vision and Imaging · 3D Surveying and Cultural Heritage