Rapid Pose Label Generation through Sparse Representation of Unknown   Objects

Rohan Pratap Singh; Mehdi Benallegue; Yusuke Yoshiyasu; Fumio Kanehiro

arXiv:2011.03790·cs.CV·July 27, 2022

Rapid Pose Label Generation through Sparse Representation of Unknown Objects

Rohan Pratap Singh, Mehdi Benallegue, Yusuke Yoshiyasu, Fumio Kanehiro

PDF

1 Repo

TL;DR

This paper introduces a fast method for generating pose-labeled RGB-D data for unknown objects without needing 3D models or complex setups, enabling effective training of pose estimation networks.

Contribution

It presents a novel sparse representation approach that allows rapid pose label generation for unknown objects using minimal human input and optimization, bypassing traditional modeling requirements.

Findings

01

Generated datasets enable effective training of pose estimation networks.

02

Sparse models scale efficiently to many scenes.

03

Method avoids need for 3D models and complex setups.

Abstract

Deep Convolutional Neural Networks (CNNs) have been successfully deployed on robots for 6-DoF object pose estimation through visual perception. However, obtaining labeled data on a scale required for the supervised training of CNNs is a difficult task - exacerbated if the object is novel and a 3D model is unavailable. To this end, this work presents an approach for rapidly generating real-world, pose-annotated RGB-D data for unknown objects. Our method not only circumvents the need for a prior 3D object model (textured or otherwise) but also bypasses complicated setups of fiducial markers, turntables, and sensors. With the help of a human user, we first source minimalistic labelings of an ordered set of arbitrarily chosen keypoints over a set of RGB-D videos. Then, by solving an optimization problem, we combine these labels under a world frame to recover a sparse, keypoint-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rohanpsingh/RapidPoseLabels
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.