Good Grasps Only: A data engine for self-supervised fine-tuning of pose   estimation using grasp poses for verification

Frederik Hagelskj{\ae}r

arXiv:2409.11512·cs.RO·December 13, 2024

Good Grasps Only: A data engine for self-supervised fine-tuning of pose estimation using grasp poses for verification

Frederik Hagelskj{\ae}r

PDF

Open Access 1 Models

TL;DR

This paper introduces a self-supervised method for robot pose estimation that automatically generates training data through grasp verification, enabling rapid setup and improved accuracy in bin picking tasks.

Contribution

The proposed approach allows for continuous fine-tuning of pose estimation during operation without manual labeling, outperforming existing methods in bin picking scenarios.

Findings

01

Enhanced pose estimation accuracy across four objects

02

Outperforms state-of-the-art methods trained on CAD models

03

Enables rapid, self-supervised system setup

Abstract

In this paper, we present a novel method for self-supervised fine-tuning of pose estimation. Leveraging zero-shot pose estimation, our approach enables the robot to automatically obtain training data without manual labeling. After pose estimation the object is grasped, and in-hand pose estimation is used for data validation. Our pipeline allows the system to fine-tune while the process is running, removing the need for a learning phase. The motivation behind our work lies in the need for rapid setup of pose estimation solutions. Specifically, we address the challenging task of bin picking, which plays a pivotal role in flexible robotic setups. Our method is implemented on a robotics work-cell, and tested with four different objects. For all objects, our method increases the performance and outperforms a state-of-the-art method trained on the CAD model of the objects. Project page…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
frhag/keymatchnet_gogo_engine
model· 2 dl
2 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Human Motion and Animation