Learning a Generative Transition Model for Uncertainty-Aware Robotic   Manipulation

Lars Berscheid; Pascal Mei{\ss}ner; Torsten Kr\"oger

arXiv:2107.02464·cs.RO·July 7, 2021

Learning a Generative Transition Model for Uncertainty-Aware Robotic Manipulation

Lars Berscheid, Pascal Mei{\ss}ner, Torsten Kr\"oger

PDF

TL;DR

This paper introduces a generative transition model for robotic manipulation that predicts future states and their uncertainties, enabling faster bin-picking and optimized action planning, resulting in significant efficiency improvements.

Contribution

The paper presents a novel image-to-image transition model trained on real-world data that enhances manipulation speed and planning in robotic bin-picking tasks.

Findings

01

Increased picks per hour by around 15% using the model.

02

Achieved over 700 PPH in the YCB Box and Blocks Test.

03

Enabled planning of action sequences to minimize total actions.

Abstract

Robot learning of real-world manipulation tasks remains challenging and time consuming, even though actions are often simplified by single-step manipulation primitives. In order to compensate the removed time dependency, we additionally learn an image-to-image transition model that is able to predict a next state including its uncertainty. We apply this approach to bin picking, the task of emptying a bin using grasping as well as pre-grasping manipulation as fast as possible. The transition model is trained with up to 42000 pairs of real-world images before and after a manipulation action. Our approach enables two important skills: First, for applications with flange-mounted cameras, picks per hours (PPH) can be increased by around 15% by skipping image measurements. Second, we use the model to plan action sequences ahead of time and optimize time-dependent rewards, e.g. to minimize the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.