Non-Markov Policies to Reduce Sequential Failures in Robot Bin Picking

Kate Sanders; Michael Danielczuk; Jeffrey Mahler; Ajay Tanwani; Ken; Goldberg

arXiv:2007.10420·cs.RO·July 22, 2020·1 cites

Non-Markov Policies to Reduce Sequential Failures in Robot Bin Picking

Kate Sanders, Michael Danielczuk, Jeffrey Mahler, Ajay Tanwani, Ken, Goldberg

PDF

Open Access

TL;DR

This paper introduces non-Markov policies with memory to reduce sequential grasp failures in robot bin picking, significantly improving success rates and efficiency over traditional Markov policies.

Contribution

It characterizes a new class of objects prone to sequential failures and proposes memory-based policies that outperform Markov policies in simulation and real-world tests.

Findings

01

Non-Markov policies reduce sequential failure rates.

02

Non-Markov policies increase mean successful picks per hour by over 100%.

03

Simulation and physical experiments validate the effectiveness of the proposed methods.

Abstract

A new generation of automated bin picking systems using deep learning is evolving to support increasing demand for e-commerce. To accommodate a wide variety of products, many automated systems include multiple gripper types and/or tool changers. However, for some objects, sequential grasp failures are common: when a computed grasp fails to lift and remove the object, the bin is often left unchanged; as the sensor input is consistent, the system retries the same grasp over and over, resulting in a significant reduction in mean successful picks per hour (MPPH). Based on an empirical study of sequential failures, we characterize a class of "sequential failure objects" (SFOs) -- objects prone to sequential failures based on a novel taxonomy. We then propose three non-Markov picking policies that incorporate memory of past failures to modify subsequent actions. Simulation experiments on SFO…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Modular Robots and Swarm Intelligence