# Improving Robot Success Detection using Static Object Data

**Authors:** Rosario Scalise, Jesse Thomason, Yonatan Bisk, Siddhartha Srinivasa

arXiv: 1904.01650 · 2019-08-02

## TL;DR

This paper demonstrates that incorporating static object data, such as images and descriptions, significantly enhances a robot's success detection accuracy in stacking and nesting tasks, especially with unseen objects.

## Contribution

The authors introduce a method that leverages static object information to improve success detection in robotic manipulation, achieving substantial accuracy gains over baseline models.

## Key findings

- Up to 57% improvement in success detection accuracy on unseen objects.
- Static object data reduces ambiguity caused by visual perception issues.
- Model trained on egocentric manipulation data generalizes well to new objects.

## Abstract

We use static object data to improve success detection for stacking objects on and nesting objects in one another. Such actions are necessary for certain robotics tasks, e.g., clearing a dining table or packing a warehouse bin. However, using an RGB-D camera to detect success can be insufficient: same-colored objects can be difficult to differentiate, and reflective silverware cause noisy depth camera perception. We show that adding static data about the objects themselves improves the performance of an end-to-end pipeline for classifying action outcomes. Images of the objects, and language expressions describing them, encode prior geometry, shape, and size information that refine classification accuracy. We collect over 13 hours of egocentric manipulation data for training a model to reason about whether a robot successfully placed unseen objects in or on one another. The model achieves up to a 57% absolute gain over the task baseline on pairs of previously unseen objects.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.01650/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1904.01650/full.md

## References

52 references — full list in the complete paper: https://tomesphere.com/paper/1904.01650/full.md

---
Source: https://tomesphere.com/paper/1904.01650