# GQ-STN: Optimizing One-Shot Grasp Detection based on Robustness   Classifier

**Authors:** Alexandre Gari\'epy, Jean-Christophe Ruel, Brahim Chaib-draa and, Philippe Gigu\`ere

arXiv: 1903.02489 · 2019-08-02

## TL;DR

GQ-STN is a real-time, one-shot grasp detection network that uses a robustness classifier for training and evaluation, achieving high accuracy and speed in robotic grasping tasks.

## Contribution

The paper introduces GQ-STN, a novel one-shot grasp detection network that incorporates a robustness classifier for efficient training and improved grasp quality assessment.

## Key findings

- 92.4% accuracy on Dex-Net 2.0 dataset
- More than 60 times faster than previous methods
- Detects more robust grasps in physical benchmarks

## Abstract

Grasping is a fundamental robotic task needed for the deployment of household robots or furthering warehouse automation. However, few approaches are able to perform grasp detection in real time (frame rate). To this effect, we present Grasp Quality Spatial Transformer Network (GQ-STN), a one-shot grasp detection network. Being based on the Spatial Transformer Network (STN), it produces not only a grasp configuration, but also directly outputs a depth image centered at this configuration. By connecting our architecture to an externally-trained grasp robustness evaluation network, we can train efficiently to satisfy a robustness metric via the backpropagation of the gradient emanating from the evaluation network. This removes the difficulty of training detection networks on sparsely annotated databases, a common issue in grasping. We further propose to use this robustness classifier to compare approaches, being more reliable than the traditional rectangle metric. Our GQ-STN is able to detect robust grasps on the depth images of the Dex-Net 2.0 dataset with 92.4 % accuracy in a single pass of the network. We finally demonstrate in a physical benchmark that our method can propose robust grasps more often than previous sampling-based methods, while being more than 60 times faster.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.02489/full.md

## Figures

17 figures with captions in the complete paper: https://tomesphere.com/paper/1903.02489/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/1903.02489/full.md

---
Source: https://tomesphere.com/paper/1903.02489