Classification based Grasp Detection using Spatial Transformer Network
Dongwon Park, Se Young Chun

TL;DR
This paper introduces a novel classification-based robotic grasp detection method utilizing multi-stage spatial transformer networks, achieving state-of-the-art accuracy and real-time performance while providing intermediate observability of grasp configurations.
Contribution
The paper presents a new classification-based approach with multi-stage STN that outperforms existing methods in accuracy and offers intermediate results for grasp configurations.
Findings
Achieved state-of-the-art accuracy in grasp detection
Operates in real-time with efficient computation
Provides intermediate observations of grasp configurations
Abstract
Robotic grasp detection task is still challenging, particularly for novel objects. With the recent advance of deep learning, there have been several works on detecting robotic grasp using neural networks. Typically, regression based grasp detection methods have outperformed classification based detection methods in computation complexity with excellent accuracy. However, classification based robotic grasp detection still seems to have merits such as intermediate step observability and straightforward back propagation routine for end-to-end training. In this work, we propose a novel classification based robotic grasp detection method with multiple-stage spatial transformer networks (STN). Our proposed method was able to achieve state-of-the-art performance in accuracy with real- time computation. Additionally, unlike other regression based grasp detection methods, our proposed method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Hand Gesture Recognition Systems · Advanced Neural Network Applications
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Spatial Transformer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam
