Self-Supervised Learning of Grasping Arbitrary Objects On-the-Move
Takuya Kiyokawa, Eiki Nagata, Yoshihisa Tsurumine, Yuhwan Kwon,, Takamitsu Matsubara

TL;DR
This paper presents a self-supervised learning framework enabling mobile robots to grasp arbitrary objects on-the-move by predicting grasp primitives and movement adjustments from visual data, improving accuracy and generalization.
Contribution
It introduces a novel two-stage learning approach with three FCN models for mobile grasping, simplifying complex actions into primitives for better learning and generalization.
Findings
Achieved highest grasping accuracy in experiments.
Enhanced pick-and-place efficiency with the proposed method.
Demonstrated effective generalization through simulation with varied objects.
Abstract
Mobile grasping enhances manipulation efficiency by utilizing robots' mobility. This study aims to enable a commercial off-the-shelf robot for mobile grasping, requiring precise timing and pose adjustments. Self-supervised learning can develop a generalizable policy to adjust the robot's velocity and determine grasp position and orientation based on the target object's shape and pose. Due to mobile grasping's complexity, action primitivization and step-by-step learning are crucial to avoid data sparsity in learning from trial and error. This study simplifies mobile grasping into two grasp action primitives and a moving action primitive, which can be operated with limited degrees of freedom for the manipulator. This study introduces three fully convolutional neural network (FCN) models to predict static grasp primitive, dynamic grasp primitive, and residual moving velocity error from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Robot Manipulation and Learning · Image Processing and 3D Reconstruction
MethodsConvolution · Max Pooling · Fully Convolutional Network
