Self-Supervised Learning of Grasping Arbitrary Objects On-the-Move

Takuya Kiyokawa; Eiki Nagata; Yoshihisa Tsurumine; Yuhwan Kwon,; Takamitsu Matsubara

arXiv:2411.09904·cs.RO·November 18, 2024

Self-Supervised Learning of Grasping Arbitrary Objects On-the-Move

Takuya Kiyokawa, Eiki Nagata, Yoshihisa Tsurumine, Yuhwan Kwon,, Takamitsu Matsubara

PDF

Open Access

TL;DR

This paper presents a self-supervised learning framework enabling mobile robots to grasp arbitrary objects on-the-move by predicting grasp primitives and movement adjustments from visual data, improving accuracy and generalization.

Contribution

It introduces a novel two-stage learning approach with three FCN models for mobile grasping, simplifying complex actions into primitives for better learning and generalization.

Findings

01

Achieved highest grasping accuracy in experiments.

02

Enhanced pick-and-place efficiency with the proposed method.

03

Demonstrated effective generalization through simulation with varied objects.

Abstract

Mobile grasping enhances manipulation efficiency by utilizing robots' mobility. This study aims to enable a commercial off-the-shelf robot for mobile grasping, requiring precise timing and pose adjustments. Self-supervised learning can develop a generalizable policy to adjust the robot's velocity and determine grasp position and orientation based on the target object's shape and pose. Due to mobile grasping's complexity, action primitivization and step-by-step learning are crucial to avoid data sparsity in learning from trial and error. This study simplifies mobile grasping into two grasp action primitives and a moving action primitive, which can be operated with limited degrees of freedom for the manipulator. This study introduces three fully convolutional neural network (FCN) models to predict static grasp primitive, dynamic grasp primitive, and residual moving velocity error from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Robot Manipulation and Learning · Image Processing and 3D Reconstruction

MethodsConvolution · Max Pooling · Fully Convolutional Network