MPGNet: Learning Move-Push-Grasping Synergy for Target-Oriented Grasping   in Occluded Scenes

Dayou Li; Chenkun Zhao; Shuo Yang; Ran Song; Xiaolei Li; Wei Zhang

arXiv:2408.10525·cs.RO·August 21, 2024

MPGNet: Learning Move-Push-Grasping Synergy for Target-Oriented Grasping in Occluded Scenes

Dayou Li, Chenkun Zhao, Shuo Yang, Ran Song, Xiaolei Li, Wei Zhang

PDF

Open Access

TL;DR

This paper introduces MPGNet, a three-branch neural network that learns to synergize moving, pushing, and grasping actions for efficient target-oriented grasping in occluded scenes, validated through simulations and real-world tests.

Contribution

The paper presents MPGNet, a novel multi-branch network with a multi-stage training strategy for integrated move-push-grasping in occluded environments, advancing target-oriented robotic manipulation.

Findings

01

Effective in occluded scenes with minimal manipulations

02

Outperforms existing methods in simulated and real-world tests

03

Demonstrates robust synergy between actions

Abstract

This paper focuses on target-oriented grasping in occluded scenes, where the target object is specified by a binary mask and the goal is to grasp the target object with as few robotic manipulations as possible. Most existing methods rely on a push-grasping synergy to complete this task. To deliver a more powerful target-oriented grasping pipeline, we present MPGNet, a three-branch network for learning a synergy between moving, pushing, and grasping actions. We also propose a multi-stage training strategy to train the MPGNet which contains three policy networks corresponding to the three actions. The effectiveness of our method is demonstrated via both simulated and real-world experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Multimodal Machine Learning Applications