Hierarchical Policy for Non-prehensile Multi-object Rearrangement with   Deep Reinforcement Learning and Monte Carlo Tree Search

Fan Bai; Fei Meng; Jianbang Liu; Jiankun Wang; Max Q.-H. Meng

arXiv:2109.08973·cs.RO·September 21, 2021·6 cites

Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tree Search

Fan Bai, Fei Meng, Jianbang Liu, Jiankun Wang, Max Q.-H. Meng

PDF

Open Access 1 Repo

TL;DR

This paper introduces a hierarchical reinforcement learning and Monte Carlo Tree Search approach for complex multi-object rearrangement tasks in robotics, improving success rates and efficiency over existing methods.

Contribution

It presents a novel hierarchical policy combining MCTS and deep learning for non-prehensile multi-object rearrangement, addressing planning complexity.

Findings

01

Higher success rate compared to state-of-the-art methods

02

Fewer steps and shorter paths in rearrangement tasks

03

Effective integration of imitation and reinforcement learning

Abstract

Non-prehensile multi-object rearrangement is a robotic task of planning feasible paths and transferring multiple objects to their predefined target poses without grasping. It needs to consider how each object reaches the target and the order of object movement, which significantly deepens the complexity of the problem. To address these challenges, we propose a hierarchical policy to divide and conquer for non-prehensile multi-object rearrangement. In the high-level policy, guided by a designed policy network, the Monte Carlo Tree Search efficiently searches for the optimal rearrangement sequence among multiple objects, which benefits from imitation and reinforcement. In the low-level policy, the robot plans the paths according to the order of path primitives and manipulates the objects to approach the goal poses one by one. We verify through experiments that the proposed method can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

baifanxxx/NPMO-Rearrangement
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotic Path Planning Algorithms · Reinforcement Learning in Robotics