Efficient Object Manipulation Planning with Monte Carlo Tree Search

Huaijiang Zhu; Avadesh Meduri; Ludovic Righetti

arXiv:2206.09023·cs.RO·March 21, 2023

Efficient Object Manipulation Planning with Monte Carlo Tree Search

Huaijiang Zhu, Avadesh Meduri, Ludovic Righetti

PDF

Open Access

TL;DR

This paper introduces an efficient object manipulation planning method combining Monte Carlo Tree Search with learned policy-value networks and heuristics, demonstrating improved success rates in simulations and real hardware.

Contribution

It presents a novel integration of MCTS with learned policies and heuristics for scalable and efficient object manipulation planning.

Findings

01

Significantly improves planning success rate.

02

Scales favorably for long manipulation sequences.

03

Demonstrates effectiveness in both simulation and real hardware.

Abstract

This paper presents an efficient approach to object manipulation planning using Monte Carlo Tree Search (MCTS) to find contact sequences and an efficient ADMM-based trajectory optimization algorithm to evaluate the dynamic feasibility of candidate contact sequences. To accelerate MCTS, we propose a methodology to learn a goal-conditioned policy-value network to direct the search towards promising nodes. Further, manipulation-specific heuristics enable to drastically reduce the search space. Systematic object manipulation experiments in a physics simulator and on real hardware demonstrate the efficiency of our approach. In particular, our approach scales favorably for long manipulation sequences thanks to the learned policy-value network, significantly improving planning success rate.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning