Pre- and post-contact policy decomposition for non-prehensile manipulation with zero-shot sim-to-real transfer
Minchan Kim, Junhyek Han, Jaehyung Kim, Beomjoon Kim

TL;DR
This paper introduces a deep reinforcement learning approach for non-prehensile manipulation that efficiently generalizes to new objects and environments without prior physical knowledge, enabling effective sim-to-real transfer.
Contribution
It presents a novel policy decomposition method with a specialized action space and curriculum learning for zero-shot transfer in complex manipulation tasks.
Findings
Successfully generalizes to unseen objects in real-world tests
Achieves real-time inference with neural network predictions
Outperforms traditional planning algorithms in contact-rich tasks
Abstract
We present a system for non-prehensile manipulation that require a significant number of contact mode transitions and the use of environmental contacts to successfully manipulate an object to a target location. Our method is based on deep reinforcement learning which, unlike state-of-the-art planning algorithms, does not require apriori knowledge of the physical parameters of the object or environment such as friction coefficients or centers of mass. The planning time is reduced to the simple feed-forward prediction time on a neural network. We propose a computational structure, action space design, and curriculum learning scheme that facilitates efficient exploration and sim-to-real transfer. In challenging real-world non-prehensile manipulation tasks, we show that our method can generalize over different objects, and succeed even for novel objects not seen during training. Project…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning
