Plan-Guided Reinforcement Learning for Whole-Body Manipulation
Mengchao Zhang, Jose Barreiros, Aykut Ozgun Onol

TL;DR
This paper introduces a plan-guided reinforcement learning framework that combines model-based planning with RL to enable robust whole-body manipulation with minimal human supervision, demonstrated on a humanoid robot handling large objects.
Contribution
The proposed PGRL method integrates model-based plans with RL, reducing supervision and improving robustness in complex manipulation tasks.
Findings
Effective in guiding RL exploration with minimal supervision
Improves robustness of manipulation policies
Preliminary results on humanoid robot are promising
Abstract
Synthesizing complex whole-body manipulation behaviors has fundamental challenges due to the rapidly growing combinatorics inherent to contact interaction planning. While model-based methods have shown promising results in solving long-horizon manipulation tasks, they often work under strict assumptions, such as known model parameters, oracular observation of the environment state, and simplified dynamics, resulting in plans that cannot easily transfer to hardware. Learning-based approaches, such as imitation learning (IL) and reinforcement learning (RL), have been shown to be robust when operating over in-distribution states; however, they need heavy human supervision. Specifically, model-free RL requires a tedious reward-shaping process. IL methods, on the other hand, rely on human demonstrations that involve advanced teleoperation methods. In this work, we propose a plan-guided…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Muscle activation and electromyography studies · Prosthetics and Rehabilitation Robotics
