PLANRL: A Motion Planning and Imitation Learning Framework to Bootstrap   Reinforcement Learning

Amisha Bhaskar; Zahiruddin Mahammad; Sachin R Jadhav; Pratap; Tokekar

arXiv:2408.04054·cs.AI·October 18, 2024

PLANRL: A Motion Planning and Imitation Learning Framework to Bootstrap Reinforcement Learning

Amisha Bhaskar, Zahiruddin Mahammad, Sachin R Jadhav, Pratap, Tokekar

PDF

Open Access

TL;DR

PLANRL is a hybrid framework combining classical motion planning, reinforcement learning, and imitation data to improve robotic task performance, efficiency, and generalization in simulation and real-world environments.

Contribution

It introduces a dynamic switching framework with ModeNet, NavNet, and InteractNet, integrating RL and imitation learning for enhanced robotic manipulation.

Findings

01

Surpasses baseline success rates by 10-15% in simulation at 30k samples

02

Achieves 30-40% higher success in real-world tasks compared to baselines

03

Successfully handles complex two-stage manipulation tasks

Abstract

Reinforcement Learning (RL) has shown remarkable progress in simulation environments, yet its application to real-world robotic tasks remains limited due to challenges in exploration and generalization. To address these issues, we introduce PLANRL, a framework that chooses when the robot should use classical motion planning and when it should learn a policy. To further improve the efficiency in exploration, we use imitation data to bootstrap the exploration. PLANRL dynamically switches between two modes of operation: reaching a waypoint using classical techniques when away from the objects and reinforcement learning for fine-grained manipulation control when about to interact with objects. PLANRL architecture is composed of ModeNet for mode classification, NavNet for waypoint prediction, and InteractNet for precise manipulation. By combining the strengths of RL and Imitation Learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics