PPMC RL Training Algorithm: Rough Terrain Intelligent Robots through Reinforcement Learning
Tamir Blum, Kazuya Yoshida

TL;DR
This paper introduces a novel reinforcement learning-based training algorithm, PPMC, enabling robots to effectively navigate and generalize in rough, uneven terrains, demonstrated on different robot structures with high success rates.
Contribution
The paper presents the first generic training algorithm for PPMC in rough environments applicable to any robot structure using reinforcement learning.
Findings
Successfully applied to wheeled and quadruped robots
Achieved 100% success rate in generalizing to new rough terrains
Demonstrated effectiveness in highly uneven terrains
Abstract
Robots can now learn how to make decisions and control themselves, generalizing learned behaviors to unseen scenarios. In particular, AI powered robots show promise in rough environments like the lunar surface, due to the environmental uncertainties. We address this critical generalization aspect for robot locomotion in rough terrain through a training algorithm we have created called the Path Planning and Motion Control (PPMC) Training Algorithm. This algorithm is coupled with any generic reinforcement learning algorithm to teach robots how to respond to user commands and to travel to designated locations on a single neural network. In this paper, we show that the algorithm works independent of the robot structure, demonstrating that it works on a wheeled rover in addition the past results on a quadruped walking robot. Further, we take several big steps towards real world practicality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Robot Manipulation and Learning
MethodsPath Planning and Motion Control
