Discovering Self-Protective Falling Policy for Humanoid Robot via Deep Reinforcement Learning
Diyuan Shi, Shangke Lyu, Donglin Wang

TL;DR
This paper employs deep reinforcement learning with curriculum learning to enable humanoid robots to autonomously discover self-protective falling strategies, significantly reducing damage during falls and improving safety.
Contribution
It introduces a novel training approach that allows humanoid robots to learn effective falling protection behaviors without human prior, using carefully designed rewards and curriculum strategies.
Findings
Humanoid robots can learn to form a protective 'triangle' structure during falls.
The learned policy significantly reduces damage compared to existing methods.
The approach successfully transfers from simulation to real-world platforms.
Abstract
Humanoid robots have received significant research interests and advancements in recent years. Despite many successes, due to their morphology, dynamics and limitation of control policy, humanoid robots are prone to fall as compared to other embodiments like quadruped or wheeled robots. And its large weight, tall Center of Mass, high Degree-of-Freedom would cause serious hardware damages when falling uncontrolled, to both itself and surrounding objects. Existing researches in this field mostly focus on using control based methods that struggle to cater diverse falling scenarios and may introduce unsuitable human prior. On the other hand, large-scale Deep Reinforcement Learning and Curriculum Learning could be employed to incentivize humanoid agent discovering falling protection policy that fits its own nature and property. In this work, with carefully designed reward functions and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Locomotion and Control · Reinforcement Learning in Robotics · Robot Manipulation and Learning
