Robust Feedback Motion Policy Design Using Reinforcement Learning on a   3D Digit Bipedal Robot

Guillermo A. Castillo; Bowen Weng; Wei Zhang; and Ayonga Hereid

arXiv:2103.15309·cs.RO·March 30, 2021

Robust Feedback Motion Policy Design Using Reinforcement Learning on a 3D Digit Bipedal Robot

Guillermo A. Castillo, Bowen Weng, Wei Zhang, and Ayonga Hereid

PDF

Open Access

TL;DR

This paper introduces a hierarchical reinforcement learning framework with feedback control for robust bipedal walking on the Digit robot, achieving successful sim-to-real transfer without extensive tuning or curriculum learning.

Contribution

The authors develop a cascade-structure controller integrating feedback regulation with reinforcement learning, enabling stable, robust walking gait transfer from simulation to hardware with minimal tuning.

Findings

01

Successful transfer of learned policy to Digit robot hardware

02

Robust walking gait under external disturbances and challenging terrains

03

Reduced sampling efficiency due to simplified state and action spaces

Abstract

In this paper, a hierarchical and robust framework for learning bipedal locomotion is presented and successfully implemented on the 3D biped robot Digit built by Agility Robotics. We propose a cascade-structure controller that combines the learning process with intuitive feedback regulations. This design allows the framework to realize robust and stable walking with a reduced-dimension state and action spaces of the policy, significantly simplifying the design and reducing the sampling efficiency of the learning method. The inclusion of feedback regulation into the framework improves the robustness of the learned walking gait and ensures the success of the sim-to-real transfer of the proposed controller with minimal tuning. We specifically present a learning pipeline that considers hardware-feasible initial poses of the robot within the learning process to ensure the initial state of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Prosthetics and Rehabilitation Robotics · Reinforcement Learning in Robotics