Preference-Based Learning for User-Guided HZD Gait Generation on Bipedal   Walking Robots

Maegan Tucker; Noel Csomay-Shanklin; Wen-Loong Ma; and Aaron D. Ames

arXiv:2011.05424·cs.RO·March 31, 2021

Preference-Based Learning for User-Guided HZD Gait Generation on Bipedal Walking Robots

Maegan Tucker, Noel Csomay-Shanklin, Wen-Loong Ma, and Aaron D. Ames

PDF

1 Repo

TL;DR

This paper introduces a novel framework combining control theory and preference-based learning to generate stable, robust, and natural bipedal walking gaits on robots without manual tuning or simulation reliance.

Contribution

It presents a new approach that integrates hybrid zero dynamics optimization with human preference-based learning, eliminating the need for carefully crafted reward functions.

Findings

01

Achieved stable walking in fewer than 50 iterations

02

Demonstrated robustness with added model uncertainty

03

Generated natural gait without simulation or manual tuning

Abstract

This paper presents a framework that leverages both control theory and machine learning to obtain stable and robust bipedal locomotion without the need for manual parameter tuning. Traditionally, gaits are generated through trajectory optimization methods and then realized experimentally -- a process that often requires extensive tuning due to differences between the models and hardware. In this work, the process of gait realization via hybrid zero dynamics (HZD) based optimization is formally combined with preference-based learning to systematically realize dynamically stable walking. Importantly, this learning approach does not require a carefully constructed reward function, but instead utilizes human pairwise preferences. The power of the proposed approach is demonstrated through two experiments on a planar biped AMBER-3M: the first with rigid point-feet, and the second with induced…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maegant/ICRA2021-LearningHZD
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.