Accessibility-Based Clustering for Efficient Learning of Locomotion   Skills

Chong Zhang; Wanming Yu; Zhibin Li

arXiv:2109.11191·cs.RO·March 2, 2022

Accessibility-Based Clustering for Efficient Learning of Locomotion Skills

Chong Zhang, Wanming Yu, Zhibin Li

PDF

Open Access

TL;DR

This paper introduces the K-Access algorithm that uses accessibility metrics for automatic state-space clustering, significantly improving data efficiency and robustness in quadruped locomotion learning, especially for fall recovery and complex skills.

Contribution

The novel K-Access clustering method automatically discovers static-pose centroids to enhance initial state selection, boosting learning efficiency and robustness in model-free deep reinforcement learning.

Findings

01

Faster convergence, requiring only 60% of training episodes.

02

Achieved 99.4% success rate in fall recovery within 3 seconds.

03

Successfully generalized to skills like backflipping.

Abstract

For model-free deep reinforcement learning of quadruped locomotion, the initialization of robot configurations is crucial for data efficiency and robustness. This work focuses on algorithmic improvements of data efficiency and robustness simultaneously through automatic discovery of initial states, which is achieved by our proposed K-Access algorithm based on accessibility metrics. Specifically, we formulated accessibility metrics to measure the difficulty of transitions between two arbitrary states, and proposed a novel K-Access algorithm for state-space clustering that automatically discovers the centroids of the static-pose clusters based on the accessibility metrics. By using the discovered centroidal static poses as the initial states, we can improve data efficiency by reducing redundant explorations, and enhance the robustness by more effective explorations from the centroids to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Prosthetics and Rehabilitation Robotics · Reinforcement Learning in Robotics