Run, skeleton, run: skeletal model in a physics-based simulation

Mikhail Pavlov; Sergey Kolesnikov; Sergey M. Plis

arXiv:1711.06922·cs.AI·January 30, 2018·6 cites

Run, skeleton, run: skeletal model in a physics-based simulation

Mikhail Pavlov, Sergey Kolesnikov, Sergey M. Plis

PDF

Open Access 1 Repo

TL;DR

This paper develops a physics-based reinforcement learning approach to train a human skeletal model to navigate obstacle courses efficiently, demonstrating improved stability and generalization across scenarios.

Contribution

The paper benchmarks policy-gradient methods for complex physics-based tasks and introduces stabilization techniques, with Deep Deterministic Policy Gradient proving most effective.

Findings

01

Deep Deterministic Policy Gradient outperforms other methods

02

Training stabilization techniques improve sample efficiency

03

Models generalize to new physical obstacle scenarios

Abstract

In this paper, we present our approach to solve a physics-based reinforcement learning challenge "Learning to Run" with objective to train physiologically-based human model to navigate a complex obstacle course as quickly as possible. The environment is computationally expensive, has a high-dimensional continuous action space and is stochastic. We benchmark state of the art policy-gradient methods and test several improvements, such as layer normalization, parameter noise, action and state reflecting, to stabilize training and improve its sample-efficiency. We found that the Deep Deterministic Policy Gradient method is the most efficient method for this environment and the improvements we have introduced help to stabilize training. Learned models are able to generalize to new physical scenarios, e.g. different obstacle courses.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Scitator/Run-Skeleton-Run
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Human Motion and Animation