Teaching a Robot to Walk Using Reinforcement Learning

Jack Dibachi; Jacob Azoulay

arXiv:2112.07031·cs.LG·December 15, 2021

Teaching a Robot to Walk Using Reinforcement Learning

Jack Dibachi, Jacob Azoulay

PDF

Open Access

TL;DR

This paper explores reinforcement learning techniques, specifically deep Q-learning and augmented random search, to teach a simulated bipedal robot to walk, demonstrating ARS's superior performance in solving complex locomotion tasks.

Contribution

The study compares deep Q-learning and ARS for robotic walking, showing ARS's effectiveness in achieving optimal policies in a complex simulation environment.

Findings

01

ARS successfully solves the BipedalWalker-v3 problem.

02

Deep Q-learning often converges prematurely to suboptimal policies.

03

Naive policies serve as benchmarks for evaluating learning algorithms.

Abstract

Classical control techniques such as PID and LQR have been used effectively in maintaining a system state, but these techniques become more difficult to implement when the model dynamics increase in complexity and sensitivity. For adaptive robotic locomotion tasks with several degrees of freedom, this task becomes infeasible with classical control techniques. Instead, reinforcement learning can train optimal walking policies with ease. We apply deep Q-learning and augmented random search (ARS) to teach a simulated two-dimensional bipedal robot how to walk using the OpenAI Gym BipedalWalker-v3 environment. Deep Q-learning did not yield a high reward policy, often prematurely converging to suboptimal local maxima likely due to the coarsely discretized action space. ARS, however, resulted in a better trained robot, and produced an optimal policy which officially "solves" the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Reinforcement Learning in Robotics

MethodsQ-Learning · Random Search