Asynchronous Reinforcement Learning for Real-Time Control of Physical   Robots

Yufeng Yuan; A. Rupam Mahmood

arXiv:2203.12759·cs.RO·April 1, 2022

Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots

Yufeng Yuan, A. Rupam Mahmood

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that asynchronous reinforcement learning enables real-time control of physical robots more effectively than sequential methods, especially when learning updates are computationally expensive, leading to faster, more responsive robotic behaviors.

Contribution

The authors systematically compare sequential and asynchronous reinforcement learning on real robots, showing asynchronous methods maintain responsiveness and outperform sequential ones under costly updates.

Findings

01

Asynchronous RL maintains appropriate action cycle times under high update costs.

02

Sequential RL performance degrades with increased learning update times.

03

The system learns to reach and track visual targets from pixels within two hours on real robots.

Abstract

An oft-ignored challenge of real-world reinforcement learning is that the real world does not pause when agents make learning updates. As standard simulated environments do not address this real-time aspect of learning, most available implementations of RL algorithms process environment interactions and learning updates sequentially. As a consequence, when such implementations are deployed in the real world, they may make decisions based on significantly delayed observations and not act responsively. Asynchronous learning has been proposed to solve this issue, but no systematic comparison between sequential and asynchronous reinforcement learning was conducted using real-world environments. In this work, we set up two vision-based tasks with a robotic arm, implement an asynchronous learning system that extends a previous architecture, and compare sequential and asynchronous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yufengyuan/ur5_async_rl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Optimization and Search Problems