Reactive Reinforcement Learning in Asynchronous Environments

Jaden B. Travnik; Kory W. Mathewson; Richard S. Sutton; Patrick M.; Pilarski

arXiv:1802.06139·cs.AI·June 29, 2018

Reactive Reinforcement Learning in Asynchronous Environments

Jaden B. Travnik, Kory W. Mathewson, Richard S. Sutton, Patrick M., Pilarski

PDF

TL;DR

This paper introduces reactive reinforcement learning algorithms designed for asynchronous environments, enabling agents to act immediately after observation, thereby reducing reaction time and improving safety in robotic tasks.

Contribution

The paper proposes a new class of reactive RL algorithms that act instantly after observations, addressing asynchronous environment challenges and enhancing response times.

Findings

01

Reactive SARSA reduces reaction time by approximately the duration of the learning update.

02

Reactive algorithms improve safety and decision speed in robotic tasks.

03

Comparison shows reactive RL outperforms conventional methods in asynchronous settings.

Abstract

The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision Processes (SMDP), do not capture the fact that, in an asynchronous environment, the state of the environment may change during computation performed by the agent. In an asynchronous environment, minimizing reaction time---the time it takes for an agent to react to an observation---also minimizes the time in which the state of the environment may change following observation. In many environments, the reaction time of an agent directly impacts task performance by permitting the environment to transition into either an undesirable terminal state or a state where performing the chosen action is inappropriate. We propose a class of reactive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSarsa