Time Adaptive Reinforcement Learning

Chris Reinke

arXiv:2004.08600·cs.LG·April 21, 2020·1 cites

Time Adaptive Reinforcement Learning

Chris Reinke

PDF

Open Access

TL;DR

This paper introduces two novel model-free, value-based algorithms that enable reinforcement learning agents to adapt instantly to changing time constraints in tasks, enhancing flexibility and applicability.

Contribution

The paper proposes the first zero-shot, model-free algorithms for time adaptive reinforcement learning, broadening the scope of RL in dynamic time-restricted environments.

Findings

01

Algorithms enable instant adaptation to new time limits

02

Compatible with many existing RL methods

03

Demonstrated effectiveness in time adaptive tasks

Abstract

Reinforcement learning (RL) allows to solve complex tasks such as Go often with a stronger performance than humans. However, the learned behaviors are usually fixed to specific tasks and unable to adapt to different contexts. Here we consider the case of adapting RL agents to different time restrictions, such as finishing a task with a given time limit that might change from one task execution to the next. We define such problems as Time Adaptive Markov Decision Processes and introduce two model-free, value-based algorithms: the Independent Gamma-Ensemble and the n-Step Ensemble. In difference to classical approaches, they allow a zero-shot adaptation between different time restrictions. The proposed approaches represent general mechanisms to handle time adaptive tasks making them compatible with many existing RL methods, algorithms, and scenarios.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Smart Grid Energy Management · Evolutionary Algorithms and Applications