Time Adaptive Reinforcement Learning
Chris Reinke

TL;DR
This paper introduces two novel model-free, value-based algorithms that enable reinforcement learning agents to adapt instantly to changing time constraints in tasks, enhancing flexibility and applicability.
Contribution
The paper proposes the first zero-shot, model-free algorithms for time adaptive reinforcement learning, broadening the scope of RL in dynamic time-restricted environments.
Findings
Algorithms enable instant adaptation to new time limits
Compatible with many existing RL methods
Demonstrated effectiveness in time adaptive tasks
Abstract
Reinforcement learning (RL) allows to solve complex tasks such as Go often with a stronger performance than humans. However, the learned behaviors are usually fixed to specific tasks and unable to adapt to different contexts. Here we consider the case of adapting RL agents to different time restrictions, such as finishing a task with a given time limit that might change from one task execution to the next. We define such problems as Time Adaptive Markov Decision Processes and introduce two model-free, value-based algorithms: the Independent Gamma-Ensemble and the n-Step Ensemble. In difference to classical approaches, they allow a zero-shot adaptation between different time restrictions. The proposed approaches represent general mechanisms to handle time adaptive tasks making them compatible with many existing RL methods, algorithms, and scenarios.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Smart Grid Energy Management · Evolutionary Algorithms and Applications
