Faster Reinforcement Learning Using Active Simulators

Vikas Jain; Theja Tulabandhula

arXiv:1703.07853·cs.LG·November 22, 2017·2 cites

Faster Reinforcement Learning Using Active Simulators

Vikas Jain, Theja Tulabandhula

PDF

Open Access 1 Repo

TL;DR

This paper introduces online methods to construct effective training curricula from multiple tasks, significantly accelerating reinforcement learning by optimizing task sequences and leveraging domain knowledge.

Contribution

It presents novel online curriculum construction methods that improve RL training efficiency and are adaptable across different algorithms and domains.

Findings

01

Methods reduce total training time compared to training on target task alone.

02

Curricula tailored with domain knowledge further enhance learning speed.

03

Applicable across multiple domains and RL algorithms.

Abstract

In this work, we propose several online methods to build a \emph{learning curriculum} from a given set of target-task-specific training tasks in order to speed up reinforcement learning (RL). These methods can decrease the total training time needed by an RL agent compared to training on the target task from scratch. Unlike traditional transfer learning, we consider creating a sequence from several training tasks in order to provide the most benefit in terms of reducing the total time to train. Our methods utilize the learning trajectory of the agent on the curriculum tasks seen so far to decide which tasks to train on next. An attractive feature of our methods is that they are weakly coupled to the choice of the RL algorithm as well as the transfer learning method. Further, when there is domain information available, our methods can incorporate such knowledge to further speed up the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thejat/active-curricula-for-efficient-reinforcement-learning
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Advanced Bandit Algorithms Research

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings