ISL: A novel approach for deep exploration

Lucas Cassano; Ali H. Sayed

arXiv:1909.06293·cs.LG·June 8, 2020·1 cites

ISL: A novel approach for deep exploration

Lucas Cassano, Ali H. Sayed

PDF

Open Access 1 Repo

TL;DR

The paper introduces ISL, a new deep exploration algorithm that combines regularization with RL objectives, deriving learning and exploration strategies simultaneously, and demonstrates superior performance on challenging benchmarks.

Contribution

We propose the ISL algorithm, a novel deep exploration method that jointly derives learning and exploration strategies from a well-posed optimization problem.

Findings

01

State-of-the-art performance on deep exploration benchmarks

02

Efficient deep exploration through regularized RL

03

Unified derivation of learning and exploration strategies

Abstract

In this article we explore an alternative approach to address deep exploration and we introduce the ISL algorithm, which is efficient at performing deep exploration. Similarly to maximum entropy RL, we derive the algorithm by augmenting the traditional RL objective with a novel regularization term. A distinctive feature of our approach is that, as opposed to other works that tackle the problem of deep exploration, in our derivation both the learning equations and the exploration-exploitation strategy are derived in tandem as the solution to a well-posed optimization problem whose minimization leads to the optimal value function. Empirically we show that our method exhibits state of the art performance on a range of challenging deep-exploration benchmarks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lcassano/ISL
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Machine Learning and Algorithms