Deep Intrinsically Motivated Exploration in Continuous Control

Baturay Saglam; Suleyman S. Kozat

arXiv:2210.00293·cs.LG·October 4, 2022

Deep Intrinsically Motivated Exploration in Continuous Control

Baturay Saglam, Suleyman S. Kozat

PDF

Open Access 2 Repos

TL;DR

This paper introduces a scalable intrinsic motivation-based exploration strategy for deep reinforcement learning in continuous control, improving exploration efficiency and outperforming traditional undirected methods.

Contribution

The paper adapts animal motivational theories into RL, proposing a novel directed exploration method based on maximizing value function error, suitable for continuous systems.

Findings

01

Significantly outperforms undirected exploration strategies.

02

Effective in large and diverse state spaces.

03

Enhances baseline performance in continuous control tasks.

Abstract

In continuous control, exploration is often performed through undirected strategies in which parameters of the networks or selected actions are perturbed by random noise. Although the deep setting of undirected exploration has been shown to improve the performance of on-policy methods, they introduce an excessive computational complexity and are known to fail in the off-policy setting. The intrinsically motivated exploration is an effective alternative to the undirected strategies, but they are usually studied for discrete action domains. In this paper, we investigate how intrinsic motivation can effectively be combined with deep reinforcement learning in the control of continuous systems to obtain a directed exploratory behavior. We adapt the existing theories on animal motivational systems into the reinforcement learning paradigm and introduce a novel and scalable directed exploration…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics