Deep Intrinsically Motivated Exploration in Continuous Control
Baturay Saglam, Suleyman S. Kozat

TL;DR
This paper introduces a scalable intrinsic motivation-based exploration strategy for deep reinforcement learning in continuous control, improving exploration efficiency and outperforming traditional undirected methods.
Contribution
The paper adapts animal motivational theories into RL, proposing a novel directed exploration method based on maximizing value function error, suitable for continuous systems.
Findings
Significantly outperforms undirected exploration strategies.
Effective in large and diverse state spaces.
Enhances baseline performance in continuous control tasks.
Abstract
In continuous control, exploration is often performed through undirected strategies in which parameters of the networks or selected actions are perturbed by random noise. Although the deep setting of undirected exploration has been shown to improve the performance of on-policy methods, they introduce an excessive computational complexity and are known to fail in the off-policy setting. The intrinsically motivated exploration is an effective alternative to the undirected strategies, but they are usually studied for discrete action domains. In this paper, we investigate how intrinsic motivation can effectively be combined with deep reinforcement learning in the control of continuous systems to obtain a directed exploratory behavior. We adapt the existing theories on animal motivational systems into the reinforcement learning paradigm and introduce a novel and scalable directed exploration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
