CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control
Xiang Zheng, Xingjun Ma, Cong Wang

TL;DR
This paper introduces CIM, a novel approach that uses constrained intrinsic motivation and Lagrangian methods to improve exploration and sample efficiency in sparse-reward continuous control tasks.
Contribution
The paper proposes CIM, a new method that leverages task priors and adaptive balancing to enhance exploration in reinforcement learning with sparse rewards.
Findings
CIM outperforms state-of-the-art methods on multiple tasks.
CIM improves sample efficiency significantly.
Key techniques can enhance existing algorithms.
Abstract
Intrinsic motivation is a promising exploration technique for solving reinforcement learning tasks with sparse or absent extrinsic rewards. There exist two technical challenges in implementing intrinsic motivation: 1) how to design a proper intrinsic objective to facilitate efficient exploration; and 2) how to combine the intrinsic objective with the extrinsic objective to help find better solutions. In the current literature, the intrinsic objectives are all designed in a task-agnostic manner and combined with the extrinsic objective via simple addition (or used by itself for reward-free pre-training). In this work, we show that these designs would fail in typical sparse-reward continuous control tasks. To address the problem, we propose Constrained Intrinsic Motivation (CIM) to leverage readily attainable task priors to construct a constrained intrinsic objective, and at the same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Neural and Behavioral Psychology Studies
Methodsfail
