CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous   Control

Xiang Zheng; Xingjun Ma; Cong Wang

arXiv:2211.15205·cs.LG·May 19, 2023

CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control

Xiang Zheng, Xingjun Ma, Cong Wang

PDF

Open Access

TL;DR

This paper introduces CIM, a novel approach that uses constrained intrinsic motivation and Lagrangian methods to improve exploration and sample efficiency in sparse-reward continuous control tasks.

Contribution

The paper proposes CIM, a new method that leverages task priors and adaptive balancing to enhance exploration in reinforcement learning with sparse rewards.

Findings

01

CIM outperforms state-of-the-art methods on multiple tasks.

02

CIM improves sample efficiency significantly.

03

Key techniques can enhance existing algorithms.

Abstract

Intrinsic motivation is a promising exploration technique for solving reinforcement learning tasks with sparse or absent extrinsic rewards. There exist two technical challenges in implementing intrinsic motivation: 1) how to design a proper intrinsic objective to facilitate efficient exploration; and 2) how to combine the intrinsic objective with the extrinsic objective to help find better solutions. In the current literature, the intrinsic objectives are all designed in a task-agnostic manner and combined with the extrinsic objective via simple addition (or used by itself for reward-free pre-training). In this work, we show that these designs would fail in typical sparse-reward continuous control tasks. To address the problem, we propose Constrained Intrinsic Motivation (CIM) to leverage readily attainable task priors to construct a constrained intrinsic objective, and at the same…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural and Behavioral Psychology Studies

Methodsfail