Fostering Intrinsic Motivation in Reinforcement Learning with Pretrained   Foundation Models

Alain Andres; Javier Del Ser

arXiv:2410.07404·cs.AI·November 26, 2024

Fostering Intrinsic Motivation in Reinforcement Learning with Pretrained Foundation Models

Alain Andres, Javier Del Ser

PDF

Open Access

TL;DR

This paper investigates how pretrained foundation models can enhance exploration and intrinsic motivation in reinforcement learning, demonstrating improved sample efficiency and faster learning in complex environments.

Contribution

It introduces the use of foundation model embeddings for intrinsic motivation and analyzes the impact of episodic novelty and full state information on exploration.

Findings

01

Foundation model embeddings can outperform agent-constructed embeddings in exploration.

02

Using full state information with intrinsic modules improves sample efficiency.

03

Episodic novelty enhances exploration effectiveness.

Abstract

Exploration remains a significant challenge in reinforcement learning, especially in environments where extrinsic rewards are sparse or non-existent. The recent rise of foundation models, such as CLIP, offers an opportunity to leverage pretrained, semantically rich embeddings that encapsulate broad and reusable knowledge. In this work we explore the potential of these foundation models not just to drive exploration, but also to analyze the critical role of the episodic novelty term in enhancing exploration effectiveness of the agent. We also investigate whether providing the intrinsic module with complete state information -- rather than just partial observations -- can improve exploration, despite the difficulties in handling small variations within large state spaces. Our experiments in the MiniGrid domain reveal that intrinsic modules can effectively utilize full state information,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsContrastive Language-Image Pre-training