Fostering Intrinsic Motivation in Reinforcement Learning with Pretrained Foundation Models
Alain Andres, Javier Del Ser

TL;DR
This paper investigates how pretrained foundation models can enhance exploration and intrinsic motivation in reinforcement learning, demonstrating improved sample efficiency and faster learning in complex environments.
Contribution
It introduces the use of foundation model embeddings for intrinsic motivation and analyzes the impact of episodic novelty and full state information on exploration.
Findings
Foundation model embeddings can outperform agent-constructed embeddings in exploration.
Using full state information with intrinsic modules improves sample efficiency.
Episodic novelty enhances exploration effectiveness.
Abstract
Exploration remains a significant challenge in reinforcement learning, especially in environments where extrinsic rewards are sparse or non-existent. The recent rise of foundation models, such as CLIP, offers an opportunity to leverage pretrained, semantically rich embeddings that encapsulate broad and reusable knowledge. In this work we explore the potential of these foundation models not just to drive exploration, but also to analyze the critical role of the episodic novelty term in enhancing exploration effectiveness of the agent. We also investigate whether providing the intrinsic module with complete state information -- rather than just partial observations -- can improve exploration, despite the difficulties in handling small variations within large state spaces. Our experiments in the MiniGrid domain reveal that intrinsic modules can effectively utilize full state information,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsContrastive Language-Image Pre-training
