PreND: Enhancing Intrinsic Motivation in Reinforcement Learning through   Pre-trained Network Distillation

Mohammadamin Davoodabadi; Negin Hashemi Dijujin; Mahdieh Soleymani; Baghshah

arXiv:2410.01745·cs.LG·October 3, 2024

PreND: Enhancing Intrinsic Motivation in Reinforcement Learning through Pre-trained Network Distillation

Mohammadamin Davoodabadi, Negin Hashemi Dijujin, Mahdieh Soleymani, Baghshah

PDF

Open Access

TL;DR

PreND introduces a novel method that leverages pre-trained models to improve intrinsic motivation in reinforcement learning, leading to better exploration and performance in sparse reward settings.

Contribution

The paper proposes PreND, a new approach that incorporates pre-trained representations into intrinsic motivation mechanisms, outperforming existing methods like RND in RL tasks.

Findings

01

PreND significantly outperforms RND in Atari experiments.

02

PreND provides more stable and meaningful intrinsic rewards.

03

Enhanced exploration and sample efficiency in sparse reward environments.

Abstract

Intrinsic motivation, inspired by the psychology of developmental learning in infants, stimulates exploration in agents without relying solely on sparse external rewards. Existing methods in reinforcement learning like Random Network Distillation (RND) face significant limitations, including (1) relying on raw visual inputs, leading to a lack of meaningful representations, (2) the inability to build a robust latent space, (3) poor target network initialization and (4) rapid degradation of intrinsic rewards. In this paper, we introduce Pre-trained Network Distillation (PreND), a novel approach to enhance intrinsic motivation in reinforcement learning (RL) by improving upon the widely used prediction-based method, RND. PreND addresses these challenges by incorporating pre-trained representation models into both the target and predictor networks, resulting in more meaningful and stable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIoT and Edge/Fog Computing