An Evaluation Study of Intrinsic Motivation Techniques applied to Reinforcement Learning over Hard Exploration Environments
Alain Andres, Esther Villar-Rodriguez, Javier Del Ser

TL;DR
This study evaluates how different design choices and parameters affect the performance of intrinsic motivation techniques in reinforcement learning environments with sparse rewards, emphasizing the need for fair comparisons.
Contribution
It highlights the impact of design variability on intrinsic motivation methods and advocates for standardized evaluation to ensure fair comparisons across techniques.
Findings
Design choices significantly influence intrinsic motivation effectiveness.
Variability in algorithms and architectures affects performance comparisons.
Careful parameter selection is crucial for reliable evaluation.
Abstract
In the last few years, the research activity around reinforcement learning tasks formulated over environments with sparse rewards has been especially notable. Among the numerous approaches proposed to deal with these hard exploration problems, intrinsic motivation mechanisms are arguably among the most studied alternatives to date. Advances reported in this area over time have tackled the exploration issue by proposing new algorithmic ideas to generate alternative mechanisms to measure the novelty. However, most efforts in this direction have overlooked the influence of different design choices and parameter settings that have also been introduced to improve the effect of the generated intrinsic bonus, forgetting the application of those choices to other intrinsic motivation techniques that may also benefit of them. Furthermore, some of those intrinsic methods are applied with different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsEntropy Regularization · Proximal Policy Optimization · Balanced Selection
