ELDEN: Exploration via Local Dependencies
Jiaheng Hu, Zizhao Wang, Peter Stone, Roberto Martin-Martin

TL;DR
ELDEN introduces a novel intrinsic reward based on local dependency uncertainty, enabling efficient exploration in environments with complex factored state spaces and chained dependencies, leading to improved policy learning.
Contribution
The paper proposes a new intrinsic reward method that models local dependencies via partial derivatives of learned dynamics, enhancing exploration in complex environments.
Findings
ELDEN accurately identifies local dependencies in various environments.
ELDEN outperforms previous exploration methods in multiple domains.
The approach effectively encourages discovery of new interactions between entities.
Abstract
Tasks with large state space and sparse rewards present a longstanding challenge to reinforcement learning. In these tasks, an agent needs to explore the state space efficiently until it finds a reward. To deal with this problem, the community has proposed to augment the reward function with intrinsic reward, a bonus signal that encourages the agent to visit interesting states. In this work, we propose a new way of defining interesting states for environments with factored state spaces and complex chained dependencies, where an agent's actions may change the value of one entity that, in order, may affect the value of another entity. Our insight is that, in these environments, interesting states for exploration are states where the agent is uncertain whether (as opposed to how) entities such as the agent or objects have some influence on each other. We present ELDEN, Exploration via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Machine Learning in Healthcare
