Neighboring State-based Exploration for Reinforcement Learning

Yu-Teng Li; Justin Lin; Jeffery Cheng; Pedro Pachuca

arXiv:2212.10712 · cs.LG · November 4, 2025

TL;DR

This paper introduces two neighboring state-based exploration algorithms for reinforcement learning, which choose exploratory actions by surveying nearby states. One of them, $\rho$-explore, outperforms a Double DQN baseline by 49% in Eval Reward Return in a discrete environment.

Contribution

The paper proposes two novel algorithms for neighboring state-based exploration; one of them, $\rho$-explore, outperforms standard Double DQN in a discrete environment.

Findings

01 $\rho$-explore outperforms Double DQN by 49% in Eval Reward Return

02 Neighboring state-based exploration improves early-stage decision-making

03 Proposed algorithms effectively leverage local state information

Abstract

Reinforcement Learning is a powerful tool to model decision-making processes. However, it relies on an exploration-exploitation trade-off that remains an open challenge for many tasks. In this work, we study neighboring state-based, model-free exploration led by the intuition that, for an early-stage agent, considering actions derived from a bounded region of nearby states may lead to better actions when exploring. We propose two algorithms that choose exploratory actions based on a survey of nearby states, and find that one of our methods, $\rho$-explore, consistently outperforms the Double DQN baseline in a discrete environment by 49% in terms of Eval Reward Return.
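
The abstract does not say how the survey of nearby states is turned into an action, so the following is only a minimal sketch of the general idea and should not be read as the authors' $\rho$-explore. It assumes a continuous state vector, a hypothetical `q_fn` that returns one Q-value per action, uniform sampling inside a box of half-width `radius`, and a majority vote over the neighbors' greedy actions. The names `neighbor_explore_action`, `radius`, `n_neighbors`, and `epsilon` are illustrative and do not come from the paper.

```python
import numpy as np


def neighbor_explore_action(q_fn, state, n_actions, rng,
                            radius=0.1, n_neighbors=8, epsilon=0.1):
    """Pick an action; on exploration steps, survey nearby states.

    Hypothetical sketch: every design choice here (uniform sampling,
    majority vote, epsilon gating) is an assumption, not the paper's method.
    """
    state = np.asarray(state, dtype=float)

    # Exploit: act greedily on the current state.
    if rng.random() >= epsilon:
        return int(np.argmax(q_fn(state)))

    # Explore: sample neighbors from a bounded region around the state.
    offsets = rng.uniform(-radius, radius, size=(n_neighbors, state.shape[0]))

    # Each neighbor votes for the action that is greedy at that neighbor.
    votes = np.zeros(n_actions, dtype=int)
    for offset in offsets:
        votes[int(np.argmax(q_fn(state + offset)))] += 1

    # Take the most-voted action (ties resolve to the lowest index).
    return int(np.argmax(votes))
```

With `epsilon=1.0` every step uses the neighbor survey, and with `epsilon=0.0` the function reduces to a purely greedy policy. Compared with uniform random exploration, this variant biases exploratory actions toward what the current Q-estimate prefers in the local region, which is one way to read the intuition stated in the abstract.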

Taxonomy

Topics: Data Stream Mining Techniques · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning

Methods: Dense Connections · Experience Replay · Q-Learning · Convolution · Double Q-learning · Deep Q-Network · Double DQN