Dynamic Subgoal-based Exploration via Bayesian Optimization

Yijia Wang; Matthias Poloczek; Daniel R. Jiang

arXiv:1910.09143 · math.OC · October 13, 2023



TL;DR

This paper introduces a cost-aware Bayesian optimization method for dynamic subgoal exploration in sparse-reward reinforcement learning, improving efficiency and performance in real-world navigation tasks.

Contribution

It proposes a novel Bayesian optimization framework that adaptively searches for effective subgoal strategies, considering exploration costs and environmental variability.

Findings

01 Outperforms existing baselines in multiple problem domains
02 Effectively balances exploration cost and reward acquisition
03 Provides a theoretical guarantee of asymptotically near-optimal subgoal design

Abstract

Reinforcement learning in sparse-reward navigation environments with expensive and limited interactions is challenging and poses a need for effective exploration. Motivated by complex navigation tasks that require real-world training (when cheap simulators are not available), we consider an agent that faces an unknown distribution of environments and must decide on an exploration strategy. It may leverage a series of training environments to improve its policy before it is evaluated in a test environment drawn from the same environment distribution. Most existing approaches focus on fixed exploration strategies, while the few that view exploration as a meta-optimization problem tend to ignore the need for cost-efficient exploration. We propose a cost-aware Bayesian optimization approach that efficiently searches over a class of dynamic subgoal-based exploration strategies. The algorithm…
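As a rough illustration of the cost-aware idea mentioned in the abstract, the sketch below runs a generic Bayesian optimization loop over a single subgoal parameter and picks the next candidate by expected improvement per unit cost. It is not the paper's algorithm: the one-dimensional search space, the Gaussian process settings, and the functions `toy_return` and `toy_cost` are all hypothetical stand-ins chosen only to make the example self-contained.

```python
import math
import numpy as np

def rbf_kernel(a, b, length_scale=0.2):
    """Squared-exponential kernel between two 1-D arrays of points."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / length_scale) ** 2)

def gp_posterior(x_obs, y_obs, x_query, noise=1e-4):
    """Posterior mean and standard deviation of a zero-mean GP at x_query."""
    K = rbf_kernel(x_obs, x_obs) + noise * np.eye(len(x_obs))
    K_s = rbf_kernel(x_obs, x_query)
    mean = K_s.T @ np.linalg.solve(K, y_obs)
    var = 1.0 - np.sum(K_s * np.linalg.solve(K, K_s), axis=0)
    return mean, np.sqrt(np.maximum(var, 1e-12))

def expected_improvement(mean, std, best):
    """Expected improvement over the incumbent value (maximization)."""
    z = (mean - best) / std
    cdf = 0.5 * (1.0 + np.vectorize(math.erf)(z / math.sqrt(2.0)))
    pdf = np.exp(-0.5 * z ** 2) / math.sqrt(2.0 * math.pi)
    return (mean - best) * cdf + std * pdf

def toy_return(x):
    """Hypothetical payoff of placing a subgoal at position x in [0, 1]."""
    return np.exp(-((x - 0.7) ** 2) / 0.02)

def toy_cost(x):
    """Hypothetical exploration cost: farther subgoals cost more to try."""
    return 1.0 + 2.0 * x

def cost_aware_search(n_iters=15, seed=0):
    """Pick subgoal candidates by expected improvement per unit cost."""
    rng = np.random.default_rng(seed)
    grid = np.linspace(0.0, 1.0, 101)
    x_obs = rng.uniform(0.0, 1.0, size=3)  # initial random designs
    y_obs = toy_return(x_obs)
    total_cost = float(np.sum(toy_cost(x_obs)))
    for _ in range(n_iters):
        mean, std = gp_posterior(x_obs, y_obs, grid)
        ei = expected_improvement(mean, std, y_obs.max())
        # Cost-aware step: divide the acquisition value by the cost.
        x_next = grid[np.argmax(ei / toy_cost(grid))]
        x_obs = np.append(x_obs, x_next)
        y_obs = np.append(y_obs, toy_return(x_next))
        total_cost += float(toy_cost(x_next))
    return x_obs[np.argmax(y_obs)], y_obs.max(), total_cost

best_x, best_y, total_cost = cost_aware_search()
```

Dividing the acquisition value by the cost is one common heuristic for trading off information against expense; the paper's actual acquisition rule and strategy class may differ.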


Code & Models

Repositories

yjwang0618/subgoal-based-exploration (official)


Taxonomy

Topics: Advanced Multi-Objective Optimization Algorithms · Robotics and Sensor-Based Localization · Reinforcement Learning in Robotics
