Optimistic Simulated Exploration as an Incentive for Real Exploration

Ivo Danihelka

arXiv:0903.2972·cs.LG·May 20, 2009

Optimistic Simulated Exploration as an Incentive for Real Exploration

Ivo Danihelka

PDF

Open Access

TL;DR

This paper introduces a method that uses optimistic simulated exploration to identify promising paths, reducing the need for extensive real-world exploration in environments with many states.

Contribution

It proposes a novel approach combining optimistic simulated exploration with real exploration to improve efficiency in environments with unlimited states.

Findings

01

Reduces real exploration needs significantly

02

Effective in environments with large or infinite state spaces

03

Improves exploration efficiency over traditional methods

Abstract

Many reinforcement learning exploration techniques are overly optimistic and try to explore every state. Such exploration is impossible in environments with the unlimited number of states. I propose to use simulated exploration with an optimistic model to discover promising paths for real exploration. This reduces the needs for the real exploration.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReservoir Engineering and Simulation Methods · Distributed and Parallel Computing Systems · Simulation Techniques and Applications