A Version of Geiringer-like Theorem for Decision Making in the   Environments with Randomness and Incomplete Information

Boris Mitavskiy; Jonathan Rowe; Chris Cannings

arXiv:1110.4657·cs.AI·October 24, 2011

A Version of Geiringer-like Theorem for Decision Making in the Environments with Randomness and Incomplete Information

Boris Mitavskiy, Jonathan Rowe, Chris Cannings

PDF

Open Access

TL;DR

This paper establishes a mathematical foundation inspired by population genetics to improve Monte-Carlo sampling methods for decision-making in environments with randomness and incomplete information, enhancing AI capabilities.

Contribution

It introduces a version of Geiringer-like theorem applicable to decision processes with uncertainty, providing theoretical support for new Monte-Carlo algorithms.

Findings

01

Provides a rigorous mathematical framework for decision-making under uncertainty.

02

Lays groundwork for algorithms that handle randomness and incomplete information.

03

Enhances understanding of exploration-exploitation balance in complex environments.

Abstract

Purpose: In recent years Monte-Carlo sampling methods, such as Monte Carlo tree search, have achieved tremendous success in model free reinforcement learning. A combination of the so called upper confidence bounds policy to preserve the "exploration vs. exploitation" balance to select actions for sample evaluations together with massive computing power to store and to update dynamically a rather large pre-evaluated game tree lead to the development of software that has beaten the top human player in the game of Go on a 9 by 9 board. Much effort in the current research is devoted to widening the range of applicability of the Monte-Carlo sampling methodology to partially observable Markov decision processes with non-immediate payoffs. The main challenge introduced by randomness and incomplete information is to deal with the action evaluation at the chance nodes due to drastic differences…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Evolutionary Algorithms and Applications · Reinforcement Learning in Robotics