A Monte Carlo Algorithm for Universally Optimal Bayesian Sequence   Prediction and Planning

Anthony Di Franco

arXiv:1001.2813·nlin.AO·January 19, 2010

A Monte Carlo Algorithm for Universally Optimal Bayesian Sequence Prediction and Planning

Anthony Di Franco

PDF

Open Access

TL;DR

This paper explores the design of rational decision-making agents using a Monte Carlo algorithm to approximate optimal Bayesian sequence prediction and planning within resource-bounded computational models.

Contribution

It introduces a Monte Carlo method for approximating universally optimal Bayesian sequence prediction and planning in resource-bounded settings, advancing AI decision-making theory.

Findings

01

Monte Carlo algorithm effectively approximates Bayesian prediction and planning.

02

Resource limitations influence the architecture of learning systems.

03

Insights into implementing near-optimal decision makers in practical AI systems.

Abstract

The aim of this work is to address the question of whether we can in principle design rational decision-making agents or artificial intelligences embedded in computable physics such that their decisions are optimal in reasonable mathematical senses. Recent developments in rare event probability estimation, recursive bayesian inference, neural networks, and probabilistic planning are sufficient to explicitly approximate reinforcement learners of the AIXI style with non-trivial model classes (here, the class of resource-bounded Turing machines). Consideration of the effects of resource limitations in a concrete implementation leads to insights about possible architectures for learning systems using optimal decision makers as components.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Computability, Logic, AI Algorithms · AI-based Problem Solving and Planning