Feature Dynamic Bayesian Networks

Marcus Hutter

arXiv:0812.4581·cs.AI·December 30, 2009

Feature Dynamic Bayesian Networks

Marcus Hutter

PDF

Open Access

TL;DR

This paper introduces PhiDBN, an extension of PhiMDPs using Dynamic Bayesian Networks, with a new cost criterion for automatic feature extraction to improve learning in complex environments.

Contribution

It develops a cost-based method for automatic feature selection in structured MDPs, enabling more effective modeling of large-scale environments.

Findings

01

Derived a cost criterion for feature relevance

02

Proposed a complete learning algorithm framework

03

Enhanced environment modeling with PhiDBN

Abstract

Feature Markov Decision Processes (PhiMDPs) are well-suited for learning agents in general environments. Nevertheless, unstructured (Phi)MDPs are limited to relatively simple environments. Structured MDPs like Dynamic Bayesian Networks (DBNs) are used for large-scale real-world problems. In this article I extend PhiMDP to PhiDBN. The primary contribution is to derive a cost criterion that allows to automatically extract the most relevant features from the environment, leading to the "best" DBN representation. I discuss all building blocks required for a complete general learning algorithm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Reinforcement Learning in Robotics · Data Stream Mining Techniques