A Direct Approximation of AIXI Using Logical State Abstractions

Samuel Yang-Zhao; Tianyu Wang; Kee Siong Ng

arXiv:2210.06917·cs.AI·October 14, 2022·1 cites

A Direct Approximation of AIXI Using Logical State Abstractions

Samuel Yang-Zhao, Tianyu Wang, Kee Siong Ng

PDF

Open Access 1 Video

TL;DR

This paper introduces a practical method to approximate AIXI using logical state abstractions, enabling reinforcement learning in complex, structured, and history-dependent environments with improved model selection and Bayesian learning.

Contribution

It combines logical state abstraction with AIXI, using higher-order logic and a generalized Context Tree Weighting for Bayesian model learning, expanding AIXI's applicability to complex environments.

Findings

01

Validated on epidemic control in large contact networks

02

Demonstrated effective state abstraction and model selection

03

Achieved scalable Bayesian model learning

Abstract

We propose a practical integration of logical state abstraction with AIXI, a Bayesian optimality notion for reinforcement learning agents, to significantly expand the model class that AIXI agents can be approximated over to complex history-dependent and structured environments. The state representation and reasoning framework is based on higher-order logic, which can be used to define and enumerate complex features on non-Markovian and structured environments. We address the problem of selecting the right subset of features to form state abstractions by adapting the $Φ$ -MDP optimisation criterion from state abstraction theory. Exact Bayesian model learning is then achieved using a suitable generalisation of Context Tree Weighting over abstract state sequences. The resultant architecture can be integrated with different planning algorithms. Experimental results on controlling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Direct Approximation of AIXI Using Logical State Abstractions· slideslive

Taxonomy

TopicsBayesian Modeling and Causal Inference · Reinforcement Learning in Robotics · Data Stream Mining Techniques