Feature Selection for Value Function Approximation Using Bayesian Model   Selection

Tobias Jung; Peter Stone

arXiv:1201.6615·cs.AI·February 1, 2012

Feature Selection for Value Function Approximation Using Bayesian Model Selection

Tobias Jung, Peter Stone

PDF

Open Access

TL;DR

This paper introduces a Bayesian model selection approach for feature selection in reinforcement learning, enabling automatic, scalable, and more accurate value function approximation using Gaussian processes.

Contribution

It proposes a novel method for feature selection via marginal likelihood optimization within the GPTD framework, improving scalability and prediction accuracy in RL.

Findings

01

Automatic feature selection from sample transitions

02

Enhanced computational efficiency through subspace identification

03

Improved value function approximation accuracy

Abstract

Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of the main challenges in scaling RL to real-world applications. Here we consider the Gaussian process based framework GPTD for approximate policy evaluation, and propose feature selection through marginal likelihood optimization of the associated hyperparameters. Our approach has two appealing benefits: (1) given just sample transitions, we can solve the policy evaluation problem fully automatically (without looking at the learning task, and, in theory, independent of the dimensionality of the state space), and (2) model selection allows us to consider more sophisticated kernels, which in turn enable us to identify relevant subspaces and eliminate irrelevant state variables such that we can achieve substantial computational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms · Reinforcement Learning in Robotics