Effective sketching methods for value function approximation

Yangchen Pan; Erfan Sadeqi Azer; Martha White

arXiv:1708.01298·cs.LG·August 7, 2017·2 cites

Effective sketching methods for value function approximation

Yangchen Pan, Erfan Sadeqi Azer, Martha White

PDF

Open Access

TL;DR

This paper investigates sketching techniques to improve the computational efficiency of high-dimensional value function approximation methods in reinforcement learning, proposing a less biased approach and empirically validating its effectiveness.

Contribution

It introduces a novel sketching approach using only a left-sided sketch to reduce bias and computational cost in matrix-based RL algorithms, with empirical validation.

Findings

01

Sketching directly on high-dimensional features can cause bias.

02

Left-sided sketching reduces bias and computational cost.

03

Empirical results show improved efficiency across multiple domains.

Abstract

High-dimensional representations, such as radial basis function networks or tile coding, are common choices for policy evaluation in reinforcement learning. Learning with such high-dimensional representations, however, can be expensive, particularly for matrix methods, such as least-squares temporal difference learning or quasi-Newton methods that approximate matrix step-sizes. In this work, we explore the utility of sketching for these two classes of algorithms. We highlight issues with sketching the high-dimensional features directly, which can incur significant bias. As a remedy, we demonstrate how to use sketching more sparingly, with only a left-sided sketch, that can still enable significant computational gains and the use of these matrix-based learning algorithms that are less sensitive to parameters. We empirically investigate these algorithms, in four domains with a variety of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Model Reduction and Neural Networks