Effective sketching methods for value function approximation
Yangchen Pan, Erfan Sadeqi Azer, Martha White

TL;DR
This paper investigates sketching techniques to improve the computational efficiency of high-dimensional value function approximation methods in reinforcement learning, proposing a less biased approach and empirically validating its effectiveness.
Contribution
It introduces a novel sketching approach using only a left-sided sketch to reduce bias and computational cost in matrix-based RL algorithms, with empirical validation.
Findings
Sketching directly on high-dimensional features can cause bias.
Left-sided sketching reduces bias and computational cost.
Empirical results show improved efficiency across multiple domains.
Abstract
High-dimensional representations, such as radial basis function networks or tile coding, are common choices for policy evaluation in reinforcement learning. Learning with such high-dimensional representations, however, can be expensive, particularly for matrix methods, such as least-squares temporal difference learning or quasi-Newton methods that approximate matrix step-sizes. In this work, we explore the utility of sketching for these two classes of algorithms. We highlight issues with sketching the high-dimensional features directly, which can incur significant bias. As a remedy, we demonstrate how to use sketching more sparingly, with only a left-sided sketch, that can still enable significant computational gains and the use of these matrix-based learning algorithms that are less sensitive to parameters. We empirically investigate these algorithms, in four domains with a variety of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Model Reduction and Neural Networks
