Greedy Algorithms for Sparse Reinforcement Learning

Christopher Painter-Wakefield (Duke University); Ronald Parr (Duke; University)

arXiv:1206.6485·cs.LG·July 3, 2012·ICML·30 cites

Greedy Algorithms for Sparse Reinforcement Learning

Christopher Painter-Wakefield (Duke University), Ronald Parr (Duke, University)

PDF

Open Access

TL;DR

This paper explores greedy algorithms, specifically variants of orthogonal matching pursuit, for feature selection in sparse reinforcement learning, demonstrating promising theoretical guarantees and empirical performance improvements over existing methods.

Contribution

It introduces and analyzes new greedy algorithms for sparse RL, providing theoretical insights and empirical evidence of their effectiveness compared to $L_1$ regularization approaches.

Findings

01

OMP-BRM offers theoretical guarantees under certain conditions.

02

OMP-TD outperforms prior methods in accuracy and efficiency.

03

Natural sparse recovery scenarios may fail, but variants like OMP-BRM and OMP-TD show promise.

Abstract

Feature selection and regularization are becoming increasingly prominent tools in the efforts of the reinforcement learning (RL) community to expand the reach and applicability of RL. One approach to the problem of feature selection is to impose a sparsity-inducing form of regularization on the learning method. Recent work on $L_{1}$ regularization has adapted techniques from the supervised learning literature for use with RL. Another approach that has received renewed attention in the supervised learning community is that of using a simple algorithm that greedily adds new features. Such algorithms have many of the good properties of the $L_{1}$ regularization methods, while also being extremely efficient and, in some cases, allowing theoretical guarantees on recovery of the true form of a sparse target function from sampled data. This paper considers variants of orthogonal matching pursuit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEnergy Harvesting in Wireless Networks · Sparse and Compressive Sensing Techniques · Advanced MIMO Systems Optimization