Batch-iFDD for Representation Expansion in Large MDPs

Alborz Geramifard; Thomas J. Walsh; Nicholas Roy; Jonathan How

arXiv:1309.6831·cs.LG·September 27, 2013·2 cites

Batch-iFDD for Representation Expansion in Large MDPs

Alborz Geramifard, Thomas J. Walsh, Nicholas Roy, Jonathan How

PDF

Open Access

TL;DR

This paper introduces Batch-iFDD, a scalable feature construction method for large MDPs that improves upon existing matching pursuit algorithms by reducing the need for large feature pools and ensuring convergence.

Contribution

Batch-iFDD is a new MP-based feature discovery algorithm that guarantees convergence and scales efficiently without requiring extensive feature pools.

Findings

01

Outperforms previous MP algorithms in large-scale domains

02

Demonstrates scalability with up to one million states

03

Achieves effective value function approximation

Abstract

Matching pursuit (MP) methods are a promising class of feature construction algorithms for value function approximation. Yet existing MP methods require creating a pool of potential features, mandating expert knowledge or enumeration of a large feature pool, both of which hinder scalability. This paper introduces batch incremental feature dependency discovery (Batch-iFDD) as an MP method that inherits a provable convergence property. Additionally, Batch-iFDD does not require a large pool of features, leading to lower computational complexity. Empirical policy evaluation results across three domains with up to one million states highlight the scalability of Batch-iFDD over the previous state of the art MP algorithm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Optimization and Search Problems