A Matrix Splitting Perspective on Planning with Options

Pierre-Luc Bacon; Doina Precup

arXiv:1612.00916 · cs.AI · July 12, 2017 · 1 cites

TL;DR

This paper presents a novel perspective on the options framework in reinforcement learning by relating the Bellman operator to matrix splitting, revealing how options' timescales influence convergence rates and computational trade-offs.

Contribution

It introduces a matrix splitting viewpoint for the options framework, connecting convergence behavior to options' timescales and highlighting computational trade-offs.

Findings

01. Convergence rate depends on options' inherent timescales.

02. Trade-off identified between asymptotic performance and computational cost.

03. Matrix splitting perspective offers new insights into options-based planning.

Abstract

We show that the Bellman operator underlying the options framework leads to a matrix splitting, an approach traditionally used to speed up convergence of iterative solvers for large linear systems of equations. Based on standard comparison theorems for matrix splittings, we then show how the asymptotic rate of convergence varies as a function of the inherent timescales of the options. This new perspective highlights a trade-off between asymptotic performance and the cost of computation associated with building a good set of options.
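The idea of a matrix splitting can be illustrated on policy evaluation, which amounts to solving the linear system (I - γP)v = r. The sketch below is not taken from the paper: it uses plain n-step models as a simple stand-in for options with longer timescales, and the names `splitting_iteration_matrix`, `spectral_radius`, and `solve_by_splitting` are hypothetical. It assumes a splitting A = M - N with M^{-1} = I + γP + ... + (γP)^{n-1}, for which the iteration matrix M^{-1}N equals (γP)^n and its spectral radius is γ^n.

```python
import numpy as np


def splitting_iteration_matrix(P, gamma, n):
    """Return (M_inv, H) for an n-step splitting of A = I - gamma * P.

    Assumption for illustration: M_inv = sum_{k=0}^{n-1} (gamma * P)^k,
    so that H = M^{-1} N = I - M^{-1} A = (gamma * P)^n.
    """
    num_states = P.shape[0]
    A = np.eye(num_states) - gamma * P
    M_inv = np.zeros((num_states, num_states))
    power = np.eye(num_states)
    for _ in range(n):
        M_inv += power
        power = power @ (gamma * P)
    H = np.eye(num_states) - M_inv @ A
    return M_inv, H


def spectral_radius(H):
    """Largest eigenvalue magnitude; governs the asymptotic convergence rate."""
    return float(np.max(np.abs(np.linalg.eigvals(H))))


def solve_by_splitting(P, r, gamma, n, tol=1e-10, max_iters=10_000):
    """Iterate v <- H v + M_inv r until successive iterates differ by < tol."""
    M_inv, H = splitting_iteration_matrix(P, gamma, n)
    b = M_inv @ r
    v = np.zeros_like(r, dtype=float)
    for k in range(1, max_iters + 1):
        v_next = H @ v + b
        if np.max(np.abs(v_next - v)) < tol:
            return v_next, k
        v = v_next
    return v, max_iters


# Small random Markov chain (hypothetical data, fixed seed for repeatability).
rng = np.random.default_rng(0)
P = rng.random((5, 5))
P /= P.sum(axis=1, keepdims=True)  # make each row a probability distribution
r = rng.random(5)
gamma = 0.9

v_exact = np.linalg.solve(np.eye(5) - gamma * P, r)

results = {}
for n in (1, 2, 4):
    _, H = splitting_iteration_matrix(P, gamma, n)
    v, iters = solve_by_splitting(P, r, gamma, n)
    results[n] = (spectral_radius(H), iters, v)
    print(f"n={n}: spectral radius={results[n][0]:.4f}, iterations={iters}")
```

With n = 1 the scheme reduces to the usual one-step Bellman backup. Larger n shrinks the spectral radius and so needs fewer iterations, but building `M_inv` costs n matrix products up front, which mirrors the trade-off between asymptotic performance and the cost of computation described in the abstract.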

Taxonomy

Topics: Reinforcement Learning in Robotics · AI-based Problem Solving and Planning · Computability, Logic, AI Algorithms
