A Concentration Bound for LSPE($\lambda$)

Siddharth Chandak; Vivek S. Borkar; Harsh Dolhare

arXiv:2111.02644·cs.LG·December 1, 2022

A Concentration Bound for LSPE($\lambda$)

Siddharth Chandak, Vivek S. Borkar, Harsh Dolhare

PDF

Open Access

TL;DR

This paper derives a concentration bound for the LSPE(λ) algorithm, providing high probability performance guarantees for policy evaluation over time.

Contribution

It introduces a novel concentration bound for LSPE(λ), enhancing theoretical understanding of its performance guarantees.

Findings

01

High probability performance guarantees established

02

Concentration bound derived for LSPE(λ)

03

Theoretical insights into LSPE(λ) behavior

Abstract

The popular LSPE( $λ$ ) algorithm for policy evaluation is revisited to derive a concentration bound that gives high probability performance guarantees from some time on.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Bayesian Modeling and Causal Inference · Water resources management and optimization