An Error Bound for Aggregation in Approximate Dynamic Programming

Yuchao Li; Dimitri Bertsekas

arXiv:2507.01324·math.OC·May 6, 2026

An Error Bound for Aggregation in Approximate Dynamic Programming

Yuchao Li, Dimitri Bertsekas

PDF

TL;DR

This paper derives a broad error bound for aggregation methods in approximate dynamic programming, applicable to various aggregation schemes, aiding the analysis of RL algorithms.

Contribution

It extends a known error bound to more general aggregation schemes in DP, including soft and feature-based aggregation.

Findings

01

Derived a general error bound for aggregation in DP.

02

Bound applies to soft and feature-based aggregation schemes.

03

Extends previous bounds from hard aggregation to broader cases.

Abstract

We consider a general aggregation framework for discounted finite-state infinite horizon dynamic programming (DP) problems. It defines an aggregate problem whose optimal cost function can be obtained off-line by exact DP and then used as a terminal cost approximation for an on-line reinforcement learning (RL) scheme. We derive a bound on the error between the optimal cost functions of the aggregate problem and the original problem. This bound was first derived by Tsitsiklis and van Roy [TvR96] for the special case of hard aggregation. Our bound is similar but applies far more broadly, including to soft aggregation and feature-based aggregation schemes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.