A New Optimal Stepsize For Approximate Dynamic Programming

Ilya O. Ryzhov; Peter I. Frazier; Warren B. Powell

arXiv:1407.2676·math.OC·July 15, 2014

A New Optimal Stepsize For Approximate Dynamic Programming

Ilya O. Ryzhov, Peter I. Frazier, Warren B. Powell

PDF

TL;DR

This paper introduces a new stepsize rule for approximate dynamic programming that enhances convergence speed and robustness by optimizing prediction error with minimal tuning.

Contribution

A novel stepsize rule for ADP that automatically adapts to noise levels, improving short-term performance and convergence without extensive parameter tuning.

Findings

01

Faster convergence in numerical experiments

02

Robust performance across different noise levels

03

Minimal parameter tuning required

Abstract

Approximate dynamic programming (ADP) has proven itself in a wide range of applications spanning large-scale transportation problems, health care, revenue management, and energy systems. The design of effective ADP algorithms has many dimensions, but one crucial factor is the stepsize rule used to update a value function approximation. Many operations research applications are computationally intensive, and it is important to obtain good results quickly. Furthermore, the most popular stepsize formulas use tunable parameters and can produce very poor results if tuned improperly. We derive a new stepsize rule that optimizes the prediction error in order to improve the short-term performance of an ADP algorithm. With only one, relatively insensitive tunable parameter, the new rule adapts to the level of noise in the problem and produces faster convergence in numerical experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.