Random-Sampling Monte-Carlo Tree Search Methods for Cost Approximation   in Long-Horizon Optimal Control

Shankarachary Ragi; Hans D. Mittelmann

arXiv:2009.07354·eess.SY·September 17, 2020

Random-Sampling Monte-Carlo Tree Search Methods for Cost Approximation in Long-Horizon Optimal Control

Shankarachary Ragi, Hans D. Mittelmann

PDF

TL;DR

This paper introduces Monte-Carlo sampling methods for approximating costs in long-horizon optimal control, providing convergence analysis and demonstrating effectiveness through case studies.

Contribution

It develops and analyzes convergence properties of RS-MHP Monte-Carlo methods for cost approximation in long-horizon control problems.

Findings

01

Convergence in probability of cost approximation error as sample size increases

02

Effective application demonstrated in linear quadratic and UAV path optimization problems

03

Provides theoretical bounds on approximation accuracy

Abstract

In this paper, we develop Monte-Carlo based heuristic approaches to approximate the objective function in long horizon optimal control problems. In these approaches, to approximate the expectation operator in the objective function, we evolve the system state over multiple trajectories into the future while sampling the noise disturbances at each time-step, and find the average (or weighted average) of the costs along all the trajectories. We call these methods random sampling - multipath hypothesis propagation or RS-MHP. These methods (or variants) exist in the literature; however, the literature lacks results on how well these approximation strategies converge. This paper fills this knowledge gap to a certain extent. We derive convergence results for the cost approximation error from the RS-MHP methods and discuss their convergence (in probability) as the sample size increases. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.