Stochastic Approximation for Nonlinear Discrete Stochastic Control: Finite-Sample Bounds

Hoang Huy Nguyen; Siva Theja Maguluri

arXiv:2304.11854·math.OC·September 3, 2025·1 cites

Stochastic Approximation for Nonlinear Discrete Stochastic Control: Finite-Sample Bounds

Hoang Huy Nguyen, Siva Theja Maguluri

PDF

Open Access

TL;DR

This paper develops finite-sample convergence bounds for a nonlinear stochastic control system using stochastic approximation and Lyapunov functions, providing practical guarantees for control policy performance.

Contribution

It introduces a method to derive finite-sample bounds for nonlinear stochastic control systems using Lyapunov functions and smoothing techniques, extending existing convergence results.

Findings

01

Finite-sample bounds are established for four cases based on Lyapunov function properties.

02

The bounds depend on exponential or sub-exponential convergence rates.

03

Numerical experiments validate the theoretical convergence rates.

Abstract

We consider a nonlinear discrete stochastic control system, and our goal is to design a feedback control policy in order to lead the system to a prespecified state. We adopt a stochastic approximation viewpoint of this problem. It is known that by solving the corresponding continuous-time deterministic system, and using the resulting feedback control policy, one ensures almost sure convergence to the prespecified state in the discrete system. In this paper, we adopt such a control mechanism and provide its finite-sample convergence bounds whenever a Lyapunov function is known for the continuous system. In particular, we consider four cases based on whether the Lyapunov function for the continuous system gives exponential or sub-exponential rates and based on whether it is smooth or not. We provide the finite-time bounds in all cases. Our proof relies on constructing a Lyapunov function…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Markov Chains and Monte Carlo Methods · Stochastic processes and financial applications