Robustness to Model Approximation, Model Learning From Data, and Sample Complexity in Wasserstein Regular MDPs

Yichen Zhou; Yanglei Song; Serdar Y\"uksel

arXiv:2410.14116·eess.SY·March 10, 2026

Robustness to Model Approximation, Model Learning From Data, and Sample Complexity in Wasserstein Regular MDPs

Yichen Zhou, Yanglei Song, Serdar Y\"uksel

PDF

Open Access

TL;DR

This paper analyzes how Wasserstein model approximation affects the robustness of stochastic optimal control policies, providing bounds on performance loss and sample complexity in empirical model learning scenarios.

Contribution

It establishes robustness bounds for control policies under Wasserstein approximation and connects these to empirical learning and disturbance estimation.

Findings

01

Performance loss is bounded by Wasserstein-1 distance between models.

02

Sample complexity bounds are derived for empirical model learning.

03

Robustness results apply to both discounted and average-cost criteria.

Abstract

The paper studies the robustness properties of discrete-time stochastic optimal control under Wasserstein model approximation for both discounted-cost and average-cost criteria. Specifically, we study the performance loss when applying an optimal policy designed for an approximate model to the true dynamics compared with the optimal cost for the true model under the sup-norm-induced metric, and relate it to the Wasserstein-1 distance between the approximate and true transition kernels. A primary motivation of this analysis is empirical model learning, as well as empirical noise distribution learning, where Wasserstein convergence holds under mild conditions but stronger convergence criteria, such as total variation, may not. We discuss applications of the results to the disturbance estimation problem, where sample complexity bounds are given, and also to a general empirical model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning