Data-Driven Control via Conditional Mean Embeddings: Formal Guarantees via Uncertain MDP Abstraction

Ibon Gracia; Morteza Lahijanian

arXiv:2512.13940·eess.SY·December 17, 2025

Data-Driven Control via Conditional Mean Embeddings: Formal Guarantees via Uncertain MDP Abstraction

Ibon Gracia, Morteza Lahijanian

PDF

Open Access

TL;DR

This paper introduces a data-driven control framework that uses conditional mean embeddings and uncertain MDPs to provide formal performance guarantees for stochastic systems with unknown dynamics, validated on a temperature regulation benchmark.

Contribution

It develops a novel approach combining CMEs and UMDPs for policy synthesis with formal guarantees in uncertain, data-driven control settings.

Findings

01

Successfully learns transition kernels from data

02

Constructs finite-state UMDP abstractions with uncertainty bounds

03

Provides formal performance guarantees through robust dynamic programming

Abstract

Controlling stochastic systems with unknown dynamics and under complex specifications is specially challenging in safety-critical settings, where performance guarantees are essential. We propose a data-driven policy synthesis framework that yields formal performance guarantees for such systems using conditional mean embeddings (CMEs) and uncertain Markov decision processes (UMDPs). From trajectory data, we learn the system's transition kernel as a CME, then construct a finite-state UMDP abstraction whose transition uncertainties capture learning and discretization errors. Next, we generate a policy with formal performance bounds through robust dynamic programming. We demonstrate and empirically validate our method through a temperature regulation benchmark.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Formal Methods in Verification · Adversarial Robustness in Machine Learning