TL;DR
This paper introduces an incremental, data-driven control synthesis framework for stochastic systems, combining online learning with game-solving to efficiently update control policies as new data arrives.
Contribution
It develops a novel incremental abstraction and game-solving method that refines control policies dynamically with accumulating data, improving efficiency over traditional re-computation.
Findings
Significant computational savings demonstrated in case studies.
Monotonic updates of abstractions and winning regions.
Efficient incremental algorithm for control policy synthesis.
Abstract
We address the synthesis of control policies for unknown discrete-time stochastic dynamical systems to satisfy temporal logic objectives. We present a data-driven, abstraction-based control framework that integrates online learning with novel incremental game-solving. Under appropriate continuity assumptions, our method abstracts the system dynamics into a finite stochastic (2.5-player) game graph derived from data. Given a requirement over time on this graph, we compute the winning region -- i.e., the set of initial states from which the objective is satisfiable -- in the resulting game, together with a corresponding control policy. Our main contribution is the construction of abstractions, winning regions and control policies \emph{incrementally}, as data about the system dynamics accumulates. Concretely, our algorithm refines under- and over-approximations of reachable sets for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
