Synthesis from LTL with Reward Optimization in Sampled Oblivious   Environments

Jean-Fran\c{c}ois Raskin; Yun Chen Tsai

arXiv:2410.08599·cs.FL·October 14, 2024

Synthesis from LTL with Reward Optimization in Sampled Oblivious Environments

Jean-Fran\c{c}ois Raskin, Yun Chen Tsai

PDF

Open Access

TL;DR

This paper presents a method for synthesizing reactive systems that satisfy hard LTL constraints and optimize soft reward-based constraints in sampled, stochastic environments, using an SMT-based approach.

Contribution

It introduces a formal synthesis framework combining LTL constraints with reward optimization in sampled environments, proving NP-completeness and providing an SMT-based solution.

Findings

01

The synthesis problem is NP-complete.

02

The SMT-based approach effectively handles the combined constraints.

03

Case study demonstrates practical applicability.

Abstract

This paper addresses the synthesis of reactive systems that enforce hard constraints while optimizing for quality-based soft constraints. We build on recent advancements in combining reactive synthesis with example-based guidance to handle both types of constraints in stochastic, oblivious environments accessible only through sampling. Our approach constructs examples that satisfy LTL-based hard constraints while maximizing expected rewards-representing the soft constraints-on samples drawn from the environment. We formally define this synthesis problem, prove it to be NP-complete, and propose an SMT-based solution, demonstrating its effectiveness with a case study.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNumerical methods for differential equations