Reinforcement Learning with $\omega$-Regular Objectives and Constraints

Dominik Wagner; Leon Witzman; Luke Ong

arXiv:2511.19849·cs.AI·November 26, 2025

Reinforcement Learning with $\omega$-Regular Objectives and Constraints

Dominik Wagner, Leon Witzman, Luke Ong

PDF

Open Access 1 Video

TL;DR

This paper introduces a model-based reinforcement learning approach that optimizes complex temporal and safety-critical objectives expressed as $oldsymbol{ ext{omega}}$-regular properties, addressing limitations of scalar rewards and safety-performance trade-offs.

Contribution

It develops a linear programming-based RL algorithm for $oldsymbol{ ext{omega}}$-regular objectives with explicit constraints, ensuring safety and performance are managed separately.

Findings

01

Algorithm converges to policies maximizing $oldsymbol{ ext{omega}}$-regular satisfaction probabilities.

02

Provides a translation to constrained limit-average problems with guarantees.

03

Addresses safety-performance trade-offs in complex goal specifications.

Abstract

Reinforcement learning (RL) commonly relies on scalar rewards with limited ability to express temporal, conditional, or safety-critical goals, and can lead to reward hacking. Temporal logic expressible via the more general class of $ω$ -regular objectives addresses this by precisely specifying rich behavioural properties. Even still, measuring performance by a single scalar (be it reward or satisfaction probability) masks safety-performance trade-offs that arise in settings with a tolerable level of risk. We address both limitations simultaneously by combining $ω$ -regular objectives with explicit constraints, allowing safety requirements and optimisation targets to be treated separately. We develop a model-based RL algorithm based on linear programming, which in the limit produces a policy maximising the probability of satisfying an $ω$ -regular objective while also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Reinforcement Learning with $\omega$-Regular Objectives and Constraints· underline

Taxonomy

TopicsFormal Methods in Verification · Reinforcement Learning in Robotics · Advanced Software Engineering Methodologies