Reinforcement Learning for Omega-Regular Specifications on   Continuous-Time MDP

Amin Falah; Shibashis Guha; Ashutosh Trivedi

arXiv:2303.09528·cs.LG·March 17, 2023·1 cites

Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP

Amin Falah, Shibashis Guha, Ashutosh Trivedi

PDF

Open Access

TL;DR

This paper introduces a method to automatically translate omega-regular specifications into scalar rewards for reinforcement learning in continuous-time Markov decision processes, enabling effective learning of complex temporal objectives.

Contribution

It provides the first automatic translation technique for omega-regular objectives in CTMDPs, facilitating model-free RL with dense-time specifications.

Findings

01

Effective translation to scalar rewards demonstrated on benchmarks.

02

Improved RL performance on omega-regular objectives in CTMDPs.

03

Two semantics for dense-time objectives successfully implemented.

Abstract

Continuous-time Markov decision processes (CTMDPs) are canonical models to express sequential decision-making under dense-time and stochastic environments. When the stochastic evolution of the environment is only available via sampling, model-free reinforcement learning (RL) is the algorithm-of-choice to compute optimal decision sequence. RL, on the other hand, requires the learning objective to be encoded as scalar reward signals. Since doing such translations manually is both tedious and error-prone, a number of techniques have been proposed to translate high-level objectives (expressed in logic or automata formalism) to scalar rewards for discrete-time Markov decision processes (MDPs). Unfortunately, no automatic translation exists for CTMDPs. We consider CTMDP environments against the learning objectives expressed as omega-regular languages. Omega-regular languages generalize…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Advanced Software Engineering Methodologies · Receptor Mechanisms and Signaling