Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP
Amin Falah, Shibashis Guha, Ashutosh Trivedi

TL;DR
This paper introduces a method to automatically translate omega-regular specifications into scalar rewards for reinforcement learning in continuous-time Markov decision processes, enabling effective learning of complex temporal objectives.
Contribution
It provides the first automatic translation technique for omega-regular objectives in CTMDPs, facilitating model-free RL with dense-time specifications.
Findings
Effective translation to scalar rewards demonstrated on benchmarks.
Improved RL performance on omega-regular objectives in CTMDPs.
Two semantics for dense-time objectives successfully implemented.
Abstract
Continuous-time Markov decision processes (CTMDPs) are canonical models to express sequential decision-making under dense-time and stochastic environments. When the stochastic evolution of the environment is only available via sampling, model-free reinforcement learning (RL) is the algorithm-of-choice to compute optimal decision sequence. RL, on the other hand, requires the learning objective to be encoded as scalar reward signals. Since doing such translations manually is both tedious and error-prone, a number of techniques have been proposed to translate high-level objectives (expressed in logic or automata formalism) to scalar rewards for discrete-time Markov decision processes (MDPs). Unfortunately, no automatic translation exists for CTMDPs. We consider CTMDP environments against the learning objectives expressed as omega-regular languages. Omega-regular languages generalize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification · Advanced Software Engineering Methodologies · Receptor Mechanisms and Signaling
