Model-Free Reinforcement Learning for Symbolic Automata-encoded Objectives
Anand Balakrishnan, Stefan Jak\v{s}i\'c, Edgar A. Aguilar, Dejan, Ni\v{c}kovi\'c, Jyotirmoy V. Deshmukh

TL;DR
This paper introduces a model-free reinforcement learning approach that uses symbolic automata to define non-sparse, potential-based rewards, improving convergence and satisfaction of high-level formal specifications in robotic path planning.
Contribution
It proposes a novel reward shaping method using symbolic automata, enhancing RL convergence and formal specification satisfaction without requiring model-based techniques.
Findings
Potential-based rewards improve RL convergence.
Automata-based rewards better satisfy formal specifications.
Method achieves higher success rates in task completion.
Abstract
Reinforcement learning (RL) is a popular approach for robotic path planning in uncertain environments. However, the control policies trained for an RL agent crucially depend on user-defined, state-based reward functions. Poorly designed rewards can lead to policies that do get maximal rewards but fail to satisfy desired task objectives or are unsafe. There are several examples of the use of formal languages such as temporal logics and automata to specify high-level task specifications for robots (in lieu of Markovian rewards). Recent efforts have focused on inferring state-based rewards from formal specifications; here, the goal is to provide (probabilistic) guarantees that the policy learned using RL (with the inferred rewards) satisfies the high-level formal specification. A key drawback of several of these techniques is that the rewards that they infer are sparse: the agent receives…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification · Machine Learning and Algorithms · Software Testing and Debugging Techniques
