Model-Free Reinforcement Learning for Symbolic Automata-encoded   Objectives

Anand Balakrishnan; Stefan Jak\v{s}i\'c; Edgar A. Aguilar; Dejan; Ni\v{c}kovi\'c; Jyotirmoy V. Deshmukh

arXiv:2202.02404·cs.AI·December 5, 2024·1 cites

Model-Free Reinforcement Learning for Symbolic Automata-encoded Objectives

Anand Balakrishnan, Stefan Jak\v{s}i\'c, Edgar A. Aguilar, Dejan, Ni\v{c}kovi\'c, Jyotirmoy V. Deshmukh

PDF

Open Access

TL;DR

This paper introduces a model-free reinforcement learning approach that uses symbolic automata to define non-sparse, potential-based rewards, improving convergence and satisfaction of high-level formal specifications in robotic path planning.

Contribution

It proposes a novel reward shaping method using symbolic automata, enhancing RL convergence and formal specification satisfaction without requiring model-based techniques.

Findings

01

Potential-based rewards improve RL convergence.

02

Automata-based rewards better satisfy formal specifications.

03

Method achieves higher success rates in task completion.

Abstract

Reinforcement learning (RL) is a popular approach for robotic path planning in uncertain environments. However, the control policies trained for an RL agent crucially depend on user-defined, state-based reward functions. Poorly designed rewards can lead to policies that do get maximal rewards but fail to satisfy desired task objectives or are unsafe. There are several examples of the use of formal languages such as temporal logics and automata to specify high-level task specifications for robots (in lieu of Markovian rewards). Recent efforts have focused on inferring state-based rewards from formal specifications; here, the goal is to provide (probabilistic) guarantees that the policy learned using RL (with the inferred rewards) satisfies the high-level formal specification. A key drawback of several of these techniques is that the rewards that they infer are sparse: the agent receives…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Machine Learning and Algorithms · Software Testing and Debugging Techniques