Funnel-based Reward Shaping for Signal Temporal Logic Tasks in   Reinforcement Learning

Naman Saxena; Gorantla Sandeep; Pushpak Jagtap

arXiv:2212.03181·eess.SY·December 5, 2023·1 cites

Funnel-based Reward Shaping for Signal Temporal Logic Tasks in Reinforcement Learning

Naman Saxena, Gorantla Sandeep, Pushpak Jagtap

PDF

Open Access

TL;DR

This paper introduces a funnel-based reward shaping method to improve reinforcement learning for continuous state space STL tasks, enabling robust satisfaction of complex temporal specifications.

Contribution

It proposes a novel, tractable reinforcement learning algorithm leveraging funnel functions to enhance STL task satisfaction in continuous environments.

Findings

01

Effective in various STL tasks across different environments

02

Improves robustness of STL satisfaction in continuous state spaces

03

Demonstrates tractability and efficiency of the proposed method

Abstract

Signal Temporal Logic (STL) is a powerful framework for describing the complex temporal and logical behaviour of the dynamical system. Numerous studies have attempted to employ reinforcement learning to learn a controller that enforces STL specifications; however, they have been unable to effectively tackle the challenges of ensuring robust satisfaction in continuous state space and maintaining tractability. In this paper, leveraging the concept of funnel functions, we propose a tractable reinforcement learning algorithm to learn a time-dependent policy for robust satisfaction of STL specification in continuous state space. We demonstrate the utility of our approach on several STL tasks using different environments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Receptor Mechanisms and Signaling · Evolutionary Algorithms and Applications