Funnel-based Reward Shaping for Signal Temporal Logic Tasks in Reinforcement Learning
Naman Saxena, Gorantla Sandeep, Pushpak Jagtap

TL;DR
This paper introduces a funnel-based reward shaping method to improve reinforcement learning for continuous state space STL tasks, enabling robust satisfaction of complex temporal specifications.
Contribution
It proposes a novel, tractable reinforcement learning algorithm leveraging funnel functions to enhance STL task satisfaction in continuous environments.
Findings
Effective in various STL tasks across different environments
Improves robustness of STL satisfaction in continuous state spaces
Demonstrates tractability and efficiency of the proposed method
Abstract
Signal Temporal Logic (STL) is a powerful framework for describing the complex temporal and logical behaviour of the dynamical system. Numerous studies have attempted to employ reinforcement learning to learn a controller that enforces STL specifications; however, they have been unable to effectively tackle the challenges of ensuring robust satisfaction in continuous state space and maintaining tractability. In this paper, leveraging the concept of funnel functions, we propose a tractable reinforcement learning algorithm to learn a time-dependent policy for robust satisfaction of STL specification in continuous state space. We demonstrate the utility of our approach on several STL tasks using different environments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification · Receptor Mechanisms and Signaling · Evolutionary Algorithms and Applications
