Deep reinforcement learning under signal temporal logic constraints   using Lagrangian relaxation

Junya Ikemoto; Toshimitsu Ushio

arXiv:2201.08504·stat.ML·November 22, 2022·1 cites

Deep reinforcement learning under signal temporal logic constraints using Lagrangian relaxation

Junya Ikemoto, Toshimitsu Ushio

PDF

Open Access

TL;DR

This paper introduces a novel deep reinforcement learning approach that incorporates signal temporal logic constraints using a Lagrangian relaxation method, enabling time-sensitive control tasks with constraints.

Contribution

It formulates STL-constrained control as a new extended CMDP and proposes a two-phase constrained DRL algorithm to handle these constraints effectively.

Findings

01

Demonstrates successful learning performance in simulations.

02

Effectively incorporates STL constraints into DRL.

03

Shows potential for time-sensitive control applications.

Abstract

Deep reinforcement learning (DRL) has attracted much attention as an approach to solve optimal control problems without mathematical models of systems. On the other hand, in general, constraints may be imposed on optimal control problems. In this study, we consider the optimal control problems with constraints to complete temporal control tasks. We describe the constraints using signal temporal logic (STL), which is useful for time sensitive control tasks since it can specify continuous signals within bounded time intervals. To deal with the STL constraints, we introduce an extended constrained Markov decision process (CMDP), which is called a $τ$ -CMDP. We formulate the STL-constrained optimal control problem as the $τ$ -CMDP and propose a two-phase constrained DRL algorithm using the Lagrangian relaxation method. Through simulations, we also demonstrate the learning performance of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Neural Networks and Reservoir Computing