A Policy Search Method For Temporal Logic Specified Reinforcement Learning Tasks

Xiao Li; Yao Ma; Calin Belta

arXiv:1709.09611 · cs.AI · September 28, 2017 · 2 cites

Open Access

TL;DR

This paper introduces a model-free reinforcement learning method called TLPS that uses temporal logic to specify tasks, enabling policies to satisfy complex temporal constraints without manual reward tuning.

Contribution

The paper presents temporal logic policy search (TLPS), a model-free method that learns a policy directly from a TL specification, using the formula's real-valued measure of satisfaction in place of a hand-crafted reward function.

Findings

01 · TLPS effectively learns policies satisfying temporal logic specifications.

02 · The approach reduces the need for manual reward engineering.

03 · Experimental results demonstrate the method's viability in simulated tasks.

Abstract

Reward engineering is an important aspect of reinforcement learning. Whether the user's intentions are correctly encapsulated in the reward function can significantly affect the learning outcome. Current methods rely on manually crafted reward functions that often require parameter tuning to obtain the desired behavior. This tuning can be expensive when exploration requires systems to interact with the physical world. In this paper, we explore the use of temporal logic (TL) to specify tasks in reinforcement learning. A TL formula can be translated to a real-valued function that measures its level of satisfaction against a trajectory. We take advantage of this function and propose temporal logic policy search (TLPS), a model-free learning technique that finds a policy that satisfies the TL specification. A set of simulated experiments is conducted to evaluate the proposed…
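
The abstract's statement that a TL formula can be translated to a real-valued function over a trajectory can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a common min/max style of quantitative semantics over a finite, one-dimensional trajectory, and all names here (predicate_gt, predicate_lt, eventually, always, conj, spec) are hypothetical helpers chosen for illustration. A positive value means the trajectory satisfies the formula, a negative value means it violates it, and the magnitude indicates by how much.

```python
from typing import Callable, Sequence

# Assumption: a trajectory is a finite sequence of scalar states.
Trajectory = Sequence[float]
Robustness = Callable[[Trajectory], float]

def predicate_gt(threshold: float) -> Robustness:
    # "state > threshold", evaluated at the first state of the (sub)trajectory
    return lambda traj: traj[0] - threshold

def predicate_lt(threshold: float) -> Robustness:
    # "state < threshold", evaluated at the first state of the (sub)trajectory
    return lambda traj: threshold - traj[0]

def eventually(phi: Robustness) -> Robustness:
    # Best satisfaction of phi over all suffixes of the trajectory
    return lambda traj: max(phi(traj[t:]) for t in range(len(traj)))

def always(phi: Robustness) -> Robustness:
    # Worst satisfaction of phi over all suffixes of the trajectory
    return lambda traj: min(phi(traj[t:]) for t in range(len(traj)))

def conj(a: Robustness, b: Robustness) -> Robustness:
    # A conjunction is only as satisfied as its weaker conjunct
    return lambda traj: min(a(traj), b(traj))

# Illustrative task: "eventually exceed 1.0 and always stay below 2.0"
spec = conj(eventually(predicate_gt(1.0)), always(predicate_lt(2.0)))

good = [0.0, 0.5, 1.5, 1.2]  # exceeds 1.0 and never reaches 2.0
bad = [0.0, 0.5, 0.8, 0.9]   # never exceeds 1.0

print(spec(good) > 0)  # satisfied
print(spec(bad) < 0)   # violated
```

A scalar of this kind, computed once per rollout, is the sort of signal a policy search method could maximize instead of a hand-tuned reward; how TLPS actually uses it is described in the paper itself.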

Taxonomy

Topics: Reinforcement Learning in Robotics · Software Engineering Research · Advanced Software Engineering Methodologies