Safe Exploration Method for Reinforcement Learning under Existence of   Disturbance

Yoshihiro Okawa; Tomotake Sasaki; Hitoshi Yanami; Toru Namerikawa

arXiv:2209.15452·cs.LG·March 21, 2023

Safe Exploration Method for Reinforcement Learning under Existence of Disturbance

Yoshihiro Okawa, Tomotake Sasaki, Hitoshi Yanami, Toru Namerikawa

PDF

Open Access 1 Repo

TL;DR

This paper presents a safe exploration method for reinforcement learning that guarantees constraint satisfaction with high probability even under stochastic disturbances, using partial prior knowledge and theoretical safety guarantees.

Contribution

It introduces a novel safe exploration approach that explicitly accounts for disturbances and provides theoretical conditions for conservative input construction to ensure safety.

Findings

01

Guarantees safety with high probability under normal disturbance distributions

02

Validated the method through simulations on inverted pendulum and robot manipulator

03

Provides theoretical conditions for conservative input design

Abstract

Recent rapid developments in reinforcement learning algorithms have been giving us novel possibilities in many fields. However, due to their exploring property, we have to take the risk into consideration when we apply those algorithms to safety-critical problems especially in real environments. In this study, we deal with a safe exploration problem in reinforcement learning under the existence of disturbance. We define the safety during learning as satisfaction of the constraint conditions explicitly defined in terms of the state and propose a safe exploration method that uses partial prior knowledge of a controlled object and disturbance. The proposed method assures the satisfaction of the explicit state constraints with a pre-specified probability even if the controlled object is exposed to a stochastic disturbance following a normal distribution. As theoretical results, we introduce…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fujitsuresearch/safeexploration
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSafety Systems Engineering in Autonomy · Reinforcement Learning in Robotics · Software Reliability and Analysis Research