Scaffolding Reflection in Reinforcement Learning Framework for   Confinement Escape Problem

Nishant Mohanty; Suresh Sundaram

arXiv:2011.06764·cs.RO·April 20, 2021·1 cites

Scaffolding Reflection in Reinforcement Learning Framework for Confinement Escape Problem

Nishant Mohanty, Suresh Sundaram

PDF

Open Access

TL;DR

This paper introduces SR2L, a reinforcement learning framework with scaffolding reflection, enabling evaders to escape confinement regions more efficiently by improving convergence and performance over traditional methods.

Contribution

The paper presents a novel SR2L framework that integrates scaffolding reflection with actor-critic reinforcement learning to enhance escape strategies in confinement problems.

Findings

01

SR2L converges faster than IAC.

02

SR2L achieves higher rewards in simulations.

03

Outperforms baseline motion planner.

Abstract

In this paper, a novel Scaffolding Reflection in Reinforcement Learning (SR2L) is proposed for solving the confinement escape problem (CEP). In CEP, an evader's objective is to attempt escaping a confinement region patrolled by multiple pursuers. Meanwhile, the pursuers aim to reach and capture the evader. The inverse solution for pursuers to try and capture has been extensively studied in the literature. However, the problem of evaders escaping from the region is still an open issue. The SR2L employs an actor-critic framework to enable the evader to escape the confinement region. A time-varying state representation and reward function have been developed for proper convergence. The formulation uses the sensor information about the observable environment and prior knowledge of the confinement boundary. The conventional Independent Actor-Critic (IAC) method fails to converge due to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGuidance and Control Systems · Reinforcement Learning in Robotics