Scaffolding Reflection in Reinforcement Learning Framework for Confinement Escape Problem
Nishant Mohanty, Suresh Sundaram

TL;DR
This paper introduces SR2L, a reinforcement learning framework with scaffolding reflection, enabling evaders to escape confinement regions more efficiently by improving convergence and performance over traditional methods.
Contribution
The paper presents a novel SR2L framework that integrates scaffolding reflection with actor-critic reinforcement learning to enhance escape strategies in confinement problems.
Findings
SR2L converges faster than IAC.
SR2L achieves higher rewards in simulations.
Outperforms baseline motion planner.
Abstract
In this paper, a novel Scaffolding Reflection in Reinforcement Learning (SR2L) is proposed for solving the confinement escape problem (CEP). In CEP, an evader's objective is to attempt escaping a confinement region patrolled by multiple pursuers. Meanwhile, the pursuers aim to reach and capture the evader. The inverse solution for pursuers to try and capture has been extensively studied in the literature. However, the problem of evaders escaping from the region is still an open issue. The SR2L employs an actor-critic framework to enable the evader to escape the confinement region. A time-varying state representation and reward function have been developed for proper convergence. The formulation uses the sensor information about the observable environment and prior knowledge of the confinement boundary. The conventional Independent Actor-Critic (IAC) method fails to converge due to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGuidance and Control Systems · Reinforcement Learning in Robotics
