Safe Reinforcement Learning of Robot Trajectories in the Presence of   Moving Obstacles

Jonas Kiemel; Ludovic Righetti; Torsten Kr\"oger; Tamim Asfour

arXiv:2411.05784·cs.RO·November 11, 2024

Safe Reinforcement Learning of Robot Trajectories in the Presence of Moving Obstacles

Jonas Kiemel, Ludovic Righetti, Torsten Kr\"oger, Tamim Asfour

PDF

Open Access 1 Repo

TL;DR

This paper introduces a reinforcement learning approach for generating collision-free robot trajectories amidst moving obstacles, utilizing a backup policy for evasive maneuvers and risk estimation methods to ensure safety in real-time applications.

Contribution

It presents a novel method combining a backup policy with risk estimation techniques to enable safe, real-time robot trajectory planning in dynamic environments.

Findings

01

Effective collision avoidance in deterministic environments

02

Reduced computational effort with data-based risk estimator

03

Successful real-time implementation on a physical robot

Abstract

In this paper, we present an approach for learning collision-free robot trajectories in the presence of moving obstacles. As a first step, we train a backup policy to generate evasive movements from arbitrary initial robot states using model-free reinforcement learning. When learning policies for other tasks, the backup policy can be used to estimate the potential risk of a collision and to offer an alternative action if the estimated risk is considered too high. No matter which action is selected, our action space ensures that the kinematic limits of the robot joints are not violated. We analyze and evaluate two different methods for estimating the risk of a collision. A physics simulation performed in the background is computationally expensive but provides the best results in deterministic environments. If a data-based risk estimator is used instead, the computational effort is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

translearn/safemotionsrisk
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Robot Manipulation and Learning