Safe Reinforcement Learning of Robot Trajectories in the Presence of Moving Obstacles
Jonas Kiemel, Ludovic Righetti, Torsten Kr\"oger, Tamim Asfour

TL;DR
This paper introduces a reinforcement learning approach for generating collision-free robot trajectories amidst moving obstacles, utilizing a backup policy for evasive maneuvers and risk estimation methods to ensure safety in real-time applications.
Contribution
It presents a novel method combining a backup policy with risk estimation techniques to enable safe, real-time robot trajectory planning in dynamic environments.
Findings
Effective collision avoidance in deterministic environments
Reduced computational effort with data-based risk estimator
Successful real-time implementation on a physical robot
Abstract
In this paper, we present an approach for learning collision-free robot trajectories in the presence of moving obstacles. As a first step, we train a backup policy to generate evasive movements from arbitrary initial robot states using model-free reinforcement learning. When learning policies for other tasks, the backup policy can be used to estimate the potential risk of a collision and to offer an alternative action if the estimated risk is considered too high. No matter which action is selected, our action space ensures that the kinematic limits of the robot joints are not violated. We analyze and evaluate two different methods for estimating the risk of a collision. A physics simulation performed in the background is computationally expensive but provides the best results in deterministic environments. If a data-based risk estimator is used instead, the computational effort is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Robot Manipulation and Learning
