Risk Conditioned Neural Motion Planning

Xin Huang; Meng Feng; Ashkan Jasour; Guy Rosman; Brian Williams

arXiv:2108.01851·cs.LG·August 5, 2021

Risk Conditioned Neural Motion Planning

Xin Huang, Meng Feng, Ashkan Jasour, Guy Rosman, Brian Williams

PDF

Open Access 1 Repo

TL;DR

This paper introduces a risk-conditioned deep reinforcement learning approach for motion planning that efficiently produces risk-bounded plans with adjustable risk levels, outperforming traditional methods in complex scenarios.

Contribution

It extends the soft actor critic model with a risk critic to accurately estimate execution risk and allows dynamic risk level adjustment during planning.

Findings

01

Outperforms mathematical programming baseline in speed and plan quality

02

Handles nonlinear dynamics and larger state spaces effectively

03

Provides adjustable risk bounds for flexible planning

Abstract

Risk-bounded motion planning is an important yet difficult problem for safety-critical tasks. While existing mathematical programming methods offer theoretical guarantees in the context of constrained Markov decision processes, they either lack scalability in solving larger problems or produce conservative plans. Recent advances in deep reinforcement learning improve scalability by learning policy networks as function approximators. In this paper, we propose an extension of soft actor critic model to estimate the execution risk of a plan through a risk critic and produce risk-bounded policies efficiently by adding an extra risk term in the loss function of the policy network. We define the execution risk in an accurate form, as opposed to approximating it through a summation of immediate risks at each time step that leads to conservative plans. Our proposed model is conditioned on a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cyrushx/risk_sac
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Robotic Path Planning Algorithms

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Experience Replay · Dense Connections · Adam · Soft Actor Critic