Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive   Navigation

Jinyoung Choi; Christopher R. Dance; Jung-eun Kim; Seulbin Hwang,; Kyung-sik Park

arXiv:2104.03111·cs.LG·April 12, 2021

Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive Navigation

Jinyoung Choi, Christopher R. Dance, Jung-eun Kim, Seulbin Hwang,, Kyung-sik Park

PDF

TL;DR

This paper introduces a risk-conditioned distributional RL algorithm for navigation that learns uncertainty-aware policies, allowing dynamic risk measure adjustments and improving safety and performance in complex environments.

Contribution

The proposed method enables risk-sensitive navigation with adaptable risk measures without retraining, addressing safety concerns in uncertain environments.

Findings

01

Outperforms baselines in safety and efficiency in navigation tasks

02

Allows runtime adaptation to different risk preferences

03

Demonstrates robustness to model inaccuracies

Abstract

Modern navigation algorithms based on deep reinforcement learning (RL) show promising efficiency and robustness. However, most deep RL algorithms operate in a risk-neutral manner, making no special attempt to shield users from relatively rare but serious outcomes, even if such shielding might cause little loss of performance. Furthermore, such algorithms typically make no provisions to ensure safety in the presence of inaccuracies in the models on which they were trained, beyond adding a cost-of-collision and some domain randomization while training, in spite of the formidable complexity of the environments in which they operate. In this paper, we present a novel distributional RL algorithm that not only learns an uncertainty-aware policy, but can also change its risk measure without expensive fine-tuning or retraining. Our method shows superior performance and safety over baselines in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.