Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive Navigation
Jinyoung Choi, Christopher R. Dance, Jung-eun Kim, Seulbin Hwang,, Kyung-sik Park

TL;DR
This paper introduces a risk-conditioned distributional RL algorithm for navigation that learns uncertainty-aware policies, allowing dynamic risk measure adjustments and improving safety and performance in complex environments.
Contribution
The proposed method enables risk-sensitive navigation with adaptable risk measures without retraining, addressing safety concerns in uncertain environments.
Findings
Outperforms baselines in safety and efficiency in navigation tasks
Allows runtime adaptation to different risk preferences
Demonstrates robustness to model inaccuracies
Abstract
Modern navigation algorithms based on deep reinforcement learning (RL) show promising efficiency and robustness. However, most deep RL algorithms operate in a risk-neutral manner, making no special attempt to shield users from relatively rare but serious outcomes, even if such shielding might cause little loss of performance. Furthermore, such algorithms typically make no provisions to ensure safety in the presence of inaccuracies in the models on which they were trained, beyond adding a cost-of-collision and some domain randomization while training, in spite of the formidable complexity of the environments in which they operate. In this paper, we present a novel distributional RL algorithm that not only learns an uncertainty-aware policy, but can also change its risk measure without expensive fine-tuning or retraining. Our method shows superior performance and safety over baselines in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
