Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments   with Distributional Reinforcement Learning

Cheng Liu; Erik-Jan van Kampen; Guido C.H.E. de Croon

arXiv:2203.14749·cs.RO·September 27, 2022

Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement Learning

Cheng Liu, Erik-Jan van Kampen, Guido C.H.E. de Croon

PDF

Open Access 1 Repo

TL;DR

This paper introduces a distributional reinforcement learning approach enabling nano drones to adaptively navigate cluttered environments by estimating uncertainty and adjusting risk-tendency, improving safety and performance.

Contribution

It proposes a novel framework using tail conditional variance and EWAF to adapt risk-tendency in reinforcement learning for drone navigation.

Findings

01

Adaptive risk-tendency varies across states

02

Achieves better performance than risk-neutral or risk-averse policies

03

Effective in both simulation and real-world scenarios

Abstract

Enabling the capability of assessing risk and making risk-aware decisions is essential to applying reinforcement learning to safety-critical robots like drones. In this paper, we investigate a specific case where a nano quadcopter robot learns to navigate an apriori-unknown cluttered environment under partial observability. We present a distributional reinforcement learning framework to generate adaptive risk-tendency policies. Specifically, we propose to use lower tail conditional variance of the learnt return distribution as intrinsic uncertainty estimation, and use exponentially weighted average forecasting (EWAF) to adapt the risk-tendency in accordance with the estimated uncertainty. In simulation and real-world empirical results, we show that (1) the most effective risk-tendency vary across states, (2) the agent with adaptive risk-tendency achieves superior performance compared to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tudelft/risk-sensitive-rl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCOVID-19 epidemiological studies · Diffusion and Search Dynamics · Reinforcement Learning in Robotics