NavRL: Learning Safe Flight in Dynamic Environments
Zhefan Xu, Xinming Han, Haoyu Shen, Hanyu Jin, Kenji Shimada

TL;DR
NavRL introduces a deep reinforcement learning framework for UAV navigation that ensures safety in dynamic environments, leveraging simulation-to-real transfer and a safety shield to minimize collisions.
Contribution
The paper presents NavRL, a novel RL-based navigation method with a safety shield and simulation acceleration, enabling safe real-world UAV flight in dynamic cluttered spaces.
Findings
NavRL achieves fewer collisions than benchmark methods.
The safety shield effectively reduces failure rates.
Simulation training accelerates convergence and transferability.
Abstract
Safe flight in dynamic environments requires unmanned aerial vehicles (UAVs) to make effective decisions when navigating cluttered spaces with moving obstacles. Traditional approaches often decompose decision-making into hierarchical modules for prediction and planning. Although these handcrafted systems can perform well in specific settings, they might fail if environmental conditions change and often require careful parameter tuning. Additionally, their solutions could be suboptimal due to the use of inaccurate mathematical model assumptions and simplifications aimed at achieving computational efficiency. To overcome these limitations, this paper introduces the NavRL framework, a deep reinforcement learning-based navigation method built on the Proximal Policy Optimization (PPO) algorithm. NavRL utilizes our carefully designed state and action representations, allowing the learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Human Pose and Action Recognition · Reinforcement Learning in Robotics
