Navigation in a simplified Urban Flow through Deep Reinforcement Learning
Federica Tonti, Jean Rabault, Ricardo Vinuesa

TL;DR
This paper develops a deep reinforcement learning approach using PPO+LSTM to enable UAVs to navigate urban environments efficiently, reducing energy use and noise by optimizing trajectories in fluid-flow simulations.
Contribution
It introduces a novel DRL algorithm for UAV navigation in urban settings, validated through fluid-flow simulations and outperforming existing algorithms in success and crash rates.
Findings
PPO+LSTM achieved a success rate of 98.7%.
The method significantly outperformed PPO and TD3 algorithms.
Validated on a fluid-flow simulation representing urban obstacles.
Abstract
The increasing number of unmanned aerial vehicles (UAVs) in urban environments requires a strategy to minimize their environmental impact, both in terms of energy efficiency and noise reduction. In order to reduce these concerns, novel strategies for developing prediction models and optimization of flight planning, for instance through deep reinforcement learning (DRL), are needed. Our goal is to develop DRL algorithms capable of enabling the autonomous navigation of UAVs in urban environments, taking into account the presence of buildings and other UAVs, optimizing the trajectories in order to reduce both energetic consumption and noise. This is achieved using fluid-flow simulations which represent the environment in which UAVs navigate and training the UAV as an agent interacting with an urban environment. In this work, we consider a domain domain represented by a two-dimensional flow…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvacuation and Crowd Dynamics · Traffic control and management
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Target Policy Smoothing · Clipped Double Q-learning · Dense Connections · Adam · Experience Replay · Entropy Regularization · Proximal Policy Optimization · Twin Delayed Deep Deterministic
