Navigation in a simplified Urban Flow through Deep Reinforcement   Learning

Federica Tonti; Jean Rabault; Ricardo Vinuesa

arXiv:2409.17922·cs.AI·September 27, 2024·2 cites

Navigation in a simplified Urban Flow through Deep Reinforcement Learning

Federica Tonti, Jean Rabault, Ricardo Vinuesa

PDF

Open Access

TL;DR

This paper develops a deep reinforcement learning approach using PPO+LSTM to enable UAVs to navigate urban environments efficiently, reducing energy use and noise by optimizing trajectories in fluid-flow simulations.

Contribution

It introduces a novel DRL algorithm for UAV navigation in urban settings, validated through fluid-flow simulations and outperforming existing algorithms in success and crash rates.

Findings

01

PPO+LSTM achieved a success rate of 98.7%.

02

The method significantly outperformed PPO and TD3 algorithms.

03

Validated on a fluid-flow simulation representing urban obstacles.

Abstract

The increasing number of unmanned aerial vehicles (UAVs) in urban environments requires a strategy to minimize their environmental impact, both in terms of energy efficiency and noise reduction. In order to reduce these concerns, novel strategies for developing prediction models and optimization of flight planning, for instance through deep reinforcement learning (DRL), are needed. Our goal is to develop DRL algorithms capable of enabling the autonomous navigation of UAVs in urban environments, taking into account the presence of buildings and other UAVs, optimizing the trajectories in order to reduce both energetic consumption and noise. This is achieved using fluid-flow simulations which represent the environment in which UAVs navigate and training the UAV as an agent interacting with an urban environment. In this work, we consider a domain domain represented by a two-dimensional flow…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvacuation and Crowd Dynamics · Traffic control and management

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Target Policy Smoothing · Clipped Double Q-learning · Dense Connections · Adam · Experience Replay · Entropy Regularization · Proximal Policy Optimization · Twin Delayed Deep Deterministic