Distributed Deep Reinforcement Learning for Intelligent Traffic   Monitoring with a Team of Aerial Robots

Behzad Khamidehi; Elvino S. Sousa

arXiv:2107.04924·cs.RO·July 13, 2021

Distributed Deep Reinforcement Learning for Intelligent Traffic Monitoring with a Team of Aerial Robots

Behzad Khamidehi, Elvino S. Sousa

PDF

TL;DR

This paper presents a distributed deep reinforcement learning approach for aerial robots to monitor traffic in road networks, addressing stochastic and non-homogeneous traffic patterns with scalable algorithms under different communication constraints.

Contribution

It introduces a novel POMDP-based path planning framework and a scalable deep reinforcement learning algorithm for multi-robot traffic monitoring with partial and real-time communication modes.

Findings

01

Effective in real road network topology

02

Handles both real-time and limited communication scenarios

03

Reduces uncertainty in traffic monitoring

Abstract

This paper studies the traffic monitoring problem in a road network using a team of aerial robots. The problem is challenging due to two main reasons. First, the traffic events are stochastic, both temporally and spatially. Second, the problem has a non-homogeneous structure as the traffic events arrive at different locations of the road network at different rates. Accordingly, some locations require more visits by the robots compared to other locations. To address these issues, we define an uncertainty metric for each location of the road network and formulate a path planning problem for the aerial robots to minimize the network's average uncertainty. We express this problem as a partially observable Markov decision process (POMDP) and propose a distributed and scalable algorithm based on deep reinforcement learning to solve it. We consider two different scenarios depending on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.