Distributed Deep Reinforcement Learning for Intelligent Traffic Monitoring with a Team of Aerial Robots
Behzad Khamidehi, Elvino S. Sousa

TL;DR
This paper presents a distributed deep reinforcement learning approach for aerial robots to monitor traffic in road networks, addressing stochastic and non-homogeneous traffic patterns with scalable algorithms under different communication constraints.
Contribution
It introduces a novel POMDP-based path planning framework and a scalable deep reinforcement learning algorithm for multi-robot traffic monitoring with partial and real-time communication modes.
Findings
Effective in real road network topology
Handles both real-time and limited communication scenarios
Reduces uncertainty in traffic monitoring
Abstract
This paper studies the traffic monitoring problem in a road network using a team of aerial robots. The problem is challenging due to two main reasons. First, the traffic events are stochastic, both temporally and spatially. Second, the problem has a non-homogeneous structure as the traffic events arrive at different locations of the road network at different rates. Accordingly, some locations require more visits by the robots compared to other locations. To address these issues, we define an uncertainty metric for each location of the road network and formulate a path planning problem for the aerial robots to minimize the network's average uncertainty. We express this problem as a partially observable Markov decision process (POMDP) and propose a distributed and scalable algorithm based on deep reinforcement learning to solve it. We consider two different scenarios depending on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
