UAV-MARL: Multi-Agent Reinforcement Learning for Time-Critical and Dynamic Medical Supply Delivery
Islam Guven, Mehmet Parlak

TL;DR
This paper introduces a multi-agent reinforcement learning framework for coordinating UAV fleets to efficiently deliver medical supplies during emergencies, demonstrating improved performance over other methods using real-world geographic data.
Contribution
It presents a novel MARL framework using PPO for UAV coordination in stochastic medical delivery scenarios, addressing partial observability and scalability challenges.
Findings
Classical PPO outperforms asynchronous variants in coordination tasks.
The framework effectively prioritizes urgent medical requests.
Real-world data validates the approach's practicality.
Abstract
Unmanned aerial vehicles (UAVs) are increasingly used to support time-critical medical supply delivery, providing rapid and flexible logistics during emergencies and resource shortages. However, effective deployment of UAV fleets requires coordination mechanisms capable of prioritizing medical requests, allocating limited aerial resources, and adapting delivery schedules under uncertain operational conditions. This paper presents a multi-agent reinforcement learning (MARL) framework for coordinating UAV fleets in stochastic medical delivery scenarios where requests vary in urgency, location, and delivery deadlines. The problem is formulated as a partially observable Markov decision process (POMDP) in which UAV agents maintain awareness of medical delivery demands while having limited visibility of other agents due to communication and localization constraints. The proposed framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Transportation and Mobility Innovations · Vehicle Routing Optimization Methods
