Drone Swarm Energy Management
Michael Z. Zgurovsky, Pavlo O. Kasyanov, Liliia S. Paliichuk

TL;DR
This paper introduces an analytical framework combining POMDP and DDPG reinforcement learning for adaptive energy management and cooperative control in drone swarms operating under uncertainty, enhancing mission success and efficiency.
Contribution
It extends DDPG with belief-state representation for robust decision-making in partially observable environments, enabling scalable cognitive swarm autonomy.
Findings
Significant improvement in mission success rates
Enhanced energy efficiency over baseline methods
Robust decision-making in uncertain environments
Abstract
This note presents an analytical framework for decision-making in drone swarm systems operating under uncertainty, based on the integration of Partially Observable Markov Decision Processes (POMDP) with Deep Deterministic Policy Gradient (DDPG) reinforcement learning. The proposed approach enables adaptive control and cooperative behavior of unmanned aerial vehicles (UAVs) within a cognitive AI platform, where each agent learns optimal energy management and navigation policies from dynamic environmental states. We extend the standard DDPG architecture with a belief-state representation derived from Bayesian filtering, allowing for robust decision-making in partially observable environments. In this paper, for the Gaussian case, we numerically compare the performance of policies derived from DDPG to optimal policies for discretized versions of the original continuous problem. Simulation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Reinforcement Learning in Robotics · Distributed Control Multi-Agent Systems
