An Actor-Critic-Based UAV-BSs Deployment Method for Dynamic Environments
Zhiwei Chen, Yi Zhong, Xiaohu Ge, Yi Ma

TL;DR
This paper introduces an actor-critic deep reinforcement learning approach for real-time UAV deployment as flying base stations to optimize mobile user throughput in dynamic environments.
Contribution
It proposes a novel AC-based DRL method to efficiently determine near-optimal UAV positions in real-time, addressing the challenges of a complex, time-varying optimization problem.
Findings
Outperforms heuristic, sequential least-squares, and fixed UAV methods in throughput.
Effectively handles infinite state and action spaces.
Provides robust real-time UAV deployment solutions.
Abstract
In this paper, the real-time deployment of unmanned aerial vehicles (UAVs) as flying base stations (BSs) for optimizing the throughput of mobile users is investigated for UAV networks. This problem is formulated as a time-varying mixed-integer non-convex programming (MINP) problem, which is challenging to find an optimal solution in a short time with conventional optimization techniques. Hence, we propose an actor-critic-based (AC-based) deep reinforcement learning (DRL) method to find near-optimal UAV positions at every moment. In the proposed method, the process searching for the solution iteratively at a particular moment is modeled as a Markov decision process (MDP). To handle infinite state and action spaces and improve the robustness of the decision process, two powerful neural networks (NNs) are configured to evaluate the UAV position adjustments and make decisions, respectively.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Distributed Control Multi-Agent Systems · Smart Parking Systems Research
