An Actor-Critic-Based UAV-BSs Deployment Method for Dynamic Environments

Zhiwei Chen; Yi Zhong; Xiaohu Ge; Yi Ma

arXiv:2002.00831·cs.NI·February 4, 2020·5 cites

An Actor-Critic-Based UAV-BSs Deployment Method for Dynamic Environments

Zhiwei Chen, Yi Zhong, Xiaohu Ge, Yi Ma

PDF

Open Access

TL;DR

This paper introduces an actor-critic deep reinforcement learning approach for real-time UAV deployment as flying base stations to optimize mobile user throughput in dynamic environments.

Contribution

It proposes a novel AC-based DRL method to efficiently determine near-optimal UAV positions in real-time, addressing the challenges of a complex, time-varying optimization problem.

Findings

01

Outperforms heuristic, sequential least-squares, and fixed UAV methods in throughput.

02

Effectively handles infinite state and action spaces.

03

Provides robust real-time UAV deployment solutions.

Abstract

In this paper, the real-time deployment of unmanned aerial vehicles (UAVs) as flying base stations (BSs) for optimizing the throughput of mobile users is investigated for UAV networks. This problem is formulated as a time-varying mixed-integer non-convex programming (MINP) problem, which is challenging to find an optimal solution in a short time with conventional optimization techniques. Hence, we propose an actor-critic-based (AC-based) deep reinforcement learning (DRL) method to find near-optimal UAV positions at every moment. In the proposed method, the process searching for the solution iteratively at a particular moment is modeled as a Markov decision process (MDP). To handle infinite state and action spaces and improve the robustness of the decision process, two powerful neural networks (NNs) are configured to evaluate the UAV position adjustments and make decisions, respectively.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUAV Applications and Optimization · Distributed Control Multi-Agent Systems · Smart Parking Systems Research