Vision-Based Deep Reinforcement Learning of UAV Autonomous Navigation   Using Privileged Information

Junqiao Wang; Zhongliang Yu; Dong Zhou; Jiaqi Shi; Runran Deng

arXiv:2412.06313·cs.RO·December 10, 2024

Vision-Based Deep Reinforcement Learning of UAV Autonomous Navigation Using Privileged Information

Junqiao Wang, Zhongliang Yu, Dong Zhou, Jiaqi Shi, Runran Deng

PDF

Open Access

TL;DR

This paper introduces DPRL, a deep reinforcement learning algorithm for UAV navigation that uses privileged information during training to improve obstacle avoidance and efficiency in complex environments.

Contribution

The paper presents a novel DPRL algorithm combining privileged learning and multi-agent exploration for improved UAV autonomous navigation under partial observability.

Findings

01

DPRL outperforms existing algorithms in simulation benchmarks.

02

The use of privileged information enhances perception and robustness.

03

Multi-agent exploration accelerates training convergence.

Abstract

The capability of UAVs for efficient autonomous navigation and obstacle avoidance in complex and unknown environments is critical for applications in agricultural irrigation, disaster relief and logistics. In this paper, we propose the DPRL (Distributed Privileged Reinforcement Learning) navigation algorithm, an end-to-end policy designed to address the challenge of high-speed autonomous UAV navigation under partially observable environmental conditions. Our approach combines deep reinforcement learning with privileged learning to overcome the impact of observation data corruption caused by partial observability. We leverage an asymmetric Actor-Critic architecture to provide the agent with privileged information during training, which enhances the model's perceptual capabilities. Additionally, we present a multi-agent exploration strategy across diverse environments to accelerate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Robotics and Sensor-Based Localization · Reinforcement Learning in Robotics