Multi-UAV Formation Control with Static and Dynamic Obstacle Avoidance   via Reinforcement Learning

Yuqing Xie; Chao Yu; Hongzhi Zang; Feng Gao; Wenhao Tang; Jingyi; Huang; Jiayu Chen; Botian Xu; Yi Wu; Yu Wang

arXiv:2410.18495·cs.RO·March 4, 2025

Multi-UAV Formation Control with Static and Dynamic Obstacle Avoidance via Reinforcement Learning

Yuqing Xie, Chao Yu, Hongzhi Zang, Feng Gao, Wenhao Tang, Jingyi, Huang, Jiayu Chen, Botian Xu, Yi Wu, Yu Wang

PDF

Open Access

TL;DR

This paper presents a reinforcement learning approach for multi-UAV formation control that effectively handles static and dynamic obstacle avoidance, demonstrating superior performance in simulation and real-world tests.

Contribution

A novel two-stage RL pipeline with reward function search and curriculum learning, enhanced by an attention-based encoder, for robust multi-UAV formation and obstacle avoidance.

Findings

01

Outperforms planning-based and RL baselines in collision avoidance and formation maintenance

02

Effective in static, dynamic, and mixed obstacle scenarios

03

Ablation confirms the benefits of curriculum learning and attention encoder

Abstract

This paper tackles the challenging task of maintaining formation among multiple unmanned aerial vehicles (UAVs) while avoiding both static and dynamic obstacles during directed flight. The complexity of the task arises from its multi-objective nature, the large exploration space, and the sim-to-real gap. To address these challenges, we propose a two-stage reinforcement learning (RL) pipeline. In the first stage, we randomly search for a reward function that balances key objectives: directed flight, obstacle avoidance, formation maintenance, and zero-shot policy deployment. The second stage applies this reward function to more complex scenarios and utilizes curriculum learning to accelerate policy training. Additionally, we incorporate an attention-based observation encoder to improve formation maintenance and adaptability to varying obstacle densities. Experimental results in both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms