Multi-UAV Formation Control with Static and Dynamic Obstacle Avoidance via Reinforcement Learning
Yuqing Xie, Chao Yu, Hongzhi Zang, Feng Gao, Wenhao Tang, Jingyi, Huang, Jiayu Chen, Botian Xu, Yi Wu, Yu Wang

TL;DR
This paper presents a reinforcement learning approach for multi-UAV formation control that effectively handles static and dynamic obstacle avoidance, demonstrating superior performance in simulation and real-world tests.
Contribution
A novel two-stage RL pipeline with reward function search and curriculum learning, enhanced by an attention-based encoder, for robust multi-UAV formation and obstacle avoidance.
Findings
Outperforms planning-based and RL baselines in collision avoidance and formation maintenance
Effective in static, dynamic, and mixed obstacle scenarios
Ablation confirms the benefits of curriculum learning and attention encoder
Abstract
This paper tackles the challenging task of maintaining formation among multiple unmanned aerial vehicles (UAVs) while avoiding both static and dynamic obstacles during directed flight. The complexity of the task arises from its multi-objective nature, the large exploration space, and the sim-to-real gap. To address these challenges, we propose a two-stage reinforcement learning (RL) pipeline. In the first stage, we randomly search for a reward function that balances key objectives: directed flight, obstacle avoidance, formation maintenance, and zero-shot policy deployment. The second stage applies this reward function to more complex scenarios and utilizes curriculum learning to accelerate policy training. Additionally, we incorporate an attention-based observation encoder to improve formation maintenance and adaptability to varying obstacle densities. Experimental results in both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms
