A Hierarchical Reinforcement Learning Framework for Multi-UAV Combat Using Leader-Follower Strategy
Jinhui Pang, Jinglin He, Noureldin Mohamed Abdelaal Ahmed Mohamed,, Changqing Lin, Zhihui Zhang, Xiaoshuai Hao

TL;DR
This paper introduces a hierarchical reinforcement learning framework for multi-UAV combat that enhances cooperation and maneuverability by using a leader-follower strategy across three decision-making levels.
Contribution
It proposes a novel hierarchical framework with a leader-follower multi-agent approach to improve cooperation and high-dimensional control in multi-UAV combat scenarios.
Findings
Framework effectively improves UAV cooperation in simulations
Hierarchical levels enable better decision-making in complex environments
Leader-follower roles enhance strategic coordination
Abstract
Multi-UAV air combat is a complex task involving multiple autonomous UAVs, an evolving field in both aerospace and artificial intelligence. This paper aims to enhance adversarial performance through collaborative strategies. Previous approaches predominantly discretize the action space into predefined actions, limiting UAV maneuverability and complex strategy implementation. Others simplify the problem to 1v1 combat, neglecting the cooperative dynamics among multiple UAVs. To address the high-dimensional challenges inherent in six-degree-of-freedom space and improve cooperation, we propose a hierarchical framework utilizing the Leader-Follower Multi-Agent Proximal Policy Optimization (LFMAPPO) strategy. Specifically, the framework is structured into three levels. The top level conducts a macro-level assessment of the environment and guides execution policy. The middle level determines…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGuidance and Control Systems · Mathematical and Theoretical Epidemiology and Ecology Models · Adaptive Dynamic Programming Control
