A Hierarchical Reinforcement Learning Framework for Multi-UAV Combat   Using Leader-Follower Strategy

Jinhui Pang; Jinglin He; Noureldin Mohamed Abdelaal Ahmed Mohamed,; Changqing Lin; Zhihui Zhang; Xiaoshuai Hao

arXiv:2501.13132·cs.MA·January 24, 2025

A Hierarchical Reinforcement Learning Framework for Multi-UAV Combat Using Leader-Follower Strategy

Jinhui Pang, Jinglin He, Noureldin Mohamed Abdelaal Ahmed Mohamed,, Changqing Lin, Zhihui Zhang, Xiaoshuai Hao

PDF

Open Access

TL;DR

This paper introduces a hierarchical reinforcement learning framework for multi-UAV combat that enhances cooperation and maneuverability by using a leader-follower strategy across three decision-making levels.

Contribution

It proposes a novel hierarchical framework with a leader-follower multi-agent approach to improve cooperation and high-dimensional control in multi-UAV combat scenarios.

Findings

01

Framework effectively improves UAV cooperation in simulations

02

Hierarchical levels enable better decision-making in complex environments

03

Leader-follower roles enhance strategic coordination

Abstract

Multi-UAV air combat is a complex task involving multiple autonomous UAVs, an evolving field in both aerospace and artificial intelligence. This paper aims to enhance adversarial performance through collaborative strategies. Previous approaches predominantly discretize the action space into predefined actions, limiting UAV maneuverability and complex strategy implementation. Others simplify the problem to 1v1 combat, neglecting the cooperative dynamics among multiple UAVs. To address the high-dimensional challenges inherent in six-degree-of-freedom space and improve cooperation, we propose a hierarchical framework utilizing the Leader-Follower Multi-Agent Proximal Policy Optimization (LFMAPPO) strategy. Specifically, the framework is structured into three levels. The top level conducts a macro-level assessment of the environment and guides execution policy. The middle level determines…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGuidance and Control Systems · Mathematical and Theoretical Epidemiology and Ecology Models · Adaptive Dynamic Programming Control