A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV   Air-to-Air Combat

Jiajun Chai; Wenzhang Chen; Yuanheng Zhu; Zong-xin Yao; Dongbin Zhao

arXiv:2212.03830·cs.AI·September 21, 2024·1 cites

A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat

Jiajun Chai, Wenzhang Chen, Yuanheng Zhu, Zong-xin Yao, Dongbin Zhao

PDF

Open Access

TL;DR

This paper introduces a hierarchical deep reinforcement learning framework for 6-DOF UCAV air-to-air combat, dividing decision-making into macro strategy and micro control loops trained with PPO and self-play.

Contribution

It presents a novel hierarchical RL approach with separate loops for strategy and control, using PPO and self-play to enhance combat performance in complex 6-DOF scenarios.

Findings

01

Inner loop controller outperforms PID in tracking accuracy.

02

Outer loop strategy achieves higher winning rates through evolving maneuvers.

03

Hierarchical framework effectively manages complex combat dynamics.

Abstract

Unmanned combat air vehicle (UCAV) combat is a challenging scenario with continuous action space. In this paper, we propose a general hierarchical framework to resolve the within-vision-range (WVR) air-to-air combat problem under 6 dimensions of degree (6-DOF) dynamics. The core idea is to divide the whole decision process into two loops and use reinforcement learning (RL) to solve them separately. The outer loop takes into account the current combat situation and decides the expected macro behavior of the aircraft according to a combat strategy. Then the inner loop tracks the macro behavior with a flight controller by calculating the actual input signals for the aircraft. We design the Markov decision process for both the outer loop strategy and inner loop controller, and train them by proximal policy optimization (PPO) algorithm. For the inner loop controller, we design an effective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGuidance and Control Systems · Aerospace and Aviation Technology