Interpretable DRL-based Maneuver Decision of UCAV Dogfight

Haoran Han; Jian Cheng; Maolong Lv

arXiv:2407.01571·cs.RO·July 3, 2024

Interpretable DRL-based Maneuver Decision of UCAV Dogfight

Haoran Han, Jian Cheng, Maolong Lv

PDF

Open Access

TL;DR

This paper introduces a DRL-based framework for UCAV dogfight maneuver decision-making, achieving high success rates and interpretability, with the agent developing advanced tactics like 'Dive and Chase'.

Contribution

It presents a three-layer UCAV dogfight framework using DDQN for maneuver selection, enhancing interpretability and tactical diversity in autonomous aerial combat.

Findings

01

Agent achieves 85.75% win rate against DT strategy.

02

The framework improves interpretability of DRL decisions.

03

Emergence of novel tactics like 'Dive and Chase'.

Abstract

This paper proposes a three-layer unmanned combat aerial vehicle (UCAV) dogfight frame where Deep reinforcement learning (DRL) is responsible for high-level maneuver decision. A four-channel low-level control law is firstly constructed, followed by a library containing eight basic flight maneuvers (BFMs). Double deep Q network (DDQN) is applied for BFM selection in UCAV dogfight, where the opponent strategy during the training process is constructed with DT. Our simulation result shows that, the agent can achieve a win rate of 85.75% against the DT strategy, and positive results when facing various unseen opponents. Based on the proposed frame, interpretability of the DRL-based dogfight is significantly improved. The agent performs yo-yo to adjust its turn rate and gain higher maneuverability. Emergence of "Dive and Chase" behavior also indicates the agent can generate a novel tactic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Target Tracking and Data Fusion in Sensor Networks · Adaptive Control of Nonlinear Systems