Maneuver Decision-Making Through Proximal Policy Optimization And Monte Carlo Tree Search
Zhang Hong-Peng

TL;DR
This paper introduces a novel approach combining proximal policy optimization and Monte Carlo tree search to enhance maneuver decision-making in air combat scenarios, overcoming limitations of traditional reinforcement learning methods.
Contribution
The paper proposes a new method integrating PPO and MCTS for maneuver decision-making, improving decision quality and training efficiency over existing RL algorithms.
Findings
Agents trained with the method make context-dependent decisions.
The approach outperforms traditional RL in maneuvering tasks.
Simulation results validate the effectiveness of the combined method.
Abstract
Maneuver decision-making can be regarded as a Markov decision process and can be address by reinforcement learning. However, original reinforcement learning algorithms can hardly solve the maneuvering decision-making problem. One reason is that agents use random actions in the early stages of training, which makes it difficult to get rewards and learn how to make effective decisions. To address this issue, a method based on proximal policy optimization and Monte Carlo tree search is proposed. The method uses proximal policy optimization to train the agent, and regards the results of air combat as targets to train the value network. Then, based on the value network and the visit count of each node, Monte Carlo tree search is used to find the actions with more expected returns than random actions, which can improve the training performance. The ablation studies and simulation experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGuidance and Control Systems · Aerospace and Aviation Technology · Military Defense Systems Analysis
