Light Aircraft Game : Basic Implementation and training results analysis

Hanzhong Cao

arXiv:2506.14164·cs.LG·June 18, 2025

Light Aircraft Game : Basic Implementation and training results analysis

Hanzhong Cao

PDF

Open Access 1 Repo

TL;DR

This paper explores multi-agent reinforcement learning in a combat environment called LAG, comparing hierarchical on-policy and off-policy algorithms to understand their training stability and coordination capabilities.

Contribution

It introduces a detailed combat environment setup and evaluates two algorithms, HAPPO and HASAC, highlighting their strengths in different scenarios.

Findings

01

HASAC performs well in simple coordination tasks

02

HAPPO shows better adaptability in complex missile combat

03

Insights into on-policy vs off-policy trade-offs in MARL

Abstract

This paper investigates multi-agent reinforcement learning (MARL) in a partially observable, cooperative-competitive combat environment known as LAG. We describe the environment's setup, including agent actions, hierarchical controls, and reward design across different combat modes such as No Weapon and ShootMissile. Two representative algorithms are evaluated: HAPPO, an on-policy hierarchical variant of PPO, and HASAC, an off-policy method based on soft actor-critic. We analyze their training stability, reward progression, and inter-agent coordination capabilities. Experimental results show that HASAC performs well in simpler coordination tasks without weapons, while HAPPO demonstrates stronger adaptability in more dynamic and expressive scenarios involving missile combat. These findings provide insights into the trade-offs between on-policy and off-policy methods in multi-agent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xiaxiaoguang/lightaircraftgame
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAir Traffic Management and Optimization · Aerospace and Aviation Technology

MethodsProximal Policy Optimization