Smart Commander: A Hierarchical Reinforcement Learning Framework for Fleet-Level PHM Decision Optimization

Yong Si; Mingfei Lu; Jing Li; Yang Hu; Guijiang Li; Yueheng Song; and Zhaokui Wang

arXiv:2604.07171·cs.LG·April 9, 2026

Smart Commander: A Hierarchical Reinforcement Learning Framework for Fleet-Level PHM Decision Optimization

Yong Si, Mingfei Lu, Jing Li, Yang Hu, Guijiang Li, Yueheng Song, and Zhaokui Wang

PDF

TL;DR

Smart Commander introduces a hierarchical reinforcement learning framework for optimizing fleet-level decision-making in military aviation PHM, effectively handling large-scale complexity and sparse feedback.

Contribution

It presents a novel HRL approach with layered reward shaping and neural networks, improving scalability and training efficiency over traditional methods.

Findings

01

Significantly reduces training time compared to monolithic DRL.

02

Outperforms rule-based baselines in fleet management tasks.

03

Demonstrates robustness and scalability in failure-prone environments.

Abstract

Decision-making in military aviation Prognostics and Health Management (PHM) faces significant challenges due to the "curse of dimensionality" in large-scale fleet operations, combined with sparse feedback and stochastic mission profiles. To address these issues, this paper proposes Smart Commander, a novel Hierarchical Reinforcement Learning (HRL) framework designed to optimize sequential maintenance and logistics decisions. The framework decomposes the complex control problem into a two-tier hierarchy: a strategic General Commander manages fleet-level availability and cost objectives, while tactical Operation Commanders execute specific actions for sortie generation, maintenance scheduling, and resource allocation. The proposed approach is validated within a custom-built, high-fidelity discrete-event simulation environment that captures the dynamics of aircraft configuration and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.