Smart Commander: A Hierarchical Reinforcement Learning Framework for Fleet-Level PHM Decision Optimization
Yong Si, Mingfei Lu, Jing Li, Yang Hu, Guijiang Li, Yueheng Song, and Zhaokui Wang

TL;DR
Smart Commander introduces a hierarchical reinforcement learning framework for optimizing fleet-level decision-making in military aviation PHM, effectively handling large-scale complexity and sparse feedback.
Contribution
It presents a novel HRL approach with layered reward shaping and neural networks, improving scalability and training efficiency over traditional methods.
Findings
Significantly reduces training time compared to monolithic DRL.
Outperforms rule-based baselines in fleet management tasks.
Demonstrates robustness and scalability in failure-prone environments.
Abstract
Decision-making in military aviation Prognostics and Health Management (PHM) faces significant challenges due to the "curse of dimensionality" in large-scale fleet operations, combined with sparse feedback and stochastic mission profiles. To address these issues, this paper proposes Smart Commander, a novel Hierarchical Reinforcement Learning (HRL) framework designed to optimize sequential maintenance and logistics decisions. The framework decomposes the complex control problem into a two-tier hierarchy: a strategic General Commander manages fleet-level availability and cost objectives, while tactical Operation Commanders execute specific actions for sortie generation, maintenance scheduling, and resource allocation. The proposed approach is validated within a custom-built, high-fidelity discrete-event simulation environment that captures the dynamics of aircraft configuration and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
