# Enhanced intelligent train operation algorithms for metro train based on expert system and deep reinforcement learning

**Authors:** Yunhu Huang, Wenzhu Lai, Dewang Chen, Geng Lin, Jiateng Yin, Qing-Chang Lu, Qing-Chang Lu, Qing-Chang Lu, Qing-Chang Lu

PMC · DOI: 10.1371/journal.pone.0323478 · PLOS One · 2025-05-21

## TL;DR

This paper introduces enhanced intelligent train operation algorithms that combine expert systems and deep reinforcement learning to improve energy efficiency and passenger comfort in metro trains.

## Contribution

The novel contribution is the integration of expert systems with PPO-based reinforcement learning to optimize train operation without relying on offline speed profiles.

## Key findings

- EITO algorithms outperform existing intelligent and manual driving methods in energy consumption and passenger comfort.
- The EITOP algorithm shows the best performance in tests involving complex track conditions.
- The proposed DMTD method increases coasting distances and reduces energy use.

## Abstract

In recent decades, automatic train operation (ATO) systems have been gradually adopted by many metro systems, primarily due to their cost-effectiveness and practicality. However, a critical examination reveals computational constraints, adaptability to unforeseen conditions and multi-objective balancing that our research aims to address. In this paper, expert knowledge is combined with deep reinforcement learning algorithm (Proximal Policy Optimization, PPO) and two enhanced intelligent train operation algorithms (EITO) are proposed. The first algorithm, EITOE, is based on an expert system containing expert rules and a heuristic expert inference method. On the basis of EITOE, we propose EITOP algorithm using the PPO algorithm to optimize multiple objectives by designing reinforcement learning strategies, rewards, and value functions. We also develop the double minimal-time distribution (DMTD) calculation method in the EITO implementation to achieve longer coasting distances and further optimize the energy consumption. Compared with previous works, EITO enables the control of continuous train operation without reference to offline speed profiles and optimizes several key performance indicators online. Finally, we conducted comparative tests of the manual driving, intelligent driving algorithm (ITOR, STON), and the algorithms proposed in this paper, EITO, using real line data from the Yizhuang Line of Beijing Metro (YLBS). The test results show that the EITO outperform the current intelligent driving algorithms and manual driving in terms of energy consumption and passengers’ comfort. In addition, we further validated the robustness of EITO by selecting some complex lines with speed limits, gradients and different running times for testing on the YLBS. Overall, the EITOP algorithm has the best performance.

## Full-text entities

- **Diseases:** PPO (MESH:D014897), ATO (MESH:D000095027), MTD (MESH:C537356), SD (MESH:D012735), EITO (MESH:C564835)
- **Chemicals:** D (MESH:D003903), DKZ32 (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12094749/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12094749/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/PMC12094749/full.md

---
Source: https://tomesphere.com/paper/PMC12094749